Benjamin Cane

#Bengineering 🧐

Practical notes from Benjamin Cane on distributed systems, reliability, architecture, and engineering leadership. I share most of these thoughts on LinkedIn first, then keep them here for reading later and following by RSS.

42 posts August 8, 2025 → May 28, 2026
Portrait of Benjamin Cane

Follow the feed

#Bengineering 🧐

Read the posts here, join the discussion on LinkedIn, or subscribe by RSS.

Subscribe via RSS Connect on LinkedIn

Latest Post

Portrait of Benjamin Cane
Benjamin Cane
May 28, 2026

Your coding agent is missing one thing: architectural context.

I’ve been a big believer in Architecture Decision Records (ADRs) long before coding agents came along.

Documenting decisions gives engineers context:

Why is the system designed a certain way? What constraints existed at the time? What tradeoffs were made?

That context matters. It also matters for agents.

🤖 Agents Need Context Too

Unlike human engineers, agents don’t get context from hallway conversations, shadowing others, or tribal knowledge.

They only know what you capture. The best way to capture architectural context? Write it down as a decision record—and make it accessible to agents.

The only question is, what’s the best way to make decision records accessible?

🏗️ Option 1: MCP Server

If your ADRs live in a wiki or documentation system, you can expose them through an MCP server.

This works well when documentation is spread across teams or multiple systems that need to be aggregated.

You want a unified interface for agents. MCP is a good approach, but it comes with some infrastructure overhead.

🧱 Option 2: Keep ADRs in Git

I’ve long preferred storing ADRs in Git.

It provides versioning, review workflows, automated validation, and is where engineering work happens. Storing ADRs in Git, ideally alongside your code, is the fastest way to give agents usable context.

The challenge is that architecture often spans multiple services and repositories. So many centralize their architecture into a single repository, which is not where your code lives.

🌉 Bridging the Gap

Most modern coding agents let you include additional directories or sources at runtime, either through slash commands or CLI options.

That means you can: open your codebase, include your architecture repository, and run the agent with context.

Just adding another directory gives your agent an understanding of system constraints, architecture decisions, technology choices, and surrounding systems. These are not things an agent can reliably infer from code alone.

💡 Final Thought: Why Context Matters

With architectural context, agents produce code that aligns with your system.

When engineers understand the system end to end, they make better decisions. The same applies to agents.

If you want better results, give better context.

Keep Reading

  • May 21, 2026 Health-check the listener your gRPC traffic actually uses
  • May 14, 2026 Weighted load balancing has saved me more times than I can count
  • May 7, 2026 YOLO Is a Terrible Strategy for Validating Production Changes

All Posts

  • May 28, 2026 Your coding agent is missing one thing: architectural context
  • May 21, 2026 Health-check the listener your gRPC traffic actually uses
  • May 14, 2026 Weighted load balancing has saved me more times than I can count
  • May 7, 2026 YOLO Is a Terrible Strategy for Validating Production Changes
  • April 30, 2026 Deterministic routing is one of the most effective ways distributed systems reduce consistency problems at scale
  • April 23, 2026 When you think of microservices, you probably think of centralized shared services. But there's another valid pattern that is rarely discussed
  • April 16, 2026 Are you using traffic mirroring in production? If not, try it out.
  • April 9, 2026 Agent Skills Are Becoming the Best Way to Capture Institutional Knowledge
  • April 2, 2026 Saved Prompts Are Dead. Agent Skills Are the Future.
  • March 26, 2026 Generating Code Faster Is Only Valuable If You Can Validate Every Change With Confidence
  • March 19, 2026 When You Go to Production with gRPC, Make Sure You’ve Solved Load Distribution First
  • March 12, 2026 You may be building for availability, but are you building for resiliency?
  • March 5, 2026 When your coding agent doesn’t understand your project, you’ll get junk
  • February 26, 2026 You can have 100% Code Coverage and still have ticking time bombs in your code. 💣
  • February 19, 2026 Getting More Out of Agentic Coding Tools
  • February 12, 2026 Why is Infrastructure-as-Code so important? Hint: It's correctness
  • February 5, 2026 Optimizing the team’s workflow can be more impactful than building business features
  • January 29, 2026 I follow an architecture principle I call The Law of Collective Amnesia
  • January 22, 2026 Performance testing without a target is like running a race with no finish line
  • January 15, 2026 Many teams think performance testing means throwing traffic at a system until it breaks. That approach is fine, but it misses how systems are actually stressed in the real world.
  • January 8, 2026 Pre-populating caches is a “bolt-on” cache-optimization I've used successfully in many systems. It works, but it adds complexity
  • January 1, 2026 Don't be afraid to build a tool. Just don't become too attached to it.
  • December 26, 2025 One of the toughest engineering skills to develop is accepting a decision you disagree with. 😖
  • December 19, 2025 Canary deployments are an operational superpower, but the complexity they bring isn’t for everyone.
  • December 12, 2025 Everyone has bias, yes, even you. 🫵
  • December 5, 2025 Do you use Architecture Decision Records? I’m a big fan, and I think they’re a best practice every engineering org should adopt.
  • November 28, 2025 Does resource usage within your application or database suddenly spike periodically? Does it cause system slowdown?
  • November 21, 2025 When you shut down an application instance, don't stop the listener immediately — that's how you end up with failed requests during every application rollout. 😢
  • November 14, 2025 A common issue I see when teams first adopt gRPC is managing persistent connections, especially during failovers.
  • November 7, 2025 A dangerous mindset I’ve seen—and been guilty of—is assuming code doesn't change.
  • October 31, 2025 ⚡️Does saving 1 millisecond really matter? Answer: more than you’d think.
  • October 27, 2025 Have you heard of Store and Forward? It’s a resiliency design prevalent in card & bank payments, telecommunications, and other industries.
  • October 24, 2025 When Building Low-Latency, High-Scale Systems, Push as Much Processing as Possible to Later
  • October 10, 2025 Coding is a small part of software engineering.
  • October 3, 2025 Should I be an individual contributor or a people leader?
  • September 26, 2025 Improve performance and reduce chances of request failures with this one simple trick! Avoid cross-region calls.
  • September 19, 2025 Did you know Kube-proxy doesn’t perform load-balancing itself? It’s iptables (by default).
  • September 12, 2025 You’ve heard of feature flags, but what about operational flags? ⏯️
  • September 5, 2025 A core capability for building low-latency platforms is quickly detecting and reacting to issues.
  • August 22, 2025 Sometimes when I tell people that logging can impact a microservices response time, I get strange looks. 🤨
  • August 15, 2025 How many times have you seen analytics on an operational database create issues? I’ve seen it far too often.
  • August 8, 2025 I can't count how often I've seen issues made worse by minor oversights—like not setting a timeout value. ⏱️

Practical engineering notes by Benjamin Cane.