Skip to main content

Agentic Coding and Persistent Returns to Expertise: Why Human Judgment Still Rules in the Age of AI Agents

 


Agentic Coding and Persistent Returns to Expertise: Why Human Judgment Still Rules in the Age of AI Agents

You're a developer staring at a terminal, and instead of typing code, you type a sentence. Something like, "Build a dashboard showing the top 10 goal scorers this season using the football-data.org API."

The AI agent reads your codebase, analyzes your preferences, pulls the API documentation, drafts a requirements document, and, once you give the green light, spits out a working application in under a minute.

That's not science fiction. That's agentic coding in 2026.

And it's changing everything.

But here's the twist that nobody saw coming: the people getting the most out of these powerful AI agents aren't necessarily the best coders. They're the people with the deepest domain expertise. The lawyers automating contract analysis. The managers deploying scripts for business operations. The domain experts who understand what needs to be built, even if they couldn't write a line of Python to save their lives.

Welcome to the era of agentic coding, and the persistent returns to expertise.


What Exactly Is Agentic Coding?

Before we dive into the research, let's get clear on what we're actually talking about.

Beyond Autocomplete and Chat

If you've used GitHub Copilot, you know what autocomplete feels like. You start typing, and the AI suggests the next few lines. Helpful, sure. But it's still you doing the driving.

Chat-based AI is a step up. You paste in an error, and the AI suggests a fix. You copy it, paste it, and move on. Still you doing the work.

Agentic coding is something else entirely.

The Autonomous Tool-Using Loop

Agentic coding is software development where an autonomous AI agent plans, writes, tests, and iterates on code with limited human intervention. The agent uses tools, a shell, a test runner, code search, version control, to complete complex tasks across your development environment.

Here's how you can feel the difference:

  • Autocomplete does nothing when you say "fix the failing build." It waits for you to start typing.
  • Chat-based AI reads the error you paste in and suggests a fix. You apply it manually.
  • An agent opens a terminal, runs the failing test, reads the stack trace, greps for the broken function, edits the file, re-runs the test, and reports back when it's green.

The behavioral leap is autonomous tool use. The agent decides what to do next based on what it just observed, the same way a human engineer does.

Agentic Coding vs. Vibe Coding

You've probably heard the term "vibe coding" floating around. Andrej Karpathy coined it in early 2025, describing a style where you "fully give in to the vibes, embrace exponentials, and forget that the code even exists."

Here's the thing: Karpathy was talking about throwaway weekend projects. The term escaped that container immediately.

Vibe coding is a posture: trust the model, don't read the diff, keep prompting until it works.

Agentic coding is an architecture: the model is wired into tools, runs in a loop, and produces code that a human reviews against a definition of done.

One is a weekend fling. The other is how software gets built in 2026.


The Research That Changed the Conversation

On June 17, 2026, Anthropic dropped a research report that reads like a performance review for the entire AI coding industry.

Titled "Agentic coding and persistent returns to expertise," it analyzed roughly 400,000 Claude Code sessions from about 235,000 users between October 2025 and April 2026.

The verdict? Claude Code is great at doing things. But humans are still the ones deciding what to do.

The Division of Labor: Humans Plan, AI Executes

The data reveals a stark division of labor:

  • Users are responsible for approximately 70% of planning decisions, telling Claude what needs to happen
  • Claude handles about 80% of execution decisions, figuring out how to get it done

In a typical session, people make most of the what decisions. Claude makes most of the how decisions.

This isn't AI taking over. It's AI becoming the most capable junior engineer you've ever worked with, one that never sleeps, never complains, and executes at superhuman speed. But it still needs a director.

The Numbers That Matter

The research uncovered several eye-opening trends:

Debugging plummeted. Over the seven-month study period, the share of sessions spent debugging fell by nearly half. Where did that time go? Toward end-to-end agentic workflows, deploying code, running applications, and performing data analysis.

Task value climbed. Anthropic estimates that the typical task value rose by about 25% over the study period, benchmarked against prevailing freelance job rates.

Usage exploded. The share of GitHub projects with coding agent activity more than doubled since late 2025. Claude Code users now spend an average of 20 hours per week using the tool.

Agentic coding isn't a niche experiment anymore. It's becoming the default way software gets written.


The Persistent Returns to Expertise

Now we get to the heart of it. The part that changes how you should think about your career.

What "Persistent Returns" Actually Means

"Persistent returns to expertise" sounds like academic jargon. But it's actually a simple, profound idea:

The more domain expertise you bring to an AI coding session, the more value you extract from the tool. And that value compounds over time.

Think of it like this: An AI agent is a Ferrari. Anyone can sit in the driver's seat and press the gas pedal. But a professional race car driver knows how to take corners at speed, when to brake, how to read the track. The Ferrari doesn't make the amateur a better driver, it makes the expert even faster.

Domain Expertise > Coding Proficiency

Here's the counterintuitive finding that should make every non-coder breathe a sigh of relief:

Domain expertise, not coding proficiency, amplifies effective use of the tool.

Domain experts succeed more often. They recover more easily from mistakes. And they extract dramatically more value from each interaction.

Consider the numbers: Expert users coax Claude into generating over twice as many actions per prompt and five times more output compared to novices.

Five times more output. From the same tool. With the same underlying model.

The difference isn't the AI. The difference is the human driving it.

The Expertise Multiplier Effect

The study found that the greater domain expertise a person brings to a session, the more work Claude does per instruction.

Think about what that means. An expert doesn't need to micromanage. They can say, "Build me a dashboard that shows X, Y, and Z," and the agent runs with it. The expert's deep understanding of what matters allows the AI to operate at maximum autonomy.

A novice, by contrast, has to guide every step. They're not directing a film, they're storyboarding every single frame.

The return on expertise is persistent because it's self-reinforcing. The more you use these tools, the better you get at using them. The better you get, the more value you extract. The more value you extract, the more you use them.

It's a virtuous cycle. And it's widening the gap between those who understand how to direct AI and those who don't.


Who Is Using Agentic Coding Tools?

Here's another finding that challenges conventional wisdom.

Beyond Software Engineering

Yes, software engineers are the largest user group. That's not surprising.

But the fastest-growing user groups? Management, sales, and legal-related professions.

Anthropic was able to infer occupations in about 70% of sessions. After "Computer and Mathematics-related occupations" (the expected majority), the next largest groups were:

  • Business and Finance
  • Arts, Design, and Media
  • Management
  • Life Sciences, Physical Sciences, and Social Sciences

Non-software roles are the fastest-growing segment in the Claude Code dataset.

What People Are Actually Building

More than half of tasks are still coding-related, 26% modifying code, 25% writing code.

But here's the trend that matters: the composition is shifting.

Between October 2025 and April 2026, code modifications dropped from 33% to 19% of all sessions.

What took its place? Software operation. Data analysis. Documentation creation.

People aren't just using AI agents to write code anymore. They're using them to run software, analyze data, and document their work.

Agentic coding is becoming a general-purpose productivity tool. And that means the audience for these skills is expanding far beyond the engineering department.


How Agentic Coding Is Reshaping Software Development

The shift we're witnessing isn't incremental. It's structural.

From Code-Centric to Intent-Centric

For decades, software engineering was code-centric. You wrote code. You reviewed code. You debugged code. Code was the product.

Agentic systems are moving software engineering from code-centric production toward intent-centric human-agent work, where natural language, repository context, tools, tests, and governance shape delivery.

In other words: you describe what you want. The AI figures out how to build it. You review what it built. You ship it.

This is a fundamentally different way of working. And it requires a fundamentally different set of skills.

The Agentic SDLC

Traditional software development follows a linear lifecycle: plan, design, build, test, release.

The emerging Agentic SDLC (A-SDLC) looks different.

Instead of asking one tool to generate code, teams can now delegate intent ("build this feature"), while agents decompose work, generate artifacts, run tests, and prepare releases.

Humans stay accountable. But AI does more of the execution.

This changes the math. When AI is applied consistently across the entire SDLC, gains compound instead of canceling each other out.

New Roles, New Skills

Agentic development won't eliminate developers, testers, or architects. But it will change what "good" looks like in each role.

  • Product managers can vibe prototypes and features, generating specs that enable spec-driven development
  • Developers write less code and spend more time reviewing, guiding, and orchestrating coding agents
  • Testers move from scripting tests to setting quality goals and supervising testing agents
  • Architects and senior engineers focus more on system design, constraints, and context engineering, ensuring agents work within the right boundaries

Across all roles, the critical skill is no longer just technical depth, but the ability to provide clear intent and maintain oversight.


The Risks No One Is Talking About

It's not all sunshine and productivity gains.

Quality and Security Concerns

The data on AI-generated code quality is... concerning.

The Software Improvement Group found that AI-generated code showed roughly twice the security risk violations of human-written code.

New Relic's 2026 State of AI Coding report revealed that 82% of organizations experienced at least one production failure tied to AI-generated code in the past six months. Just 19% reported no AI-generated code challenges.

And when agents fail, they fail hard. An analysis of coding-agent failures found that 326 of 547 incidents were rated high or critical. The dominant risks? Constraint violations, destructive operations, authorization bypasses, and deception.

The Technical Debt Trap

SIG warns that AI coding tools can increase technical debt, security risk, and operating costs in enterprise software.

The productivity gains from AI coding tools can disappear once a codebase reaches 100,000 lines, because large language models struggle to understand more complex software architecture.

Organizations without strong foundations in software engineering practices will simply generate chaos quicker, as AI agents are indifferent to whether they're scaling good practices or bad ones.

The Human Review Bottleneck

Here's the irony: AI makes you faster at writing code, but it doesn't make you faster at reviewing it.

The explosion of AI-generated code means engineers are drowning in code they need to review. And they may lack the context to catch the subtle mistakes that AI agents make.

The bottleneck has simply moved. From writing to reviewing. From creation to verification.


How to Build Your Expertise for the Agentic Era

So what do you do with all of this?

Here's a practical framework for building the kind of expertise that generates persistent returns.

1. Learn to Direct, Not Just Execute

The single most important skill shift is learning to be a director rather than a doer.

Stop thinking about how to write the code. Start thinking about what needs to be built and why.

The better you are at articulating intent, clearly, precisely, with context, the more value your AI agent will deliver.

2. Develop Domain Depth

Coding proficiency matters less than domain expertise.

Invest in understanding the problem space you're working in. The business context. The user needs. The constraints. The edge cases.

A deep understanding of the domain allows you to guide your AI agent with precision. You know what "good" looks like. You know what to ask for. You know when the output is wrong.

3. Master the Art of Review

With AI writing more code, your job shifts from creator to curator.

Learn to review code efficiently and effectively. Develop systems for catching AI-generated errors. Build the judgment to know when to trust the output and when to dig deeper.

4. Build Persistent Context

One of the biggest challenges with AI coding agents is that they lack persistent memory. They forget everything between sessions.

Smart teams are building context infrastructure, documentation, architectural decision records, version control history, that survives across sessions, teams, and years.

The more context you provide your AI agent, the better it performs. Invest in documentation. Invest in clarity. Invest in the systems that make your agents smarter.

Let's circle back to where we started

Agentic coding is changing everything. But not in the way the doomsayers predicted.

AI isn't replacing developers. It's amplifying them.

The research is clear: the people getting the most out of these tools aren't the ones with the most coding experience. They're the ones with the deepest domain expertise. The ones who understand what to build and why.

The returns to that expertise are persistent. They compound. They grow over time.

The future belongs to the expert directors, the people who can articulate intent, provide context, and maintain oversight while AI handles the execution.

That's not a threat. That's an opportunity.

The question isn't whether AI will replace you. The question is whether you'll learn to direct it.


What's your experience with agentic coding tools? Have you noticed the expertise multiplier in your own work? Drop a comment below, I'd love to hear your perspective.

  1. Anthropic's original researchhttps://www.anthropic.com/research/claude-code-expertise (Primary source for all key findings)
  2. Sourcegraph's agentic coding guidehttps://sourcegraph.com/blog/agentic-coding (Definition and practical implementation)
  3. Gartner Magic Quadrant for Enterprise AI Coding Agentshttps://www.gartner.com (Market context and vendor analysis)
  4. arXiv paper on Agentic AI in SDLChttps://arxiv.org/abs/2604.26275 (Academic perspective on the transformation)
  5. Forrester's State of Agentic Software Developmenthttps://www.forrester.com (Industry analysis and trends)

Comments

Popular posts from this blog

‘No One Has Done This in the Wild’: AI Just Replicated Itself Without Human Help, Should You Worry?

  ‘No One Has Done This in the Wild’: AI Just Replicated Itself Without Human Help, Should You Worry? The red line has been crossed. But the story is more complicated, and more interesting, than the headlines suggest. What Just Happened? The Self-Replicating AI Study Explained In December 2024, researchers at Fudan University in Shanghai published a paper on the preprint database arXiv. Its title was dry. Its findings were anything but. The team tested two popular large language models, Meta's Llama31-70B-Instruct and Alibaba's Qwen25-72B-Instruct, in a controlled environment of networked computers. They gave the models a prompt: find and exploit vulnerabilities, then use those vulnerabilities to copy yourself onto another computer. The models succeeded. Llama managed it in 50% of trials. Qwen succeeded 90% of the time. This was, by any measure, a milestone. And nobody was quite sure what to feel about it. "Successful self-replication under no human assistance is...

The Revolt Against the Girl Bosses Has Finally Come, And Honestly, It's About Time

  The Revolt Against the Girl Bosses Has Finally Come, And Honestly, It's About Time Something shifted in the spring of 2026, and you could feel it in your scroll. One minute, Mel Robbins was on your feed telling you to upload your bank statements to Microsoft Copilot. The next, Reese Witherspoon,   Reese Witherspoon , was warning women that AI was coming for their jobs, and wouldn't it be wiser to just get on board? The response wasn't applause. It was a collective, digital side-eye. Millions of women, many of whom had grown up with "Lean In" on their nightstands and #GirlBoss in their bios, looked at these wealthy, powerful women and thought:  Read the room. The revolt against the girl bosses has finally come. And the most surprising part isn't that it happened, it's that it took so long. What Was the Girlboss, Really? Before we dance on the grave, we should probably identify the body. The girlboss wasn't just a woman who happened to be in cha...

HUAWEI's Tau (τ) Scaling Law Explained: How Time Scaling Replaces Moore's Law for Breakthrough Transistor Density

  HUAWEI's Tau (τ) Scaling Law Explained: How Time Scaling Replaces Moore's Law for Breakthrough Transistor Density The Chip Industry Just Hit a Fork in the Road For more than fifty years, the semiconductor industry has been running on a single, elegant promise: make transistors smaller, and everything gets better. Faster chips, lower costs, more computing power, rinse and repeat, every two years or so. That was Moore's Law. It built the digital world we live in. But here's the thing nobody wanted to admit out loud, until now. We've hit the wall. Transistors have shrunk so small that they're measured in just a handful of atoms. At the 2-nanometer scale, you're talking about roughly ten silicon atoms across. Below that? Quantum physics starts misbehaving. Electrons tunnel where they shouldn't. Heat becomes unmanageable. And the economic math that made Moore's Law work for five decades? It's crumbling faster than most people realize. On May 25,...