How do you measure the percentage of AI-written code in an engineering team?

The most practical method is commit-based measurement: divide the number of commits that carry the Co-Authored-By header by the total number of commits. Claude Code adds this header automatically when it commits, and the same convention works on both GitHub and GitLab. In a pilot with 30-40 engineers, this approach measured 65% AI-written code and needed no new tooling: scripts simply crawled existing Git history.

Why is 'lines of code accepted' a bad AI code metric in 2026?

Lines of code accepted was designed for the autocomplete era, where you'd accept or reject individual suggestions. In the agentic era, the AI codes autonomously and every line counts as accepted automatically, inflating the metric to meaninglessness. Worse, if you discard the entire output because the plan was wrong, the counter doesn't adjust. Commits are a better unit of work because they represent completed, shipped tasks.

How does the Co-Authored-By Git header track AI-generated commits on GitHub and GitLab?

When Claude Code commits changes, it automatically adds a Co-Authored-By header to the commit message with no extra steps. This metadata is then queryable through standard Git tools and APIs on both GitHub and GitLab, allowing teams to filter, count, and calculate AI contribution percentages. The header convention is the same across platforms, making it a universal tracking mechanism.

What percentage of production code is actually written by AI in 2026?

In one documented pilot with 30-40 engineers using Claude Code, the measured figure was 65% AI-written code, but follow-up conversations with engineers suggested the real number was closer to 80%. The gap came from engineers who worked entirely with Claude but committed manually out of habit, missing the tracking header. This highlights a key limitation of opt-in AI code metrics: you measure the floor, not the ceiling.

How do you track AI-generated commits across a GitHub or GitLab organization at scale?

Using scripts that crawl commit history and check for the Co-Authored-By header, you can generate reports of every commit and its AI attribution status. The approach works with both the GitHub and GitLab APIs and can be sliced by developer, team, repository, or entire organization. In one pilot, the team was preparing to scale this same methodology from 30-40 engineers to 300+ without any additional tooling.

How do you prove ROI of AI coding tools to non-technical executives?

The most convincing metric is the ratio of AI-assisted commits to total commits — it translates directly to 'what percentage of work units are AI-delivered.' Unlike vanity metrics like lines of code, commit ratios mirror how engineering teams already think about work. In a pilot that hit 65% AI-written code, the data came straight from existing version control systems with no new infrastructure, making the numbers easy for a board to trust.

What KPIs should engineering leaders use to measure developer productivity with AI coding tools?

Focus on commits as units of work rather than lines of code, which inflate meaninglessly in the agentic era. Track the ratio of AI-attributed commits to total commits, the gap between measured and self-reported AI usage, and whether the methodology scales without new tooling. In one pilot, commit-based metrics revealed that the engineering team's self-reported AI usage was 80% while the tracked metric showed 65% — the delta itself became a useful adoption signal.

How do you measure AI coding adoption metrics when engineers don't consistently use AI commit attribution?

Opt-in tracking through commit headers measures the floor of AI adoption, not the ceiling. In a pilot where 65% of commits had the Co-Authored-By header, direct conversations with engineers revealed the actual figure was closer to 80% — they were using Claude Code for everything but committing manually out of habit. Following up with engineers who have low header rates is essential for getting a complete picture of real AI coding adoption.

Does commit-based AI code measurement work better than lines of code for reporting to the board?

Yes. Lines of code is a noise metric — two developers implementing the same task produce wildly different line counts, and in the agentic era the counter inflates automatically without reflecting shipped value. Commits represent actual units of work delivered. A board-ready metric like '65% of committed work was AI-assisted' is more meaningful and defensible than any lines-of-code figure.

How do you build a dashboard to audit AI code contributions from Git history?

Query your Git hosting platform's API to extract commit metadata, specifically the Co-Authored-By header that AI tools like Claude Code add automatically. Scripts that crawl commit history can generate reports filterable by developer, team, or repository. The same methodology works on both GitHub and GitLab and requires no special tooling beyond API access. It was used to track a pilot of 30-40 engineers that was preparing to scale to 300+.

How We Measured 65% AI-Written Code (And Why Lines Don't Matter)

After publishing what we learned from our Claude Code pilot, I got a flood of questions. Not about adoption strategies or developer feedback - about measurement. How do you actually quantify "65% AI-written code"?

The answer is simpler than you'd think. But it requires letting go of a metric that's been with us for decades.

Forget Lines of Code

Most AI coding tools - Claude Code, Cursor, GitHub Copilot - give you a "Lines of Code Accepted" counter out of the box. It sounds useful. It isn't.

That metric was designed for the autocomplete era. You'd ask the AI to implement a function, it would offer a suggestion, and you'd accept or reject it. In that world, "accepted" meant something.

In the agentic world, it means nothing.

When you work with an agent, you plan together, then the agent codes autonomously - writing, building, testing, iterating - and you review the output. Every line the agent writes during that autonomous phase counts as "accepted" automatically. The metric inflates to meaninglessness.

Worse: if you throw away the entire output because the plan was wrong, that counter doesn't adjust. You could "accept" 500 lines that never ship.

Lines of code also fail for a simpler reason: they never mattered. Two developers implementing the same task can produce wildly different line counts depending on style, patterns, and preferences. Same work, different numbers. That's not measurement - that's noise.

Commits as Units of Work

We went back to basics. A commit is a unit of work. You take a task, implement it, commit it. If it's a big task, you split it into reviewable pieces - the same practice we've followed for years.

Claude Code handles this naturally. When you ask Claude to commit your changes, it writes clear commit messages and automatically adds a header (in Git terms, a trailer at the end of the commit message):

Co-Authored-By: Claude <[email protected]>

No extra steps. No friction. The metadata is just there.

From that point, measurement becomes straightforward: count the commits with the header and count all commits. The first number divided by the second is your AI-written percentage.
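Here's a minimal sketch of that calculation in Python. It isn't the script from the pilot. The function names are hypothetical, the email address in the sample data is a placeholder, and matching on a co-author name that starts with "Claude" is an assumption you'd adjust for other tools.

```python
import re

# Assumption: the trailer key is matched case-insensitively and the
# co-author name starts with "Claude". Adjust the pattern for other tools.
AI_TRAILER = re.compile(r"^co-authored-by:\s*claude\b", re.IGNORECASE | re.MULTILINE)


def has_ai_trailer(message: str) -> bool:
    """Return True if any line of the commit message is a Claude co-author trailer."""
    return AI_TRAILER.search(message) is not None


def ai_commit_share(messages: list[str]) -> float:
    """Commits with the trailer divided by all commits (0.0 for an empty list)."""
    if not messages:
        return 0.0
    attributed = sum(1 for message in messages if has_ai_trailer(message))
    return attributed / len(messages)


if __name__ == "__main__":
    # Placeholder messages; the email address is made up for illustration.
    sample = [
        "Add retry logic\n\nCo-Authored-By: Claude <placeholder@example.com>",
        "Fix typo in README",
        "Refactor parser\n\nCo-Authored-By: Claude <placeholder@example.com>",
    ]
    print(f"AI-attributed share: {ai_commit_share(sample):.0%}")
```

The denominator is all commits, not just the commits without the header, so the result reads directly as "share of work units that were AI-attributed."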

The Technical Implementation

We ran this on GitLab. I knew who was in the pilot. GitLab knows what commits each person made. Using scripts that crawled commit history - scripts Claude wrote for me, which felt appropriate - we generated a report of every commit, its message, and whether it carried the Co-Authored-By header.

The analysis from there was trivial. Filter, count, calculate percentages.
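For a rough idea of what such a crawl can look like, here's a sketch that works against a local clone with `git log` instead of the GitLab API. It's an illustration under assumptions, not the pilot's script: `CommitRecord`, `parse_git_log`, and `crawl_repo` are hypothetical names, and skipping merge commits is my choice, not something the pilot specified.

```python
import subprocess
from dataclasses import dataclass

# ASCII unit/record separators keep multi-line commit bodies parseable.
FIELD_SEP = "\x1f"
RECORD_SEP = "\x1e"
# %H = commit hash, %ae = author email, %B = raw body, %xNN = literal byte.
LOG_FORMAT = "%H%x1f%ae%x1f%B%x1e"


@dataclass
class CommitRecord:
    sha: str
    author_email: str
    message: str
    ai_attributed: bool


def is_ai_attributed(message: str) -> bool:
    """True if a line reads 'Co-Authored-By: Claude ...' (case-insensitive)."""
    for line in message.splitlines():
        key, _, value = line.partition(":")
        if key.strip().lower() == "co-authored-by" and value.strip().lower().startswith("claude"):
            return True
    return False


def parse_git_log(raw: str) -> list[CommitRecord]:
    """Turn `git log --format=LOG_FORMAT` output into one record per commit."""
    records = []
    for chunk in raw.split(RECORD_SEP):
        chunk = chunk.strip("\n")  # only newlines: the separators count as whitespace
        if not chunk:
            continue
        sha, email, message = chunk.split(FIELD_SEP, 2)
        records.append(CommitRecord(sha, email, message, is_ai_attributed(message)))
    return records


def crawl_repo(repo_path: str) -> list[CommitRecord]:
    """Read the history of a local clone. Excluding merges is an assumption."""
    result = subprocess.run(
        ["git", "-C", repo_path, "log", "--no-merges", f"--format={LOG_FORMAT}"],
        capture_output=True, encoding="utf-8", errors="replace", check=True,
    )
    return parse_git_log(result.stdout)
```

Calling `crawl_repo(".")` inside a clone returns the full report as a list, and filtering on `ai_attributed` and dividing by `len(records)` gives the percentage.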

This same approach should scale well beyond a pilot, since nothing in it depends on team size. You can slice it by person, team, repo, or the entire org. The data is already in your version control system. You just need to query it.

GitHub works the same way. The header convention is identical, so the same methodology applies there.
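Because the convention is the same on both platforms, one analysis can cover both once the API responses are put into a common shape. The sketch below assumes you have already fetched and JSON-decoded the commit lists (authentication and pagination are left out). It relies on the fields the GitHub "list commits" endpoint and the GitLab "list repository commits" endpoint return. The function names and the normalized record layout are hypothetical.

```python
from typing import Any


def is_ai_attributed(message: str) -> bool:
    """True if a line reads 'Co-Authored-By: Claude ...' (case-insensitive)."""
    for line in message.splitlines():
        key, _, value = line.partition(":")
        if key.strip().lower() == "co-authored-by" and value.strip().lower().startswith("claude"):
            return True
    return False


def from_github(item: dict[str, Any]) -> dict[str, Any]:
    """Normalize one element of GitHub's commit list response."""
    commit = item["commit"]
    return {
        "sha": item["sha"],
        "author_email": commit["author"]["email"],
        "ai_attributed": is_ai_attributed(commit["message"]),
    }


def from_gitlab(item: dict[str, Any]) -> dict[str, Any]:
    """Normalize one element of GitLab's repository commits response."""
    return {
        "sha": item["id"],
        "author_email": item["author_email"],
        "ai_attributed": is_ai_attributed(item["message"]),
    }


def ai_share(commits: list[dict[str, Any]]) -> float:
    """Fraction of normalized commits that carry the trailer."""
    if not commits:
        return 0.0
    return sum(c["ai_attributed"] for c in commits) / len(commits)
```

Once everything is in the same shape, the filter-and-count step doesn't need to know which platform a commit came from.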

The Gap Between Measured and Actual

Our pilot measured 65%. The real number was closer to 80%.

The gap came from engineers who didn't adopt the commit practice. They'd work entirely with Claude Code but commit manually out of habit, missing the header. When I followed up with them directly, they swore they couldn't go back - that every line of code during the pilot was written with Claude.

That's the limitation of any opt-in tracking: you measure the floor, not the ceiling. But 65% as a floor was more than enough to validate what we were seeing.
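One way to find where the floor and the ceiling diverge is to compute the header rate per engineer and talk to the people at the low end. This is a sketch with hypothetical names, and the 0.5 threshold is an arbitrary assumption, not a number from the pilot. A low rate doesn't mean low AI usage. It only marks someone worth asking.

```python
from collections import defaultdict


def header_rate_by_author(commits: list[tuple[str, bool]]) -> dict[str, float]:
    """Map each author to the fraction of their commits that carry the header.

    `commits` is a list of (author, ai_attributed) pairs.
    """
    counts: dict[str, list[int]] = defaultdict(lambda: [0, 0])  # [attributed, total]
    for author, attributed in commits:
        counts[author][1] += 1
        if attributed:
            counts[author][0] += 1
    return {author: hits / total for author, (hits, total) in counts.items()}


def follow_up_candidates(rates: dict[str, float], threshold: float = 0.5) -> list[str]:
    """Authors whose header rate is below the (assumed) threshold, sorted by name."""
    return sorted(author for author, rate in rates.items() if rate < threshold)


if __name__ == "__main__":
    # Made-up data for illustration only.
    sample = [
        ("ana", True), ("ana", True),
        ("ben", False), ("ben", True), ("ben", False),
        ("cy", False),
    ]
    rates = header_rate_by_author(sample)
    print(follow_up_candidates(rates))
```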

What I'd Tell Other Teams

If you're trying to measure AI adoption, start here: commits with co-author attribution, compared to total commits. It mirrors how you already work. It doesn't require new tooling. It scales.

If you have a better method, I'd genuinely like to hear it. But after running this pilot across 30-40 engineers and preparing to scale it to 300+, I haven't found one.

The metric that matters isn't how many lines the AI wrote. It's how many units of work the AI delivered. Commits capture that. Lines don't.
