I'll be honest: when I first started working with AI coding assistants, I thought we were on the verge of making code quality debates obsolete. Why obsess over clean architecture and test coverage when AI could just churn out whatever we needed?
I was spectacularly wrong. After spending the last year working with these tools in production environments, and seeing both the incredible wins and the painful failures, I've come to realize something fundamental:
AI doesn’t fix your problems.
AI amplifies them.
Here's what nobody tells you in the demos: AI is like handing a megaphone to your existing development practices. If your codebase is clean, modular, and well-tested, AI will help you move faster while maintaining that quality. But if you're already drowning in technical debt? AI will help you drown faster. And this isn't just something I've seen in the trenches: the Coding Personalities of Leading LLMs research found that over 90 percent of the issues in AI-generated code are plain code smells. AI isn't just writing faster; it's quietly stacking up quality debt we'll be paying down later.
The DORA 2025 Report captured this perfectly when it called AI an amplifier. I've seen it firsthand. Teams with solid internal platforms and clean architectures are shipping features at unprecedented speed. Meanwhile, teams with fragile legacy systems are generating code faster than they can validate it, leading to increased instability and chaos. Throughput goes up, sure, but so does the deployment failure rate.
It's the classic mistake of confusing local optimization with system-level improvement. Yes, I can write a function 3x faster with Copilot. But if that function breaks three other things because our architecture is a tangled mess, what have I really gained?
There's an uncomfortable truth in our industry right now: I don't actually trust the code AI generates. And I shouldn't.
According to recent surveys, while 80% of developers report productivity gains from AI, only 24% have "a lot" of trust in AI-generated code. That's a massive gap. I'm using tools that make me faster but that I fundamentally don't trust.
This means validation is absolutely critical. But here's the catch: I can't validate what I don't understand, and I can't understand messy, poorly structured code. This is where teams without strong code quality practices hit a wall. They're stuck in this endless loop of generating code they don't trust and can't properly validate.
I've adopted a "trust but verify" approach, which sounds simple but requires serious discipline. It means maintaining comprehensive test suites, insisting on clear architectural boundaries, and actually reading the code AI generates. Not just skimming it, really reading it. That only works if the surrounding codebase is comprehensible in the first place.
Working with brownfield projects has taught me that code quality isn't just nice to have, it's the safety net that keeps AI from destroying everything.
I learned this the hard way on a legacy refactoring project. I brought in AI to help modernize a complex old system, thinking it would speed things up. What I discovered is that AI agents need quality guardrails to operate safely. Without them, you're basically playing Russian roulette with your production system.
The approach that actually worked was treating the existing behavior as sacred. Before making any changes, I had the AI generate golden master tests and snapshot tests to capture what the system actually does, not what I think it does or what the sparse documentation claims it does. Only then could I safely refactor.
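To make that concrete, here's a minimal sketch of the golden master idea in Python. Everything here is a hypothetical stand-in: the "legacy" pricing function would really be imported from the old system, and the cases would be whatever inputs matter in your domain. The point is the mechanism: the first run records what the code does today, and every later run compares against that recording.

```python
# golden_master_test.py -- a minimal characterization-test sketch (pytest style).
# The "legacy" function below is a placeholder for whatever old code you are
# about to refactor; in real use it would be imported, not defined here.
import json
from pathlib import Path

import pytest

GOLDEN_FILE = Path(__file__).parent / "golden_master.json"

def legacy_calculate_total(order):  # placeholder for the real legacy code
    subtotal = sum(item["price"] * item["qty"] for item in order["items"])
    discount = 0.1 if subtotal > 100 else 0.0
    return round(subtotal * (1 - discount), 2)

# Representative inputs, including the edge cases you care about.
CASES = [
    {"items": [{"price": 10.0, "qty": 3}]},
    {"items": [{"price": 60.0, "qty": 2}]},   # crosses the discount threshold
    {"items": []},                            # empty order
]

def test_matches_golden_master():
    actual = {str(i): legacy_calculate_total(case) for i, case in enumerate(CASES)}

    if not GOLDEN_FILE.exists():
        # First run: record what the system does today and fail loudly so the
        # recording gets reviewed and committed before any refactoring starts.
        GOLDEN_FILE.write_text(json.dumps(actual, indent=2))
        pytest.fail("Golden master recorded; review and commit it, then re-run.")

    expected = json.loads(GOLDEN_FILE.read_text())
    assert actual == expected  # any behavior change after refactoring shows up here
```

Once that recording is reviewed and committed, the AI can refactor as aggressively as it likes; any drift from the recorded behavior fails the test instead of sneaking into production.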
Even then, every change required semantic validation. I spent hours doing diffs, checking edge cases, making sure the AI hadn't subtly changed behavior in some obscure scenario. And you know what? It frequently had. Not because the AI is bad, but because complex systems have complex behaviors that aren't always obvious.
There's also something subtle I noticed: style consistency matters more than I expected. When refactoring legacy code, I started instructing the AI to match the exact style of the existing modules. Why? Because inconsistency is a red flag during code review. If something looks out of place, it probably is out of place, either stylistically or functionally.
Here's what keeps me up at night: it's becoming possible to ship code nobody on your team truly understands.
I've seen projects where AI generated large chunks of the codebase, and six months later, nobody could confidently explain how certain features worked. Not because the code was bad, it often looked fine, but because no human had really owned it from the start.
This is a code quality issue masquerading as a productivity win. Sure, you shipped faster initially. But now you can't modify that code confidently, you don't know where the risks are, and your best developers are afraid to touch it. That's not technical debt, that's technical bankruptcy.
The solution isn't to avoid AI. It's to insist on human comprehension as a quality gate. If I don't understand the code AI generated, it doesn't go in. Period. This requires robust code review practices, solid testing, and clear architectural principles, all the quality practices I already know matter.
Here's the thing that took me a while to accept: AI hasn't made me less important as an engineer. If anything, it's made my judgment more visible, and more critical.
I used to think AI would eventually replace the need for deep technical expertise. That we could just prompt our way to good software. But what I've learned is that AI is brutally honest about the quality of my engineering decisions. It's like a magnifying glass on my craftsmanship.
When I try to refactor poorly structured code with AI assistance, the structural problems become obvious. The AI will follow my instructions, but if my architectural understanding is weak, it just accelerates the accumulation of accidental complexity. I've watched this happen in real-time: asking AI to "fix" a design problem I didn't fully understand, only to create something that compiles and runs but is fundamentally more fragile than what I started with.
This is what scares me about teams that lack solid fundamentals jumping headfirst into AI-assisted development. They end up with codebases full of stuff that looks professional at first glance, proper naming, decent structure, passes the linter, but lacks the deep coherence that comes from genuine understanding. That's actually more dangerous than obviously bad code, because it's harder to identify and fix.
What AI has actually replaced in my workflow are the tedious, repetitive tasks I never enjoyed anyway: writing boilerplate, converting between formats, generating test scaffolding. The grunt work. But the hard stuff, deciding on the right abstraction, knowing when to refactor versus rewrite, understanding the trade-offs in a design decision, that still requires me. Actually, it requires me more than ever, because the stakes are higher when I'm moving faster.
I've come to see it this way: AI extends good engineers and exposes weak ones. If I have strong fundamentals, AI makes me incredibly effective. If I don't, AI just helps me create a mess faster. And for organizations that treat quality as optional? AI absolutely punishes that mindset.
So where does this leave me? I think it means doubling down on fundamentals:
Automated testing isn't optional anymore. It's my validation layer for AI-generated code. If I don't have comprehensive tests, I'm flying blind; there's a short sketch of what I mean after this list.
Architecture matters more, not less. Modular, well-designed systems let AI work safely within boundaries. Monolithic chaos gives AI infinite ways to break things.
Code review is evolving. I'm not just reviewing logic anymore, I'm validating that AI made sensible decisions and that I understand what was generated.
Technical literacy is the new baseline. I need to level up my ability to quickly assess code quality and spot problems. The skill isn't writing boilerplate anymore, it's recognizing when something is wrong.
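To ground that first point about automated testing, here's a minimal sketch. The slugify function is a hypothetical stand-in for something an assistant generated; the tests are the part I write and own myself, edge cases included, before the code is allowed in.

```python
# test_slugify.py -- validating AI-generated code with tests I write and own.
# slugify() stands in for something an assistant produced; the tests are mine.
import re

def slugify(title: str) -> str:
    """AI-generated helper: turn a title into a URL-safe slug."""
    slug = re.sub(r"[^a-z0-9]+", "-", title.lower())
    return slug.strip("-")

def test_basic_title():
    assert slugify("Code Quality in the AI Era") == "code-quality-in-the-ai-era"

def test_punctuation_and_spacing():
    assert slugify("  Trust, but verify!  ") == "trust-but-verify"

def test_empty_and_symbol_only_input():
    # Edge cases are exactly where generated code tends to surprise you.
    assert slugify("") == ""
    assert slugify("???") == ""
```

Writing those assertions forces me to understand what the generated code is supposed to do, which is half the value of the exercise.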
AI is an incredible tool. It's already changed how I work, mostly for the better. But it's not magic, and it's definitely not a substitute for engineering discipline.
As someone who's been in the trenches with these tools, I can tell you: the teams that are winning with AI are the ones who already had their quality practices dialed in. They're using AI to move faster within a framework of good practices, not as a way to avoid those practices.
The teams struggling are the ones who thought AI would let them skip the hard work of building quality into their systems. It doesn't. It just makes you fail faster.
Uncle Bob was right when he said "the only way to go fast is to go well." That wisdom hits differently in the AI era. I used to think AI might finally let us choose speed over quality and get away with it. But the opposite is true: AI makes that principle more iron-clad than ever. When you're generating code at 3x speed, cutting corners doesn't give you a 3x speedup, it gives you a 3x pileup.
Code quality still matters, maybe more than ever. Because in the era of AI, quality is what separates accelerated progress from accelerated chaos. The speed is there for the taking, but only if you go well.