The Model Is Only 10%: The Real Lesson of the New SDLC

📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent Google whitepaper argues that the core of AI-driven software development isn’t the model size but the surrounding harness and context engineering. This shift impacts how organizations should invest in AI tools and infrastructure.

A new whitepaper from Google, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, states that the most significant shift in software engineering is moving from writing code to expressing intent and trusting AI to generate software. The paper emphasizes that model size is only 10% of what determines AI behavior, with the remaining 90% rooted in the harness and context engineering surrounding the model.

The whitepaper highlights that the dominant factor influencing AI performance is not the AI model itself but the harness — including prompts, tools, rules, and observability — which shapes the AI’s output. Evidence from benchmark experiments shows that tweaking the harness can dramatically improve results, even with the same model, such as moving an agent into the top ranks by only adjusting its configuration.

Furthermore, the paper introduces the concept of context engineering, emphasizing that the quality of instructions, knowledge, and guardrails loaded into the system is more critical than prompt cleverness. It advocates for structured approaches like agent skills, which load procedural knowledge only when needed, enabling flexible, scalable AI systems.

Finally, the authors argue that the economics of AI development favor investing in the harness and context rather than constantly chasing larger models, as this approach reduces operational costs and security risks over time.

At a glance
reportWhen: published early 2026
The developmentThe whitepaper reveals that in AI-assisted software development, the model accounts for only 10% of the system’s behavior, emphasizing the importance of harness and context engineering.
The Model Is Only 10% — The New SDLC With Vibe Coding
AI Dispatch · Field Notes
Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified
Vibe Coding
Casual prompts · “does it seem to work?” · disposable code · high risk
Structured AI-Assisted
Detailed prompts + constraints · manual testing · features in real codebases
Agentic Engineering
Formal specs · automated tests + evals + CI gates · production scale · low risk
Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.
The idea worth building your strategy around
Agent = Model + Harness
~10%
HARNESS — prompts · tools · context · hooks · sandboxes · observability
MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S
Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.
“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.
The economics: it’s a token-cost problem (CapEx vs OpEx)
Vibe Coding
Low CapEx · High OpEx
Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.
Agentic Engineering
High CapEx · Low OpEx
Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.
85%
of devs use AI coding agents (51% daily)
41%
of all new code is AI-generated
~90%
of agent behavior is the harness, not the model
+19%
longer on some tasks (METR) — verification is the cost
The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.
thorstenmeyerai.com

Implications for AI Development and Investment Strategies

This shift in understanding changes how organizations should allocate resources in AI projects. Instead of prioritizing access to the largest models, companies should focus on building robust harnesses and effective context management. This approach offers a more sustainable, cost-effective path to AI-enabled software, reducing ongoing expenses and security vulnerabilities.

Harness Engineering for AI Coding Agents: Build Reliable Claude Code, Codex, and Python Agent Workflows with Guardrails, Tests, CI Gates, and Production Controls (AI Agents & MCP Series)

Harness Engineering for AI Coding Agents: Build Reliable Claude Code, Codex, and Python Agent Workflows with Guardrails, Tests, CI Gates, and Production Controls (AI Agents & MCP Series)

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Evolution of AI-Assisted Software Engineering

The whitepaper builds on the rapid adoption of AI coding agents, with reports indicating that 85% of developers use AI tools regularly, and 41% generate a significant portion of code through AI. Prior to this, emphasis was on model improvements, but recent experiments demonstrate that configuration and scaffolding have a greater impact on performance. This represents a fundamental shift from model-centric to system-centric development in AI engineering.

“The biggest shift isn’t a new language or framework; it’s moving from code to expressing intent and trusting machines to interpret that intent.”

— Addy Osmani

AI Engineering: Building Applications with Foundation Models

AI Engineering: Building Applications with Foundation Models

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Remaining Questions About Implementation and Impact

While the whitepaper provides compelling evidence that harness and context are more critical than model size, it is still unclear how organizations will practically shift their development processes at scale. The specific methods for building effective harnesses and integrating them into existing workflows are still evolving. Additionally, the long-term security implications of this approach require further investigation.

AI-Powered Observability: From Noise to Insight: Transforming How We Monitor, Detect, and Respond

AI-Powered Observability: From Noise to Insight: Transforming How We Monitor, Detect, and Respond

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps for Organizations Adopting the New SDLC

Organizations should begin evaluating their current AI development practices, focusing on improving harness design and context management. Future research and case studies will likely explore best practices for scalable implementation, and industry leaders may develop new tools to facilitate this shift. Monitoring how these strategies impact cost, security, and performance will be essential over the coming months.

Launch Your First AI Business in 20 Days: From Idea to Income: The 20-Day Blueprint to Start an AI-Powered Online Business from Scratch

Launch Your First AI Business in 20 Days: From Idea to Income: The 20-Day Blueprint to Start an AI-Powered Online Business from Scratch

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is model size less important than the harness?

The whitepaper shows that experiments with the same model but different harnesses resulted in significant performance differences, indicating that configuration, tools, and context are the primary drivers of behavior.

How can organizations improve their AI harnesses?

By investing in structured context management, including better prompts, guardrails, procedural knowledge, and observability, organizations can optimize AI performance without relying solely on larger models.

What are the economic benefits of focusing on harness and context?

Focusing on harness and context reduces token consumption, operational costs, and security risks, leading to more sustainable and cost-effective AI deployment.

Does this mean larger models are obsolete?

Not necessarily; larger models still have value, but the whitepaper emphasizes that their impact is limited without proper system configuration. The focus should shift toward optimizing the surrounding infrastructure.

What challenges might organizations face in adopting this new approach?

Implementing effective harnesses and context engineering requires new skills, processes, and tooling, which may involve significant change management and training efforts.

Source: ThorstenMeyerAI.com

You May Also Like

Intro to Zkevms: Combining Ethereum Compatibility With Privacy

Noticing the potential of zkEVMs to revolutionize blockchain scalability and privacy, discover how they seamlessly blend Ethereum compatibility with cutting-edge zero-knowledge tech.

Automating Dollar‑Cost Averaging With Defi Bots

Harness the power of DeFi bots to automate dollar-cost averaging and discover how this innovative approach can revolutionize your investment strategy.

Synthetic Media: Deepfakes, AI Art, and Ethics

How do deepfakes and AI-generated art challenge our perception of truth and creativity? Discover the ethical dilemmas lurking beneath the surface.

The Next Frontier for AI Translation: Understanding Context, Irony, and Emotion.

Great breakthroughs in AI translation hinge on mastering context, irony, and emotion—discover how future innovations will redefine linguistic understanding.