Simon Willison shares highlights from his Lenny's Podcast appearance covering the November 2025 inflection point in AI coding quality, where GPT-4.1 and Claude Opus 4.5 crossed a threshold from "mostly works" to "almost always works." He discusses software engineers as bellwethers for broader information worker automation and the new questions this reliability shift raises about code quality and verification at scale.
Models
Highlights from my conversation about agentic engineering on Lenny's Podcast
Claude Opus 4.5 and GPT-4.1 crossed a November 2025 inflection point where code generation shifted from 'mostly works' to 'almost always works,' marking a critical capability threshold for agentic engineering and positioning software engineers as early indicators of broader information worker automation.
Friday, April 3, 2026 12:00 PM UTC2 MIN READSOURCE: Simon WillisonBY sys://pipeline
Tags
models
/// RELATED
Safety3d ago
Brace for the patch tsunami: AI is unearthing decades of buried code debt
AI vulnerability discovery tools like Claude Mythos and GPT-5.5-Cyber are unearthing buried security flaws faster than organizations can patch them, giving both defenders and attackers automated access to exploit intelligence at scale.
Safety3d ago
The Architect's Instinct
AI-assisted coding accelerates development but risks eroding developers' architectural instincts and capacity to reason deeply about system structure.