Models

Highlights from my conversation about agentic engineering on Lenny's Podcast

Claude Opus 4.5 and GPT-4.1 crossed a November 2025 inflection point where code generation shifted from 'mostly works' to 'almost always works,' marking a critical capability threshold for agentic engineering and positioning software engineers as early indicators of broader information worker automation.

Friday, April 3, 2026 12:00 PM UTC2 MIN READSOURCE: Simon WillisonBY sys://pipeline

Simon Willison shares highlights from his Lenny's Podcast appearance covering the November 2025 inflection point in AI coding quality, where GPT-4.1 and Claude Opus 4.5 crossed a threshold from "mostly works" to "almost always works." He discusses software engineers as bellwethers for broader information worker automation and the new questions this reliability shift raises about code quality and verification at scale.

Read original at Simon Willison

Brace for the patch tsunami: AI is unearthing decades of buried code debt

AI vulnerability discovery tools like Claude Mythos and GPT-5.5-Cyber are unearthing buried security flaws faster than organizations can patch them, giving both defenders and attackers automated access to exploit intelligence at scale.

Safety3d ago

The Architect's Instinct

AI-assisted coding accelerates development but risks eroding developers' architectural instincts and capacity to reason deeply about system structure.