The Agentic Coding Era

2019

February 14, 2019 • Technology

GPT-2 Announced

OpenAI announces GPT-2, a large language model that generates coherent paragraphs of text, but withholds the full weights due to safety concerns.

"This was the spark. It demonstrated that simply scaling up parameters and data (1.5B parameters) resulted in emergent capabilities."

July 25, 2019

Deep TabNine (GPT-2)

May 20, 2021

APPS Benchmark Released

2021

June 29, 2021 • Product

GitHub Copilot Technical Preview

GitHub Copilot technical preview announced by GitHub and OpenAI.

"Copilot is the moment coding assistance moves from "tool" to "companion". Shipping inside the editor rewires habits."

2021

July 7, 2021 • Technology

Codex & HumanEval Paper

Codex + HumanEval introduced in "Evaluating Large Language Models Trained on Code" (arXiv:2107.03374).

"Two things land at once: a model that can write real code, and a yardstick that makes progress legible."

2021

August 10, 2021 • Product

OpenAI Codex API Beta

OpenAI Codex API private beta announced (natural language → code).

"This turns code generation from a demo into an ingredient. APIs are how ideas escape labs and become defaults."

February 2, 2022

AlphaCode (DeepMind)

2022

June 21, 2022 • Product

GitHub Copilot GA

GitHub Copilot becomes generally available (paid subscription).

"GA means the weirdness is over: it's now normal to pay for a machine pair programmer."

June 23, 2022

Amazon CodeWhisperer Preview

2022

October 6, 2022 • Technology

ReAct Paper (Reason+Act)

ReAct (Reason + Act) introduced (arXiv:2210.03629).

"ReAct is the blueprint for agents: think, do, observe, repeat. It's the moment "chatbot" starts becoming "worker"."

October 31, 2022

Replit Ghostwriter

2022

November 30, 2022 • Product

ChatGPT Launch

OpenAI launches ChatGPT (GPT-3.5). Chat-driven programming workflows take off.

"ChatGPT makes prompting a new programming interface. "Chat-Driven Development" becomes a cultural default."

2023

January 17, 2023 • Product

Cursor IDE (Alpha)

Anysphere launches Cursor, a fork of VS Code designed to be "AI-native".

"The Editor itself must evolve. Cursor proved that plugins are insufficient for true agentic integration by indexing the local filesystem."

2023

March 14, 2023 • Technology

GPT-4 Released

GPT-4 released with major jump in coding + reasoning capability.

"The leap isn't just better code—it's fewer hallucinated steps and more coherent plans. That's what makes multi-step coding feasible."

2023

March 22, 2023 • Product

GitHub Copilot X

GitHub Copilot X announced (chat + PRs + GPT-4-powered experience).

"This is the pivot from inline suggestions to end-to-end workflow help. It starts to own outcomes, not keystrokes."

2023

March 23, 2023 • Technology

ChatGPT Plugins

ChatGPT plugins announced (tool use becomes mainstream).

"Tools are what turn language into leverage. The moment models can call systems, "write code" becomes "ship work"."

March 30, 2023

AutoGPT Released

2023

May 4, 2023 • Technology

StarCoder Released

StarCoder released (open-access code LLM) by ServiceNow and Hugging Face.

"Open models are how a category gets commoditized. StarCoder makes "coding LLM" something builders can actually own and modify."

2023

June 13, 2023 • Technology

OpenAI Function Calling

Function calling added to OpenAI API models.

"This is the missing primitive for agents: structured intent. It makes "call the tool" a dependable move instead of a prompt hack."

2023

August 24, 2023 • Technology

Code Llama Released

Code Llama released (coding-focused LLM family) by Meta.

"Another major lab shipping code models accelerates commoditization. When supply increases, people stop rationing usage."

2023

October 1, 2023 • Technology

SWE-bench Introduced

SWE-bench introduced (real GitHub issues as software-engineering eval).

"This is the benchmark that forces honesty. Real issues collapse the gap between "writes code" and "fixes bugs"."

October 5, 2023

v0.dev by Vercel

2023

November 6, 2023 • Technology

Assistants API

Assistants API announced at OpenAI DevDay.

"The SDK moment: threads, tools, and retrieval become off-the-shelf. Platforms win when they turn hard problems into defaults."

2023

December 29, 2023 • Product

Copilot Chat GA

GitHub Copilot Chat becomes generally available.

"Chat turns the assistant into a place you go, not a thing that occasionally interrupts you."

2024

February 15, 2024 • Technology

Gemini 1.5 Pro

Google releases Gemini 1.5 Pro with a 1-2 million token context window.

"Infinite Context changes the architecture of coding tools. Instead of complex RAG, you can now dump the entire codebase into the prompt."

2024

March 14, 2024 • Product

Devin (Cognition)

Devin announced as an autonomous AI software engineer; evaluated on SWE-bench.

"Devin is the marketing name for a real transition: agents that own a backlog item end-to-end. The bar just moved."

2024

April 2, 2024 • Technology

SWE-agent Released

SWE-agent released with strong SWE-bench results.

"This is the proof that the loop works: read repo, plan edits, run tests, patch. It's less magic, more engineering."

2024

April 29, 2024 • Product

Copilot Workspace Preview

GitHub Copilot Workspace technical preview.

"Workspace is "Spec to PR" as a product. It's an admission that the unit of work isn't a function—it's a task."

2024

June 21, 2024 • Technology

Claude 3.5 Sonnet

Claude 3.5 Sonnet released with "Artifacts" workflow.

"This is a quality jump that shows "coding model" isn't a monoculture. Multiple frontier models means assistants become a layer you can swap."

2024

June 17, 2024 • Technology

DeepSeek Coder V2

DeepSeek releases Coder V2, the first open-weights model to rival GPT-4 Turbo in coding.

"State-of-the-art coding intelligence becomes a commodity. This fuels the "Local AI" boom and puts pressure on paid API pricing."

2024

August 13, 2024 • Technology

SWE-bench Verified

SWE-bench Verified released (human-verified solvable subset).

"Verified eliminates the "maybe it's impossible" excuse. Once the target is clean, leaderboards become meaningful—and investment follows."

2024

September 5, 2024 • Product

Replit Agent

Replit integrates an autonomous agent that can plan, scaffold, and deploy full-stack apps with database provisioning.

"App development becomes accessible to non-coders on mobile devices. The "Hosting Environment" meets the "Agent"."

2024

September 12, 2024 • Technology

OpenAI o1-preview

OpenAI releases o1 (Strawberry), the first "Reasoning" model.

"Agents gain the ability to "Plan" effectively, reducing logic bugs in complex systems. It marked the shift from "fast thinking" to "slow thinking"."

2024

September 19, 2024 • Technology

Qwen 2.5 Coder

Alibaba Cloud releases Qwen 2.5 Coder. The 32B model brings GPT-4o level coding to consumer hardware.

"Local AI coding becomes viable for serious work. Privacy-conscious developers get a powerful open alternative to cloud APIs."

2024

October 3, 2024 • Product

OpenAI Canvas

OpenAI introduces Canvas, a collaborative interface for coding and writing.

"The "Chat" interface is officially insufficient for complex work. The industry shifts toward "Artifacts" and "Canvas" styles of interaction."

October 4, 2024

Lovable

October 5, 2024

Bolt.new

2024

October 22, 2024 • Technology

Claude "Computer Use"

Claude "computer use" capability introduced.

"GUIs are the surface area of the world. If an agent can click and type like a human, it can work anywhere without an API."

2024

October 29, 2024 • Product

Copilot Multi-Model

GitHub Copilot adds multi-model choice (Claude + Gemini + OpenAI models).

"This is the "browser wars" moment for coding models. Once the platform supports multiple engines, the assistant layer becomes permanent."

November 1, 2024

Windsurf Editor

2024

November 25, 2024 • Technology

Model Context Protocol

Model Context Protocol (MCP) introduced by Anthropic.

"Standards are how ecosystems form. MCP is a bet that context and tools will be as modular as libraries."

2025

January 25, 2025 • Culture

The "Vibe Coding" Era

Andrej Karpathy crystallizes the industry feeling with a tweet about "Vibe Coding".

"Coding transitions from a typing task to a managerial task. The "Vibe Check" became the new Code Review."

2025

February 6, 2025 • Product

Copilot Agent Mode

Copilot Agent Mode preview ("The agent awakens").

"This is GitHub saying the assistant should act, not just suggest. When the default tool becomes an agent, teams reorganize around delegation."

2025

February 24, 2025 • Product

Claude 3.7 Sonnet + Claude Code

Claude 3.7 Sonnet + Claude Code announced (agentic coding tool).

"CLI-native agents feel like "real programmers" because they can run tools and keep state. This is the path from chat to craftsmanship."

2025

February 25, 2025 • Product

Replit Agent v2

Replit releases Agent v2 with "real-time app design preview" and improved autonomous hypothesis formation.

"The agent now sees what it builds as it builds it. Design and logic loops tighten, reducing the "blind coding" errors of previous generations."

February 26, 2025

Continue 1.0

2025

April 16, 2025 • Technology

OpenAI o3

OpenAI o3 becomes generally available, setting new records on SWE-bench Verified (69.1%).

"Reasoning models mature. With 69% on SWE-bench, o3 proves that "slow thinking" is the key to solving complex engineering tasks autonomously."

2025

April 16, 2025 • Product

Codex CLI Open Source

OpenAI Codex CLI open-sourced (local terminal coding agent).

"Open-sourcing the client makes agents feel like tooling, not a website. The terminal is where developers already live."

2025

May 16, 2025 • Technology

Codex Agent Preview

OpenAI launches Codex (cloud-based software engineering agent) research preview.

"This is "agent as a teammate": async work, long-running tasks, and PR-shaped output. Once agents can wait and retry, they look like employees."

2025

May 19, 2025 • Product

Copilot Coding Agent

GitHub announces Copilot Coding Agent (async agent that opens PRs).

"The PR is the unit of integration. When an agent can open one, it's no longer helping you code—it's shipping code into your process."

2025

June 19, 2025 • Product

OpenCode AI

OpenCode AI launched as an open-source, terminal-native AI coding agent supporting 75+ LLMs.

"The terminal becomes the agentic workspace. OpenCode proves that developers want powerful, flexible AI tools that live where they code."

2025

July 15, 2025 • Product

Sourcegraph Amp

Sourcegraph launches Amp, its next-generation AI coding assistant succeeding Cody, designed for enterprise-scale codebase reasoning.

"Enterprise context is the moat. Amp leverages knowledge graphs to understand massive repositories better than generic models."

2025

September 25, 2025 • Product

Copilot Agent GA

Copilot Coding Agent becomes generally available.

"GA is where experiments become budget lines. Once agents are paid-for defaults, management starts asking what else can be delegated."

2025

October 12, 2025 • Technology

Beads Memory System

Steve Yegge releases Beads, a Git-backed memory and task-tracking system for coding agents. Issues stored as JSONL, hash-based IDs prevent merge collisions, and semantic "memory decay" summarizes old tasks.

""Your agents simply cannot keep track of work using Markdown files." Beads solves the long-horizon planning problem by giving agents a structured place to track state across sessions."

2025

October 15, 2025 • Product

Amp Free (Ad-Supported)

Sourcegraph launches Amp Free, an ad-supported tier making agentic coding accessible to everyone. Ads appear discreetly at bottom of editor/CLI; code snippets never shared with ad partners.

""Agentic coding is now free for everyone." The Netflix/Spotify model comes to dev tools—ads cover costs without compromising agent behavior or code privacy."

2025

October 28, 2025 • Product

GitHub Agent HQ

GitHub introduces Agent HQ to orchestrate multiple third-party coding agents.

"This is the "app store" move: the platform becomes the place agents run. Once orchestration is centralized, agents become plugins."

2025

November 24, 2025 • Technology

Claude Opus 4.5

Anthropic releases Claude Opus 4.5, completing the 4.5 model family. Leads on SWE-bench Verified and introduces an "effort parameter" for computation tradeoffs.

"Opus 4.5 sets a new bar for agentic coding: 80.9% on SWE-bench Verified. The effort parameter signals a shift toward adaptive compute—models that think harder when tasks demand it."

2025

December 1, 2025 • Culture

The Ralph Wiggum Technique

Geoffrey Huntley's "Ralph Wiggum" technique goes mainstream—a Bash loop that iteratively feeds prompts to Claude Code until tasks complete autonomously.

""Ralph Wiggum + Opus 4.5 is really, really good." — Matt Pocock. The technique reduces software costs dramatically by letting agents fail, learn from git history, and retry indefinitely."

2025

December 11, 2025 • Technology

GPT-5.2

GPT-5.2 released (Instant, Thinking, and Pro variants) as a frontier agentic coding model line.

"Naming a model line after the job is a tell: "coding agent" is no longer an application, it's a first-class product."

2026

January 1, 2026 • Technology

Gas Town (Yegge)

Steve Yegge releases Gas Town, a multi-agent orchestrator for coordinating 20-30 concurrent Claude Code agents. Named after Mad Max's oil refinery citadel.

""The biggest problem with Claude Code is that it ends." GUPP (Gastown Universal Propulsion Principle) solves this: "If there is work on your hook, YOU MUST RUN IT.""

2026

January 15, 2026 • Technology

Cursor: Scaling Multi-Agent Coding

Cursor publishes research on running hundreds of concurrent agents for weeks—building a web browser (1M+ lines), migrating Solid to React (+266K/-193K edits over 3 weeks), and a Windows 7 emulator (14.6K commits, 1.2M lines).

""The harness and models matter, but the prompts matter more." Flat hierarchies failed when 20 agents slowed to the throughput of 2-3 due to lock contention. The fix: separating planners (explore, spawn tasks) from workers (grind until done)."

End of Report

2019

February 14, 2019 • Technology

GPT-2 Announced

OpenAI announces GPT-2, a large language model that generates coherent paragraphs of text, but withholds the full weights due to safety concerns.

"This was the spark. It demonstrated that simply scaling up parameters and data (1.5B parameters) resulted in emergent capabilities."

July 25, 2019

Deep TabNine (GPT-2)

May 20, 2021

APPS Benchmark Released

2021

June 29, 2021 • Product

GitHub Copilot Technical Preview

GitHub Copilot technical preview announced by GitHub and OpenAI.

"Copilot is the moment coding assistance moves from "tool" to "companion". Shipping inside the editor rewires habits."

2021

July 7, 2021 • Technology

Codex & HumanEval Paper

Codex + HumanEval introduced in "Evaluating Large Language Models Trained on Code" (arXiv:2107.03374).

"Two things land at once: a model that can write real code, and a yardstick that makes progress legible."

2021

August 10, 2021 • Product

OpenAI Codex API Beta

OpenAI Codex API private beta announced (natural language → code).

"This turns code generation from a demo into an ingredient. APIs are how ideas escape labs and become defaults."

February 2, 2022

AlphaCode (DeepMind)

2022

June 21, 2022 • Product

GitHub Copilot GA

GitHub Copilot becomes generally available (paid subscription).

"GA means the weirdness is over: it's now normal to pay for a machine pair programmer."

June 23, 2022

Amazon CodeWhisperer Preview

2022

October 6, 2022 • Technology

ReAct Paper (Reason+Act)

ReAct (Reason + Act) introduced (arXiv:2210.03629).

"ReAct is the blueprint for agents: think, do, observe, repeat. It's the moment "chatbot" starts becoming "worker"."

October 31, 2022

Replit Ghostwriter

2022

November 30, 2022 • Product

ChatGPT Launch

OpenAI launches ChatGPT (GPT-3.5). Chat-driven programming workflows take off.

"ChatGPT makes prompting a new programming interface. "Chat-Driven Development" becomes a cultural default."

2023

January 17, 2023 • Product

Cursor IDE (Alpha)

Anysphere launches Cursor, a fork of VS Code designed to be "AI-native".

"The Editor itself must evolve. Cursor proved that plugins are insufficient for true agentic integration by indexing the local filesystem."

2023

March 14, 2023 • Technology

GPT-4 Released

GPT-4 released with major jump in coding + reasoning capability.

"The leap isn't just better code—it's fewer hallucinated steps and more coherent plans. That's what makes multi-step coding feasible."

2023

March 22, 2023 • Product

GitHub Copilot X

GitHub Copilot X announced (chat + PRs + GPT-4-powered experience).

"This is the pivot from inline suggestions to end-to-end workflow help. It starts to own outcomes, not keystrokes."

2023

March 23, 2023 • Technology

ChatGPT Plugins

ChatGPT plugins announced (tool use becomes mainstream).

"Tools are what turn language into leverage. The moment models can call systems, "write code" becomes "ship work"."

March 30, 2023

AutoGPT Released

2023

May 4, 2023 • Technology

StarCoder Released

StarCoder released (open-access code LLM) by ServiceNow and Hugging Face.

"Open models are how a category gets commoditized. StarCoder makes "coding LLM" something builders can actually own and modify."

2023

June 13, 2023 • Technology

OpenAI Function Calling

Function calling added to OpenAI API models.

"This is the missing primitive for agents: structured intent. It makes "call the tool" a dependable move instead of a prompt hack."

2023

August 24, 2023 • Technology

Code Llama Released

Code Llama released (coding-focused LLM family) by Meta.

"Another major lab shipping code models accelerates commoditization. When supply increases, people stop rationing usage."

2023

October 1, 2023 • Technology

SWE-bench Introduced

SWE-bench introduced (real GitHub issues as software-engineering eval).

"This is the benchmark that forces honesty. Real issues collapse the gap between "writes code" and "fixes bugs"."

October 5, 2023

v0.dev by Vercel

2023

November 6, 2023 • Technology

Assistants API

Assistants API announced at OpenAI DevDay.

"The SDK moment: threads, tools, and retrieval become off-the-shelf. Platforms win when they turn hard problems into defaults."

2023

December 29, 2023 • Product

Copilot Chat GA

GitHub Copilot Chat becomes generally available.

"Chat turns the assistant into a place you go, not a thing that occasionally interrupts you."

2024

February 15, 2024 • Technology

Gemini 1.5 Pro

Google releases Gemini 1.5 Pro with a 1-2 million token context window.

"Infinite Context changes the architecture of coding tools. Instead of complex RAG, you can now dump the entire codebase into the prompt."

2024

March 14, 2024 • Product

Devin (Cognition)

Devin announced as an autonomous AI software engineer; evaluated on SWE-bench.

"Devin is the marketing name for a real transition: agents that own a backlog item end-to-end. The bar just moved."

2024

April 2, 2024 • Technology

SWE-agent Released

SWE-agent released with strong SWE-bench results.

"This is the proof that the loop works: read repo, plan edits, run tests, patch. It's less magic, more engineering."

2024

April 29, 2024 • Product

Copilot Workspace Preview

GitHub Copilot Workspace technical preview.

"Workspace is "Spec to PR" as a product. It's an admission that the unit of work isn't a function—it's a task."

2024

June 21, 2024 • Technology

Claude 3.5 Sonnet

Claude 3.5 Sonnet released with "Artifacts" workflow.

"This is a quality jump that shows "coding model" isn't a monoculture. Multiple frontier models means assistants become a layer you can swap."

2024

June 17, 2024 • Technology

DeepSeek Coder V2

DeepSeek releases Coder V2, the first open-weights model to rival GPT-4 Turbo in coding.

"State-of-the-art coding intelligence becomes a commodity. This fuels the "Local AI" boom and puts pressure on paid API pricing."

2024

August 13, 2024 • Technology

SWE-bench Verified

SWE-bench Verified released (human-verified solvable subset).

"Verified eliminates the "maybe it's impossible" excuse. Once the target is clean, leaderboards become meaningful—and investment follows."

2024

September 5, 2024 • Product

Replit Agent

Replit integrates an autonomous agent that can plan, scaffold, and deploy full-stack apps with database provisioning.

"App development becomes accessible to non-coders on mobile devices. The "Hosting Environment" meets the "Agent"."

2024

September 12, 2024 • Technology

OpenAI o1-preview

OpenAI releases o1 (Strawberry), the first "Reasoning" model.

"Agents gain the ability to "Plan" effectively, reducing logic bugs in complex systems. It marked the shift from "fast thinking" to "slow thinking"."

2024

September 19, 2024 • Technology

Qwen 2.5 Coder

Alibaba Cloud releases Qwen 2.5 Coder. The 32B model brings GPT-4o level coding to consumer hardware.

"Local AI coding becomes viable for serious work. Privacy-conscious developers get a powerful open alternative to cloud APIs."

2024

October 3, 2024 • Product

OpenAI Canvas

OpenAI introduces Canvas, a collaborative interface for coding and writing.

"The "Chat" interface is officially insufficient for complex work. The industry shifts toward "Artifacts" and "Canvas" styles of interaction."

October 4, 2024

Lovable

October 5, 2024

Bolt.new

2024

October 22, 2024 • Technology

Claude "Computer Use"

Claude "computer use" capability introduced.

"GUIs are the surface area of the world. If an agent can click and type like a human, it can work anywhere without an API."

2024

October 29, 2024 • Product

Copilot Multi-Model

GitHub Copilot adds multi-model choice (Claude + Gemini + OpenAI models).

"This is the "browser wars" moment for coding models. Once the platform supports multiple engines, the assistant layer becomes permanent."

November 1, 2024

Windsurf Editor

2024

November 25, 2024 • Technology

Model Context Protocol

Model Context Protocol (MCP) introduced by Anthropic.

"Standards are how ecosystems form. MCP is a bet that context and tools will be as modular as libraries."

2025

January 25, 2025 • Culture

The "Vibe Coding" Era

Andrej Karpathy crystallizes the industry feeling with a tweet about "Vibe Coding".

"Coding transitions from a typing task to a managerial task. The "Vibe Check" became the new Code Review."

2025

February 6, 2025 • Product

Copilot Agent Mode

Copilot Agent Mode preview ("The agent awakens").

"This is GitHub saying the assistant should act, not just suggest. When the default tool becomes an agent, teams reorganize around delegation."

2025

February 24, 2025 • Product

Claude 3.7 Sonnet + Claude Code

Claude 3.7 Sonnet + Claude Code announced (agentic coding tool).

"CLI-native agents feel like "real programmers" because they can run tools and keep state. This is the path from chat to craftsmanship."

2025

February 25, 2025 • Product

Replit Agent v2

Replit releases Agent v2 with "real-time app design preview" and improved autonomous hypothesis formation.

"The agent now sees what it builds as it builds it. Design and logic loops tighten, reducing the "blind coding" errors of previous generations."

February 26, 2025

Continue 1.0

2025

April 16, 2025 • Technology

OpenAI o3

OpenAI o3 becomes generally available, setting new records on SWE-bench Verified (69.1%).

"Reasoning models mature. With 69% on SWE-bench, o3 proves that "slow thinking" is the key to solving complex engineering tasks autonomously."

2025

April 16, 2025 • Product

Codex CLI Open Source

OpenAI Codex CLI open-sourced (local terminal coding agent).

"Open-sourcing the client makes agents feel like tooling, not a website. The terminal is where developers already live."

2025

May 16, 2025 • Technology

Codex Agent Preview

OpenAI launches Codex (cloud-based software engineering agent) research preview.

"This is "agent as a teammate": async work, long-running tasks, and PR-shaped output. Once agents can wait and retry, they look like employees."

2025

May 19, 2025 • Product

Copilot Coding Agent

GitHub announces Copilot Coding Agent (async agent that opens PRs).

"The PR is the unit of integration. When an agent can open one, it's no longer helping you code—it's shipping code into your process."

2025

June 19, 2025 • Product

OpenCode AI

OpenCode AI launched as an open-source, terminal-native AI coding agent supporting 75+ LLMs.

"The terminal becomes the agentic workspace. OpenCode proves that developers want powerful, flexible AI tools that live where they code."

2025

July 15, 2025 • Product

Sourcegraph Amp

Sourcegraph launches Amp, its next-generation AI coding assistant succeeding Cody, designed for enterprise-scale codebase reasoning.

"Enterprise context is the moat. Amp leverages knowledge graphs to understand massive repositories better than generic models."

2025

September 25, 2025 • Product

Copilot Agent GA

Copilot Coding Agent becomes generally available.

"GA is where experiments become budget lines. Once agents are paid-for defaults, management starts asking what else can be delegated."

2025

October 12, 2025 • Technology

Beads Memory System

""Your agents simply cannot keep track of work using Markdown files." Beads solves the long-horizon planning problem by giving agents a structured place to track state across sessions."

2025

October 15, 2025 • Product

Amp Free (Ad-Supported)

Sourcegraph launches Amp Free, an ad-supported tier making agentic coding accessible to everyone. Ads appear discreetly at bottom of editor/CLI; code snippets never shared with ad partners.

""Agentic coding is now free for everyone." The Netflix/Spotify model comes to dev tools—ads cover costs without compromising agent behavior or code privacy."

2025

October 28, 2025 • Product

GitHub Agent HQ

GitHub introduces Agent HQ to orchestrate multiple third-party coding agents.

"This is the "app store" move: the platform becomes the place agents run. Once orchestration is centralized, agents become plugins."

2025

November 24, 2025 • Technology

Claude Opus 4.5

Anthropic releases Claude Opus 4.5, completing the 4.5 model family. Leads on SWE-bench Verified and introduces an "effort parameter" for computation tradeoffs.

"Opus 4.5 sets a new bar for agentic coding: 80.9% on SWE-bench Verified. The effort parameter signals a shift toward adaptive compute—models that think harder when tasks demand it."

2025

December 1, 2025 • Culture

The Ralph Wiggum Technique

Geoffrey Huntley's "Ralph Wiggum" technique goes mainstream—a Bash loop that iteratively feeds prompts to Claude Code until tasks complete autonomously.

""Ralph Wiggum + Opus 4.5 is really, really good." — Matt Pocock. The technique reduces software costs dramatically by letting agents fail, learn from git history, and retry indefinitely."

2025

December 11, 2025 • Technology

GPT-5.2

GPT-5.2 released (Instant, Thinking, and Pro variants) as a frontier agentic coding model line.

"Naming a model line after the job is a tell: "coding agent" is no longer an application, it's a first-class product."

2026

January 1, 2026 • Technology

Gas Town (Yegge)

Steve Yegge releases Gas Town, a multi-agent orchestrator for coordinating 20-30 concurrent Claude Code agents. Named after Mad Max's oil refinery citadel.

""The biggest problem with Claude Code is that it ends." GUPP (Gastown Universal Propulsion Principle) solves this: "If there is work on your hook, YOU MUST RUN IT.""

2026

January 15, 2026 • Technology

Cursor: Scaling Multi-Agent Coding

End of Report