AI · Featured Debate
5 guests 6 episodes 2,974 words

Copilot or Autopilot? The AI Architecture Decision That Defines the Next Decade

Should AI products be copilots or fully autonomous agents?

GitHub named its AI tool "Copilot" -- not "Pilot" -- for a reason. But Bret Taylor thinks the entire market is going toward agents. Scott Wu built Devin to work as a fully autonomous junior engineer that submits its own pull requests. Intercom's AI agent Fin is on track for $100M ARR in under three quarters, priced at $0.99 per resolved ticket. The copilot metaphor is comforting, but is it already outdated?

This is not a philosophical debate about the future of AI. It is a concrete product architecture and business model decision that every team building with AI has to make right now: do you build a tool that assists humans, or a system that acts on their behalf?

Should AI products be designed as copilots (augmenting humans in real time) or as autonomous agents (completing tasks end-to-end independently)? And how does this choice shape your product, pricing, and competitive position?

Intercom

Fin: $100M ARR trajectory in <3 quarters, 300%+ YoY growth, #1 by customer count, revenue, and benchmarks

Early Fin economics: $0.99 revenue vs. $1.20 cost per resolution -- a deliberate bet on declining inference costs

Cognition (Devin)

Cognition's 15-person team: each engineer uses 5 Devins, ~25% of PRs are agent-authored, expected to exceed 50%

Devin integrates with Slack, Linear, and GitHub -- the same tools human engineers use

Microsoft

Cursor hitting $300M ARR in 2 years -- a competitive challenge to Microsoft's Copilot

Microsoft Copilot across Office products (seat-based augmentation) alongside agent research projects

Andreessen Horowitz (a16z)

Linus Torvalds acknowledging AI now codes better than the world's best programmers (holiday break 2025)

Executive/secretary email example: tasks shifted but both jobs persisted with different task bundles

GitHub

GitHub Copilot: 1.5M+ developers, 37,000+ organizations, 55% faster coding, 85% more confident in code quality

Shopify: over 1 million lines of code written by Copilot in their codebase

Sierra / OpenAI

Sierra's AI agents handle customer interactions end-to-end with outcome-based pricing

Quip (Taylor's productivity startup, sold to Salesforce for $750M) is cited as evidence of how hard it is to monetize productivity tools

The Synthesis

The copilot-vs-agent debate is really a debate about three things: task measurability, error tolerance, and business model.

01
Task Measurability
What three variables actually determine whether to build a copilot or an agent?
02
Trust Curve
Why does the transition follow trust, not capability?
03
Institutional Resistance
Why will copilots persist even where agents are technically superior?
04
Task Unbundling
Is the real unit of analysis the product or the task?

The copilot-vs-agent debate is really about three things: task measurability, error tolerance, and business model. Outcome-based pricing creates structurally superior business dynamics, but requires clearly measurable and attributable AI impact.

The copilot-to-agent transition follows a trust curve, not a capability curve. New AI capabilities start as copilots with high oversight and graduate to agents as users build confidence. GitHub Copilot started as code completion and is progressively becoming more agentic. The most successful customers onboard agents like junior engineers: incrementally, through small wins.

Regulated industries, unionized workforces, and bureaucratic organizations will mandate human-in-the-loop for legal and political reasons long after the technical case for autonomy is settled. ChatGPT may be a better doctor than your doctor today, but it cannot get a license to practice medicine.

The real unit of analysis is the task, not the product. The copilot-vs-agent choice is about which tasks to shift from human to AI. The job persists longer than the individual tasks. Ten years from now, the job title might simply be 'I build products' -- encompassing what we currently separate into PM, engineering, and design.

Which Approach Fits You?

Answer 3 questions about your situation. We'll match you to the right approach.

Question 1

How measurable and attributable are the outcomes of your AI product?

Question 2

What is the error tolerance in your domain?

Question 3

How do you want to price your AI product?

Notable Absences

The Bottom Line

Andreessen adds a structural reason copilots may persist even where agents are technically superior: institutional resistance. "ChatGPT is almost certainly a better doctor than your doctor today, but ChatGPT can't get a license to practice medicine." Regulated industries, unionized workforces, and bureaucratic organizations will mandate human-in-the-loop for legal and political reasons long after the technical case for autonomy is settled.

The non-obvious insight: the copilot-to-agent transition follows a trust curve, not a capability curve. New AI capabilities start as copilots (low trust, high oversight) and graduate to agents (high trust, low oversight) as users build confidence. GitHub Copilot started as code completion and is progressively becoming more agentic. Intercom had a working Fin prototype six weeks after GPT-3.5, but pricing it at $0.99 per resolution was a trust bet that took months of conviction. Scott Wu's most successful customers onboard Devin the same way they would onboard a junior engineer: incrementally, building trust through small wins.

  1. Bret Taylor"He saved OpenAI, invented the “Like” button, and built Google Maps: Bret Taylor on the future of careers, coding, agents, and more" — Lenny's Podcast, July 31, 2025
  2. Inbal Shani"The future of AI in software development | Inbal Shani (CPO of GitHub)" — Lenny's Podcast, December 1, 2023
  3. Scott Wu"How Devin replaces your junior engineers with infinite AI interns that never sleep | Scott Wu (Cognition CEO)" — Lenny's Podcast, September 8, 2025
  4. Eoghan McCabe"How Intercom rose from the ashes by betting everything on AI | Eoghan McCabe (founder and CEO)" — Lenny's Podcast, August 21, 2025
  5. Aparna Chennapragada"Microsoft CPO: If you aren’t prototyping with AI, you’re doing it wrong | Aparna Chennapragada" — Lenny's Podcast, May 18, 2025
esc
Loading…
navigate filter openesc close