One Metric to Rule Them All? The North Star Debate That Shapes Every Product Team
Should your company rally around a single North Star metric?
Stripe Atlas went from 15% to 85% of founders with zero support tickets -- and market share followed the same curve. Airtable abandoned revenue as its north star and switched to adoption. Duolingo ran a sensitivity analysis on seven retention levers and discovered one had five times the impact of the rest. Amazon stopped chasing quarterly revenue targets and started obsessing over page load times. Twitter's DAU focus killed Spaces before it had a chance.
These are not just metric stories. They are strategy stories. The metric you choose determines what work feels important, which bets get funded, and which innovations die in committee.
Should your team rally around a single north star metric, or is that oversimplification dangerous? And if you pick one, should it measure value delivered to users, value captured by the business, or something else entirely?
The 4 Positions
Evidence from the Archive
Airtable shifted from revenue to user growth as the organizational NSM, enabling teams to take a decade-long view on...
Airtable shifted from revenue to user growth as the organizational NSM, enabling teams to take a decade-long view on monetization
At Nubank, every new product must reach 50% 'very disappointed' threshold before launch, even at the feature level
At Dropbox, the NSM was tied to files stored and shared, reflecting the core value of easy file access
Airbnb: nights booked as NSM reflecting value for both hosts and guests
WhatsApp: messages sent as NSM (value delivered), with monetization metrics tracked separately
Instacart: monthly active orderers (MAO) as the volume NSM, with sub-teams owning metrics like app load time, search...
Ramp: pipeline driven and payback period as paired NSMs, with growth engineering team supporting sales efficiency through AI automation
Jeff Bezos and the S-Team reviewed input metrics weekly, not quarterly output metrics
Amazon tracked selection breadth, delivery speed, page load times, and price competitiveness as input metrics -- all controllable by teams
Stripe Atlas: zero support tickets from 15% to 85% over 18 months, market share followed the same curve
Specific tactical sub-metric: time from application to final risk review decision, measured as a cohort curve that should go 'up into the left'
The Synthesis
The surface tension -- one metric versus two versus many -- masks the deeper agreement. Every guest converges on three principles:
Every guest converges on this: revenue is value captured, not value delivered. It is much farther down the line of whether the growth team can impact it. Even Amazon's framework builds everything around customer-experience inputs that are the opposite of revenue targets.
The metric must connect to daily work. Input metrics should answer 'What should I work on today?' A north star that only the CEO checks is not a north star -- it is a scoreboard. The key is translation layers that convert abstract NSMs into team-level metrics with clear ownership.
Every advocate of single-metric focus also advocates countermeasures. Guardrails are non-negotiable. Revenue correlation, healthy growth metrics, and multi-metric tracking alongside the single optimization target prevent gaming and perverse incentives.
Which Approach Fits You?
Answer 3 questions about your situation. We'll match you to the right approach.
How large is your organization?
Do you need to protect speculative bets from short-term metric pressure?
How often does your metric create perverse incentives?
Notable Absences
The Bottom Line
Lenny's own newsletter work provides the bridge. In "Choosing Your North Star Metric" (published via a16z), he surveyed 40+ companies and concluded: "Your North Star Metric is your strategy, and your strategy is your North Star Metric." In "Fostering a Culture of Experimentation," he showed how Airbnb's alignment around nights booked transformed OKRs from "Launch X" to measurable outcomes. And the Duolingo case study in his newsletter demonstrates the gold standard for NSM selection: Duolingo built a quantitative model of user segments, simulated moving each retention lever by 2% per quarter, and discovered that current user retention rate (CURR) had five times the impact of the next best lever on DAU. That analysis created the Retention Team and reignited Duolingo's growth.
**Third, guardrails are non-negotiable.** Every advocate of single-metric focus also advocates countermeasures. Weinstein tracks multiple metrics but optimizes one. Ellis requires revenue correlation as a guardrail. Isford layers additional metrics for "healthy and sustainable growth." Even Kayvon Beykpour's critique is not anti-metric -- it is anti-metric-without-guardrails.
Sources
- Sean Ellis — "The original growth hacker reveals his secrets | Sean Ellis (author of “Hacking Growth”)" — Lenny's Podcast, September 5, 2024
- Itamar Gilad — "Becoming evidence-guided | Itamar Gilad (Gmail, YouTube, Microsoft)" — Lenny's Podcast, September 21, 2023
- Sri Batchu — "Lessons from scaling Ramp | Sri Batchu (Ramp, Instacart, Opendoor)" — Lenny's Podcast, June 25, 2023
- Bill Carr — "Unpacking Amazon’s unique ways of working | Bill Carr (author of Working Backwards)" — Lenny's Podcast, November 2, 2023
- Lauryn Isford — "Mastering onboarding | Lauryn Isford (Head of Growth at Airtable)" — Lenny's Podcast, February 12, 2023
- Jeff Weinstein — "Building product at Stripe: craft, metrics, and customer obsession | Jeff Weinstein (Product lead)" — Lenny's Podcast, July 11, 2024