Wills Education

Chapter 01

The setup: a research lab nobody asked to scale

OpenAI began life in 2015 as a non-profit AI research lab with a stated mission of "ensuring AGI benefits all of humanity." For its first six years it published papers, released the occasional eye-catching demo (DALL·E, GPT-3), and lived off donations and a $1B Microsoft commitment. Nothing about that origin predicted the company that, by late 2024, would post $3.7B in revenue and serve 200M weekly active users.

What changed wasn't the research. The research had been steadily compounding for years. What changed was distribution. ChatGPT, launched in November 2022 as a low-priority "research preview," did something that no prior AI product had done, it gave a normal person a useful experience in their first session. Five days later it had a million users. Two months later it had a hundred million. The lab had accidentally shipped the fastest-adopted consumer product in history.

1
2015
Founded as a non-profit
Mission statement: ensure AGI benefits all of humanity.
2
2019
Capped-profit + Microsoft $1B
Structure flipped to enable commercial scaling. Azure becomes the exclusive compute partner.
3
2022
ChatGPT research preview
1M users in 5 days. 100M in 2 months. Fastest consumer product launch in history.
4
2023
Board crisis weekend
Sam Altman fired and re-hired in 5 days. Microsoft's leverage becomes public.
5
2024
Corporate restructure
Non-profit veto removed. $3.7B revenue, $13B+ Microsoft commitment.

Chapter 02

The product funnel: ChatGPT → API → GPTs

OpenAI's monetisation is structured as a three-tier funnel that prints fundamentally different margins. ChatGPT (free + Plus + Team + Enterprise) is the consumer wedge, high volume, mass-market pricing, and the layer where most users meet the brand. The API is where the actual platform business lives: priced per token, used by thousands of SaaS companies, and the layer that turned every founder into an OpenAI distributor without OpenAI having to hire a single sales rep for them. GPTs and the GPT Store sit on top, attempting to capture some of the value those founders are creating.

	ChatGPT	API	GPTs / Store
User type	End consumer + teams	Developers / SaaS founders	Power users + builders
Pricing model	Subscription tiers	Per-token usage	Subscription + revenue share
Margin profile	Low (compute-heavy)	High once at scale	Mixed, early stage
Strategic role	Brand + feedback data	Distribution platform	Ecosystem capture

The genius of this stack is that each tier subsidises the next. ChatGPT generates the user feedback that improves the models. The improved models make the API more attractive. The API ecosystem creates demand for the underlying compute, which justifies the next round of model investment. None of the tiers in isolation is defensible, but the combination compounds.

Chapter 03

The real moat: RLHF at scale

Most discussions of OpenAI's competitive position focus on the model weights, but raw model capability has commoditised faster than anyone predicted. Llama, Claude, Gemini and Mistral all match or exceed GPT-4 on most public benchmarks. If the moat were the model, OpenAI would already be in trouble.

OpenAI has three years of preference data from real ChatGPT conversations, scaled annotation pipelines staffed by thousands of contractors, and an experimentation system that lets them ship a model variant to a small slice of users and read the result in days. Competitors can copy the architecture; they cannot copy the feedback loop.

3 yrs

Of preference data

real ChatGPT conversations

1,000s

Annotators

scaled global pipeline

Days

Variant ship cycle

model → users → readout

This is why the most useful frame for OpenAI is not "AI lab" but "data company that happens to train models." The labelling, the rating, the safety classifiers, that's where the work is.

Chapter 04

The Microsoft entanglement

Every conversation about OpenAI eventually arrives at Microsoft. The original 2019 deal, $1B in compute credits in exchange for an exclusive Azure partnership, has grown into a $13B+ commitment and a structural dependency: OpenAI doesn't own its compute, and Microsoft has commercial rights to OpenAI's models in perpetuity (with limits that have been re-negotiated multiple times).

Microsoft's investment in OpenAI ($B, cumulative commitment)

From experimental research bet to one of the largest corporate AI commitments in history.

This is the classic platform tradeoff played at maximum stakes. Microsoft gives OpenAI scale they couldn't otherwise afford. In exchange, Microsoft gets to ship Copilot, Azure OpenAI, and the broader enterprise integration story without having to build the underlying models themselves. Both companies need the relationship to work, and both have publicly hedged. Microsoft has invested in alternate model providers (and built its own MAI-1). OpenAI has built relationships with Oracle and SoftBank for the Stargate compute project.

The November 2023 board crisis (when OpenAI's board fired and then re-hired Sam Altman over a weekend) was, in part, a referendum on this dependency. Microsoft's threat to absorb the entire research staff if Altman wasn't reinstated revealed who actually controlled the strategic optionality. Twelve months later, OpenAI announced a corporate restructure to remove the non-profit's veto over commercial decisions.

Chapter 05

What you can copy from this playbook

Three things are transferable to almost any product team. First: the API wedge. Even if your product is consumer-facing, exposing a clean programmatic interface lets other companies build distribution for you for free. OpenAI shipped ChatGPT and the API the same week, the API was not an afterthought, it was the strategy.

Second: instrument the feedback loop before you scale the product. OpenAI was logging conversation-level signals from day one of ChatGPT's research preview. By the time competitors caught up on model quality, OpenAI had a dataset they could not. If you're shipping ML in your product, the question "how do we know if the output was good?" should be answered before launch, not after.

Third: pricing tiers as user segmentation, not just monetisation. ChatGPT Free → Plus → Team → Enterprise isn't a price ladder, it's four different products with different success metrics. Free optimises for engagement. Plus optimises for retention. Team optimises for seat expansion. Enterprise optimises for security and compliance. Building four products inside one brand is hard, but it's the move that lets a single account team sell from $0 to $7-figure deals without re-positioning.

Chapter 06

Looking Ahead: The AGI Infrastructure Race & Stargate

As OpenAI transitions toward agentic actions and reasoning-centric systems (like the o-series models), its primary bottleneck has shifted from model algorithms to massive compute scale. OpenAI, in partnership with Microsoft and Oracle, has planned the Stargate compute cluster—a proposed $100B, multi-gigawatt supercomputer powered by dedicated nuclear energy grids to scale model reasoning capabilities.

$100B

Estimated project cost

largest compute project in history

5 GW

Power requirement

backed by clean nuclear grids

1M+

Reasoning chips

interconnected cluster scale

OpenAIInside OpenAI's path from research lab to $3B ARR platform.

The setup: a research lab nobody asked to scale

The product funnel: ChatGPT → API → GPTs

The real moat: RLHF at scale

The Microsoft entanglement

What you can copy from this playbook

Looking Ahead: The AGI Infrastructure Race & Stargate

More in AI & ML

Netflix's recommender: the $1B algorithm hiding in plain sight.

Spotify's discovery engine, how Wrapped became a growth loop.

Stripe Radar: fraud detection as a developer primitive.

Ready to apply this playbook?