Skip to main content

Suggested Reading Order

Three paths through the primer. Pick the one that matches your role and time budget, then use the tables below to navigate. All paths start at the Welcome page.


PM / Product Path (~3โ€“4 hours)โ€‹

For product managers, product strategists, and technical leaders who need fluent understanding without deep implementation knowledge.

#PageTimeNotes
1Welcome5 minStart here
2What Is an LLM?15 minFoundation
3Tokens & Tokenization15 minExplains billing
4Embeddings15 minPowers semantic search
5Transformer Architecture20 minHow it works
6Attention Mechanism15 minWhy context matters
7Training Pipeline25 minSFT, RLHF explained
8Context Windows20 minMemory & cost
9Mixture of Experts15 minWhy MoE matters
10RAG20 minPrivate knowledge
11Agents & Tool Use20 minAgentic systems
12Cost & Latency20 minBudgeting AI
13Evaluation & Benchmarks15 minReading leaderboards
14Agentic Vocabulary25 minTerminology
15Tools Landscape30 minReference

Engineer Path (~6โ€“8 hours)โ€‹

For software engineers, platform engineers, and ML engineers who want deep conceptual grounding plus implementation context.

#PageTimeNotes
1Welcome5 min
2What Is an LLM?15 min
3Tokens & Tokenization15 min
4Embeddings15 min
5Transformer Architecture20 minRead full paper abstract
6Attention Mechanism15 min
7Model Architecture Types15 min
8Training Pipeline25 minRead InstructGPT abstract
9Context Windows20 min
10Mixture of Experts15 min
11Scaling Laws15 min
12RAG20 min
13Agents & Tool Use20 min
14Multimodal15 min
15Evaluation & Benchmarks15 min
16Cost & Latency20 min
17Prompt Engineering20 min
18Agentic Vocabulary25 min
19Tools Landscape30 minReference

Researcher Path (~12โ€“15 hours)โ€‹

All pages in sidebar order plus full paper reading. For each paper cited in a page, read the full paper โ€” not just the abstract. Use the Papers reference as your guide.

Work through every page in the order they appear in the sidebar: Basics โ†’ Intermediate โ†’ Advanced โ†’ Use Cases. When a page cites a paper, follow the link and read according to the PM reading guide at minimum; for the researcher path, read the full paper.

Estimated total: 12โ€“15 hours depending on paper reading depth. The scaling laws papers and the transformer paper in particular reward slow reading โ€” both are dense with implications that take time to absorb.