Lesson 1 — What Is an LLM?

Before you build anything with an LLM, you need one mental model that changes how you see everything. This lesson gives you that model.

The one-sentence truth

An LLM — Large Language Model — is a program that has read an enormous amount of human text and learned one skill: predicting what word comes next.

That is it. Everything else — coding help, creative writing, answering questions, summarising documents — is a consequence of doing that one thing extremely well, on an almost unimaginable scale.

The LLM picks the most probable next word — then repeats until the response is done.

Why does predicting words make it seem smart?

To predict what word comes next in "The capital of France is ___", the model has to have absorbed the fact that Paris is the capital of France. To predict the next line of code, it has to understand what the previous lines do.

It learned all of this by reading a significant fraction of the internet, Wikipedia, books, code repositories, and scientific papers — hundreds of billions of words. The "intelligence" is compressed pattern-matching from that data.

Product builder's insight: You are not building the LLM. You are building the thing around it — the interface, the data pipeline, the prompts that shape its behaviour. The LLM is an engine. Your product is the car.

What "large" actually means

The "L" in LLM refers to the number of parameters — internal numbers adjusted during training to improve prediction. A small model might have 7 billion parameters. GPT-4 has an estimated 1.8 trillion.

More parameters generally means better reasoning — but also requires more compute. Your RTX 5060 (8GB VRAM) runs 7B models comfortably. Anything larger needs the cloud.

Runs on
your GPU ✓

13B

Slow on
your GPU ⚠️

70B+

Needs cloud
or big GPU ✗

GPT-4
~1.8T

Cloud only
(API) ☁️

How your product will use an LLM

Building LLM products is mostly about steps 1 and 3: writing effective prompts and deciding what to do with the output. That is where your value as a builder lives.

Primary source — watch after the quiz

Intro to Large Language Models — Andrej Karpathy (YouTube, 1 hour)

Karpathy explains this exact mental model visually. Best single hour you can invest in understanding LLMs. No CS background needed.

Ask your teacher anything. Type your question directly to Claude — your AI teacher is right here in this session. Confused about parameters? Want to know how this applies to your product idea? Just ask.

What Is an LLM?

The one-sentence truth

Why does predicting words make it seem smart?

What "large" actually means

How your product will use an LLM

Check your understanding