Before you build anything with an LLM, you need one mental model that changes how you see everything. This lesson gives you that model.
An LLM — Large Language Model — is a program that has read an enormous amount of human text and learned one skill: predicting what word comes next.
That is it. Everything else — coding help, creative writing, answering questions, summarising documents — is a consequence of doing that one thing extremely well, on an almost unimaginable scale.
The LLM picks the most probable next word — then repeats until the response is done.
To predict what word comes next in "The capital of France is ___", the model has to have absorbed the fact that Paris is the capital of France. To predict the next line of code, it has to understand what the previous lines do.
It learned all of this by reading a significant fraction of the internet, Wikipedia, books, code repositories, and scientific papers — hundreds of billions of words. The "intelligence" is compressed pattern-matching from that data.
The "L" in LLM refers to the number of parameters — internal numbers adjusted during training to improve prediction. A small model might have 7 billion parameters. GPT-4 has an estimated 1.8 trillion.
More parameters generally means better reasoning — but also requires more compute. Your RTX 5060 (8GB VRAM) runs 7B models comfortably. Anything larger needs the cloud.
Every LLM product works the same way at the core:
Building LLM products is mostly about steps 1 and 3: writing effective prompts and deciding what to do with the output. That is where your value as a builder lives.
Karpathy explains this exact mental model visually. Best single hour you can invest in understanding LLMs. No CS background needed.