AI

Large Language Models (LLMs)

A practical guide to understanding what large language models are and why they matter for product and UX teams.

What LLMs are, how they generate responses, and what designers and product people need to understand to work effectively with AI-powered features.

22 May 20265 min read

What it is

A is a type of AI trained on vast amounts of text to understand and generate human language.

learn in language by processing enormous quantities of text — books, websites, articles, code, and more. From that training, they develop the ability to predict what comes next in a sequence of words, which is the core mechanism behind generating .

When you type a question or into an AI tool, the LLM generates a one token at a time, each step influenced by everything that came before it.

This means are not retrieving pre-written answers. They are generating new text based on learned during training.

power most of the AI people interact with today, from chatbots and writing assistants to tools, code generators, and product recommendation systems.

When to use it

Understand when are the right foundation. They are most relevant when:

They are less relevant when:

You need to process, generate, or summarise natural language at scale
Users need to interact with a system through conversational or free-text input
Tasks involve interpretation, summarisation, or content generation
You are building features that need to understand context across multiple inputs
The task requires precise numerical calculation or logic
Real-time accuracy is critical and hallucinations cannot be tolerated
The data is structured and better handled by traditional software

Key takeaway

LLMs are powerful at language tasks but not infallible. Understanding what they are and how they work helps you design features that play to their strengths.

How it works

Understand the basic mechanism. are trained using a called next-token prediction. The is shown huge amounts of text and learns to predict what word or phrase comes next based on the context that precedes it.

Through billions of these predictions during training, the develops a deep representation of language, including grammar, facts, reasoning , and style.

When generating a , the samples from a probability distribution of likely next tokens, building the output word by word.

What this means for designers and product teams. do not think or reason the way humans do. They generate plausible-sounding based on matching, which is why they can produce confident but incorrect answers.

The quality of an LLM's output is heavily influenced by how it is prompted. Vague or ambiguous inputs produce inconsistent outputs.

have a knowledge cutoff — they do not know about events that occurred after their training was completed unless supplemented with tools like RAG.

What to look for

Focus on:

Output consistency — whether the model produces reliable responses to similar inputs
Hallucination risk — where the model might generate plausible but incorrect information
Prompt sensitivity — how much the output changes based on small changes to the input
Knowledge boundaries — what the model does and does not know
Tone and style — whether the model's default output matches your product's voice

Where it goes wrong

Most issues come from: Treating an LLM like a database that retrieves facts will to disappointment.

Expecting the model to be accurate when it has no grounded knowledge source
Vague or under-specified prompts producing inconsistent outputs
No testing across edge cases and unusual inputs
Ignoring the model's knowledge cutoff when accuracy matters
Assuming outputs will be consistent without proper evaluation

What you get from it

Understanding gives you:

A foundational understanding of how most AI features work
Better ability to brief engineers and evaluate AI tools
Clearer expectations about what AI can and cannot reliably do
A basis for designing AI features with appropriate safeguards

Key takeaway

LLMs are not magic — they are statistical language machines. The more clearly you understand that, the better the AI features you will design around them.

FAQ

Common questions

A few practical answers to the questions that usually come up around this method.

What is a large language model in simple terms?

An LLM is an AI trained on a huge amount of text to understand and generate human language. It works by predicting what words should come next in a sequence, which allows it to write, summarise, translate, answer questions, and hold conversations.

How is an LLM different from a search engine?

A engine retrieves existing documents that match a query. An LLM generates new text based on it learned during training. Search returns what exists — generate what seems most plausible, which is why they can produce incorrect information with confidence.

What are the most well-known LLMs?

GPT-4 and GPT-4o from OpenAI, Claude from Anthropic, Gemini from Google, and Llama from Meta are among the most widely used. Most AI products you interact with are built on top of one of these .

Why do LLMs sometimes get things wrong?

Because they generate based on matching, not retrieval of verified facts. If the contained errors, biases, or gaps, those will show up in the outputs. This is why hallucinations happen — the model produces a plausible-sounding answer even when it does not have reliable information to draw from.

Do I need to understand how LLMs work technically?

Not in depth. But understanding that they generate language probabilistically, that they have knowledge cutoffs, and that their depends heavily on how they are prompted will help you make better design and product decisions.

Quick take

If you are designing AI-powered products, understanding what an LLM is and how it works is a starting point for everything else.

Related Services

LET'S WORK TOGETHER

Ready to improve your product?

UX, research and product leadership for teams tackling complex digital services. The work usually starts where things have become harder than they need to be: unclear journeys, inconsistent products, competing priorities, or teams trying to move forward without a clear direction. I help simplify the problem, shape the right next step, and turn complexity into something people can actually use.

Previous feedback

Will Parkhouse

Senior Content Designer

01/20