Learn

Fine-tuning: further training on your own data

Fine-tuning is the practice of taking an already-trained model and training it further on a smaller, targeted set of examples to change how it behaves. It bakes new patterns into the weights themselves, unlike retrieval or prompting, which feed information at request time. It costs compute, data preparation, and the risk of degrading abilities the base model already had.

At a glance

What it is
Extra training of a base model on your own examples
What it changes
The weights, so behaviour shifts without a longer prompt
Common light form
Adapters such as LoRA, which train a small add-on
The risk
Overfitting and losing skills the base model had

What is fine-tuning?

Fine-tuning starts from a model that has already been trained on a large general corpus and trains it further on a smaller set of examples you provide. The point is to shift behaviour: a house style, a stricter output format, a domain the base model handles only roughly. Because the change goes into the weights, the model carries the new behaviour without you having to spell it out in every prompt. That is the appeal, and also the catch, since changing the weights can quietly erode skills the base model already had.

How is it different from retrieval?

Fine-tuning and retrieval solve different problems and people mix them up often. Retrieval feeds the model fresh information at request time, so it is the right tool when the facts change: prices, documents, anything you would otherwise have to retrain to update. Fine-tuning bakes patterns in, which suits stable things like style and format but ages badly for facts. A good rule: if the answer is “the model does not know X”, reach for retrieval; if it is “the model knows X but keeps phrasing it wrong”, consider fine-tuning.

Do you actually need it?

Usually less often than it feels. A clearer prompt or a few worked examples in the context fixes many problems people reach for fine-tuning to solve, with no training run and nothing to maintain. When you do fine-tune, lighter adapters such as low-rank adaptation (LoRA) train a small add-on instead of every weight, which is cheaper and easier to undo. Start with the cheapest tool that works and escalate only when it does not.

Fine-tuning is good for

  • Teaching a consistent style, format, or tone
  • A narrow task the base model keeps getting slightly wrong
  • Cutting a long instruction prompt down by baking rules in
  • A domain with stable patterns and enough clean examples

Fine-tuning is the wrong tool for

  • Adding facts that change; retrieval handles those better
  • A job a better prompt or examples would already fix
  • A tiny or messy dataset, which mostly teaches noise
  • Keeping the model current; weights go stale, a store does not

Related terms

← All terms Reviewed: June 2026