Open-weight models
Open-weight models are AI models where the trained parameters (weights) are made publicly available so others can download, run, and often fine-tune them locally.
The core idea (plain terms)
When an AI model is trained, it learns billions (or trillions) of numbers—these are its weights.
An open-weight model gives you access to those numbers.
That means you can:
- Run the model on your own machine or server
- Fine-tune it with your own data
- Inspect or modify how it behaves (to some extent)
How this differs from other terms
1. Open-weight vs Closed models
- Open-weight: You get the weights
- Example: LLaMA 2, Mistral 7B
- Closed model: You only get API access
- Example: GPT-4
With closed models, you use them—but you don’t own or inspect them.
2. Open-weight vs Open-source
This is where people get sloppy—don’t.
- Open-weight → weights are available
- Open-source → weights plus training data, code, and full transparency
Most “open” AI models today are actually open-weight, not fully open-source.
Why open-weight models matter
They give you control and flexibility:
- Run offline (privacy-sensitive use cases)
- Lower cost at scale (no API fees)
- Customize behavior deeply (fine-tuning, LoRA, etc.)
- Experiment freely (research, tooling, edge deployments)
This is why models like Gemma are popular with developers.
The catch (don’t ignore this)
Open-weight doesn’t automatically mean:
- Fully transparent
- Free for all use cases (licenses can restrict usage)
- Easy to run (hardware requirements can be heavy)
So before jumping in, always check:
- License terms
- Model size vs your hardware
- Ecosystem support (e.g., MLX, Ollama, etc.)
Simple analogy
- Closed model = Streaming on Netflix
- Open-weight model = Downloading the movie file
- Open-source model = Getting the movie plus the script, raw footage, and editing tools
If you’re planning to use something like Gemma on your Mac (e.g., with MLX), open-weight models are exactly what make that possible.