Open-weights describes AI models whose trained parameters are publicly downloadable — anyone can run them, fine-tune them, modify them — even if the training data, training code, or licensing terms are not fully open-source. The distinction matters because open-weights is the dominant model for AI distribution outside the closed-API frontier in 2026.
The spectrum of openness in 2026:
- Fully open source — weights, training code, data and recipes all public, with permissive licences. Rare; OLMo from Allen AI is one example.
- Open weights with permissive licence — weights public, Apache 2.0 or MIT licence, no major usage restrictions. Mistral's smaller models, many community fine-tunes.
- Open weights with restricted licence — weights public but licence restricts certain commercial uses, certain large companies, or certain applications. Llama, Mistral Large.
- Closed weights — weights private, accessed only through API. GPT-5, Claude Sonnet 4, Gemini 2.5.
The 2026 leaders in open weights:
- Meta Llama 4 — the most widely deployed open-weight family; permissive licence with restrictions on very large commercial use.
- Mistral — Mixtral, Codestral, Pixtral and several open variants; mixed licensing.
- DeepSeek V3 / R1 — Chinese lab producing frontier-quality open-weight models; significantly cheaper to train and run than Western frontier models.
- Qwen 3 (Alibaba) — strong multilingual open-weight family.
- Cohere Command R+ — commercial-friendly licensing for the medium-tier.
- Microsoft Phi — small efficient models for on-device use.
- Google Gemma — open-weight cousins of Gemini, smaller scale.
- Stability AI / Black Forest Labs — image generation open weights (Stable Diffusion, FLUX).
Why open weights matter strategically:
- Cost at scale — running your own deployment of Llama 4 70B is dramatically cheaper than equivalent quality from a frontier API at high volume.
- Data residency and privacy — when data cannot leave your infrastructure (regulated industries, government, sovereign clouds), self-hosted open weights are often the only option.
- Customisation — fine-tune freely without provider approval; build domain-specialised models.
- Independence from any single provider — multi-vendor strategies are easier when one option is "we can host it ourselves".
- Research and education — open weights are the foundation of academic AI research and most public alignment work.
The trade-offs:
- Quality gap to frontier — open weights generally trail closed frontier by 6–18 months on the hardest benchmarks, though the gap has narrowed dramatically.
- Operational burden — you have to host, monitor, scale and update. Hosted-open-weights providers (Together AI, Fireworks, Groq, Replicate) handle this for a fee.
- Safety responsibility shifts to deployer — once weights are public, no central provider applies safety filters. You own that.
- Misuse risk — open weights enable bad actors as well as good ones; an active and unresolved policy debate.
For a US team in 2026, the realistic strategy is multi-vendor with both closed APIs (for prototyping, hardest tasks, lowest operational friction) and open weights (for high-volume production, sensitive data, cost-controlled deployments). The open-weights ecosystem has matured into a credible production foundation, not just a research curiosity.