Infrastructure & Ethics

Open Weights

Models whose trained parameters are publicly downloadable, even if the training data and code are not.

In common use since 2023

Open-weights describes AI models whose trained parameters are publicly downloadable — anyone can run them, fine-tune them, modify them — even if the training data, training code, or licensing terms are not fully open-source. The distinction matters because open-weights is the dominant model for AI distribution outside the closed-API frontier in 2026.

The spectrum of openness in 2026:

  • Fully open source — weights, training code, data and recipes all public, with permissive licences. Rare; OLMo from Allen AI is one example.
  • Open weights with permissive licence — weights public, Apache 2.0 or MIT licence, no major usage restrictions. Mistral's smaller models, many community fine-tunes.
  • Open weights with restricted licence — weights public but licence restricts certain commercial uses, certain large companies, or certain applications. Llama, Mistral Large.
  • Closed weights — weights private, accessed only through API. GPT-5, Claude Sonnet 4, Gemini 2.5.

The 2026 leaders in open weights:

  • Meta Llama 4 — the most widely deployed open-weight family; permissive licence with restrictions on very large commercial use.
  • Mistral — Mixtral, Codestral, Pixtral and several open variants; mixed licensing.
  • DeepSeek V3 / R1 — Chinese lab producing frontier-quality open-weight models; significantly cheaper to train and run than Western frontier models.
  • Qwen 3 (Alibaba) — strong multilingual open-weight family.
  • Cohere Command R+ — commercial-friendly licensing for the medium-tier.
  • Microsoft Phi — small efficient models for on-device use.
  • Google Gemma — open-weight cousins of Gemini, smaller scale.
  • Stability AI / Black Forest Labs — image generation open weights (Stable Diffusion, FLUX).

Why open weights matter strategically:

  • Cost at scale — running your own deployment of Llama 4 70B is dramatically cheaper than equivalent quality from a frontier API at high volume.
  • Data residency and privacy — when data cannot leave your infrastructure (regulated industries, government, sovereign clouds), self-hosted open weights are often the only option.
  • Customisation — fine-tune freely without provider approval; build domain-specialised models.
  • Independence from any single provider — multi-vendor strategies are easier when one option is "we can host it ourselves".
  • Research and education — open weights are the foundation of academic AI research and most public alignment work.

The trade-offs:

  • Quality gap to frontier — open weights generally trail closed frontier by 6–18 months on the hardest benchmarks, though the gap has narrowed dramatically.
  • Operational burden — you have to host, monitor, scale and update. Hosted-open-weights providers (Together AI, Fireworks, Groq, Replicate) handle this for a fee.
  • Safety responsibility shifts to deployer — once weights are public, no central provider applies safety filters. You own that.
  • Misuse risk — open weights enable bad actors as well as good ones; an active and unresolved policy debate.

For a US team in 2026, the realistic strategy is multi-vendor with both closed APIs (for prototyping, hardest tasks, lowest operational friction) and open weights (for high-volume production, sensitive data, cost-controlled deployments). The open-weights ecosystem has matured into a credible production foundation, not just a research curiosity.

Keep exploring

Looking for something else? The full glossary covers 120+ AI terms updated for 2026.

Open the glossary
Chat on WhatsApp