Infrastructure & Ethics

Vector Database

A specialised database optimised for storing and searching high-dimensional vectors by similarity.

In common use since 2018

A vector database is a specialised database optimised for storing and searching high-dimensional vectors — typically embeddings of text, images or other content — by similarity. Where a traditional database finds rows by exact match on a column, a vector database finds rows whose vectors are closest to a query vector under cosine similarity, dot product or Euclidean distance. It is the storage layer beneath retrieval-augmented generation, semantic search, recommendations and most other AI features that lean on embeddings.

The 2026 vector database landscape:

  • pgvector — Postgres extension; the default for teams already running Postgres. Production-ready up to tens of millions of vectors with proper indexes.
  • Pinecone — managed cloud vector DB, scales to billions of vectors with ease.
  • Qdrant — open-source, self-hostable, strong on metadata filtering.
  • Weaviate — open-source, hybrid search and rich schema support.
  • Chroma — embedded vector DB; popular for local development and small-to-medium projects.
  • Milvus — open-source, designed for very large scale.
  • Turbopuffer — newer, serverless, optimised for cost at scale.
  • Cloud-native options — AWS OpenSearch with vector support, Azure AI Search, Google Vertex AI Matching Engine.

The core operations:

  • Upsert — add a vector with associated metadata.
  • Query — find the top-k vectors most similar to a query vector.
  • Filter — query with metadata constraints ("similar to X but only from documents tagged Y").
  • Hybrid search — combine vector similarity with keyword (BM25) ranking; almost always more accurate than pure vector search.
  • Delete and reindex — manage the lifecycle as documents change.

The architectural decisions for a US team in 2026:

  • Already on Postgres? Start with pgvector. No new infrastructure, transactional guarantees, joins with the rest of your data. Production-proven up to tens of millions of vectors.
  • Need to scale beyond pgvector? Pinecone (managed) or Qdrant (self-hosted) are the safe defaults.
  • Strict cost control at very high scale? Turbopuffer or self-hosted Milvus.
  • Need rich filtering and schema? Weaviate or Qdrant lead here.
  • Multi-modal (text + image embeddings together)? Most modern vector DBs handle this; the question is your embedding model strategy.

The operational concerns:

  • Index parameters matter — HNSW vs IVF, m and ef parameters; the defaults are usually fine but tuning can improve recall and latency significantly.
  • Embedding model versioning — when you upgrade your embedding model, every existing vector needs re-embedding. Plan for it.
  • Per-tenant isolation — multi-tenant SaaS needs strict isolation; namespaces in Pinecone, separate collections in Qdrant.
  • Cost growth — vector storage is more expensive than blob storage; tens of millions of vectors at production query volume can run into thousands per month.

For a US team building anything that uses embeddings (RAG, semantic search, recommendation, deduplication, classification), a vector database is core infrastructure. The category has matured to the point where almost any choice works for typical workloads — the operational care goes into your embedding strategy, your retrieval pipeline and your evaluation harness rather than into the vector DB itself.

Keep exploring

Looking for something else? The full glossary covers 120+ AI terms updated for 2026.

Open the glossary
Chat on WhatsApp