Vector Database: Definition & Meaning | AI Glossary

A vector database is a specialised database optimised for storing and searching high-dimensional vectors — typically embeddings of text, images or other content — by similarity. Where a traditional database finds rows by exact match on a column, a vector database finds rows whose vectors are closest to a query vector under cosine similarity, dot product or Euclidean distance. It is the storage layer beneath retrieval-augmented generation, semantic search, recommendations and most other AI features that lean on embeddings.

The 2026 vector database landscape:

pgvector — Postgres extension; the default for teams already running Postgres. Production-ready up to tens of millions of vectors with proper indexes.
Pinecone — managed cloud vector DB, scales to billions of vectors with ease.
Qdrant — open-source, self-hostable, strong on metadata filtering.
Weaviate — open-source, hybrid search and rich schema support.
Chroma — embedded vector DB; popular for local development and small-to-medium projects.
Milvus — open-source, designed for very large scale.
Turbopuffer — newer, serverless, optimised for cost at scale.
Cloud-native options — AWS OpenSearch with vector support, Azure AI Search, Google Vertex AI Matching Engine.

The core operations:

Upsert — add a vector with associated metadata.
Query — find the top-k vectors most similar to a query vector.
Filter — query with metadata constraints ("similar to X but only from documents tagged Y").
Hybrid search — combine vector similarity with keyword (BM25) ranking; almost always more accurate than pure vector search.
Delete and reindex — manage the lifecycle as documents change.

The architectural decisions for a US team in 2026:

Already on Postgres? Start with pgvector. No new infrastructure, transactional guarantees, joins with the rest of your data. Production-proven up to tens of millions of vectors.
Need to scale beyond pgvector? Pinecone (managed) or Qdrant (self-hosted) are the safe defaults.
Strict cost control at very high scale? Turbopuffer or self-hosted Milvus.
Need rich filtering and schema? Weaviate or Qdrant lead here.
Multi-modal (text + image embeddings together)? Most modern vector DBs handle this; the question is your embedding model strategy.

The operational concerns:

Index parameters matter — HNSW vs IVF, m and ef parameters; the defaults are usually fine but tuning can improve recall and latency significantly.
Embedding model versioning — when you upgrade your embedding model, every existing vector needs re-embedding. Plan for it.
Per-tenant isolation — multi-tenant SaaS needs strict isolation; namespaces in Pinecone, separate collections in Qdrant.
Cost growth — vector storage is more expensive than blob storage; tens of millions of vectors at production query volume can run into thousands per month.

For a US team building anything that uses embeddings (RAG, semantic search, recommendation, deduplication, classification), a vector database is core infrastructure. The category has matured to the point where almost any choice works for typical workloads — the operational care goes into your embedding strategy, your retrieval pipeline and your evaluation harness rather than into the vector DB itself.

Related terms