Vector Databases
Vector databases specialized for similarity search, RAG (Retrieval-Augmented Generation) pipelines, and AI-powered applications.Available Services
Qdrant
Port: 6333 (REST), 6334 (gRPC) | Memory: 512 MB | Maturity: StableHigh-performance vector similarity search engine for building RAG pipelines, semantic search, and AI-powered recommendation systems.Features:
- Fast vector search
- Filtering and payload
- HNSW algorithm
- Quantization support
- Distributed mode
- Rust-based performance
- Skill:
qdrant-memory - Environment:
QDRANT_HOST,QDRANT_PORT
ChromaDB
Port: 8100 | Memory: 512 MB | Maturity: StableOpen-source AI-native vector database with simple APIs for storing, searching, and filtering vectors.Features:
- Easy-to-use API
- Multiple embedding models
- Metadata filtering
- Auto-embedding
- Python and JavaScript clients
- Lightweight
- Environment:
CHROMADB_HOST,CHROMADB_PORT
Milvus
Port: 19530 (API), 9091 (Metrics) | Memory: 2048 MB | Maturity: StableOpen-source vector database built for scalable similarity search and AI applications.Features:
- Billion-scale vectors
- Hybrid search
- Multiple index types
- GPU acceleration
- Kubernetes-ready
- Cloud-native architecture
- Environment:
MILVUS_URI
Weaviate
Port: 8082 (REST), 50051 (gRPC) | Memory: 1024 MB | Maturity: StableCloud-native vector database with built-in vectorization modules, hybrid search, and GraphQL API.Features:
- GraphQL API
- Built-in vectorizers
- Hybrid search (vector + keyword)
- Multi-tenancy
- Replication
- Schema-based
- Environment:
WEAVIATE_HOST,WEAVIATE_PORT
Usage Examples
RAG Pipeline Stack
Research Agent Preset
Knowledge Base Stack
Vector Database Comparison
| Database | Performance | Scalability | API Style | Hybrid Search | Memory |
|---|---|---|---|---|---|
| Qdrant | Excellent | Good | REST/gRPC | ✅ | 512 MB |
| ChromaDB | Good | Moderate | REST | ✅ | 512 MB |
| Milvus | Excellent | Excellent | REST/gRPC | ✅ | 2048 MB |
| Weaviate | Excellent | Excellent | GraphQL/REST | ✅ | 1024 MB |
RAG Architecture Patterns
Basic RAG
Advanced RAG with Reranking
Multi-Modal RAG
Embedding Models
Popular Embedding Models
| Model | Dimensions | Use Case | Provider |
|---|---|---|---|
| text-embedding-3-small | 1536 | General purpose | OpenAI |
| text-embedding-3-large | 3072 | High accuracy | OpenAI |
| all-MiniLM-L6-v2 | 384 | Fast, local | Sentence Transformers |
| BAAI/bge-large-en | 1024 | English text | Open source |
| intfloat/e5-large | 1024 | Multi-lingual | Open source |
Local Embedding with Ollama
Collection Management
Qdrant Collections
ChromaDB Collections
Optimization Tips
Qdrant Optimization
- Index Type: Use HNSW for speed, quantization for memory
- Payload: Store minimal metadata for better performance
- Filtering: Use indexed payload fields for fast filtering
- Batch Operations: Insert vectors in batches
- Memory: Allocate sufficient RAM for index
ChromaDB Optimization
- Embedding Function: Choose appropriate embedding model
- Distance Metric: Use cosine similarity for most cases
- Persistence: Enable persistence for production
- Batch Size: Process documents in batches
- Metadata: Keep metadata small and indexed
Milvus Optimization
- Index Selection: Choose IVF_FLAT, IVF_SQ8, or HNSW
- Segmentation: Configure segment size appropriately
- Resource Groups: Allocate resources per workload
- GPU Acceleration: Use GPU for large-scale search
- Sharding: Distribute data across shards
Use Cases
Semantic Search
Question Answering
Recommendation Systems
Document Chat
Image Search
Integration Examples
Qdrant + Ollama + Open WebUI
ChromaDB + Dify
Milvus + LiteLLM + Flowise
Monitoring and Maintenance
Health Checks
Metrics
Backups
Performance Benchmarks
Query Latency (approximate)
| Database | 1K vectors | 100K vectors | 1M vectors |
|---|---|---|---|
| Qdrant | <1ms | 1-5ms | 5-20ms |
| ChromaDB | <1ms | 5-10ms | 20-50ms |
| Milvus | <1ms | 1-5ms | 5-15ms |
| Weaviate | <1ms | 5-10ms | 10-30ms |
Throughput (queries/sec)
| Database | Single Node | Distributed |
|---|---|---|
| Qdrant | 1000+ | 10000+ |
| ChromaDB | 500+ | N/A |
| Milvus | 2000+ | 20000+ |
| Weaviate | 1000+ | 10000+ |