Popular Embedding Providers
OpenAI
text-embedding-3-small and text-embedding-3-large
Cohere
Embed-v3 with multilingual support
Vertex AI and Generative AI embeddings
Nomic
Open-source embedding models
OpenAI Embeddings
OpenAI provides state-of-the-art embedding models with excellent performance and cost-efficiency.Installation
Usage
Batch Embeddings
Model Options
Cohere Embeddings
Cohere’s embedding models support multiple languages and task-specific optimizations.Installation
Usage
Input Types
Cohere embeddings support different input types for optimization:Google Embeddings
Google offers embedding models through both Vertex AI and Generative AI APIs.Installation
Generative AI (API Key)
Vertex AI
Nomic Embeddings
Nomic provides open-source embedding models that can run locally or via their API.Installation
Usage
Additional Providers
AWS Bedrock
@langchain/aws - Titan and other embedding modelsAzure OpenAI
@langchain/openai - OpenAI embeddings via AzureOllama
@langchain/ollama - Local embedding modelsMixedbread AI
@langchain/mixedbread-ai - High-quality embeddingsCommunity Integrations
Additional embedding providers are available in@langchain/community:
HuggingFace Example
Local Embeddings
For privacy-sensitive applications, you can run embeddings locally:Ollama
Transformers.js (Browser/Node)
Common Patterns
Similarity Search
Caching for Performance
Best Practices
- Choose the right model: Balance cost, quality, and dimension size
- Batch when possible: Embed multiple documents in one call for efficiency
- Cache embeddings: Store embeddings to avoid redundant API calls
- Normalize inputs: Consistent text preprocessing improves quality
- Monitor costs: Track embedding generation for budget management
- Consider local models: For privacy or high-volume applications
Dimension Comparison
| Provider | Model | Dimensions | Use Case |
|---|---|---|---|
| OpenAI | text-embedding-3-small | 1536 | General purpose, cost-effective |
| OpenAI | text-embedding-3-large | 3072 | High quality, detailed semantics |
| Cohere | embed-english-v3.0 | 1024 | Multilingual, task-specific |
| text-embedding-004 | 768 | Google ecosystem integration | |
| Nomic | nomic-embed-text-v1.5 | 768 | Open-source, local deployment |
Next Steps
Vector Stores
Store and search embeddings efficiently
Document Loaders
Load documents for embedding
Retrieval
Build RAG applications with embeddings
Chat Models
Combine embeddings with LLMs
