Embedding
A numerical vector representation of text, images, or other data that captures semantic meaning: items with similar meaning map to vectors that lie close together. Embeddings are the foundation of RAG (retrieval-augmented generation), semantic search, and recommendation systems. Embedding models are typically much smaller and faster than LLMs, and they run comfortably on almost any modern hardware, including CPUs. In a RAG workflow, the hardware bottleneck is usually the LLM that processes the retrieved results, not the embedding step itself.
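As a rough illustration of how embeddings support semantic search, the sketch below compares vectors by cosine similarity and returns the closest match. The three-dimensional vectors are hand-written, hypothetical values chosen for the example, not the output of any real embedding model, and the names `toy_embeddings` and `nearest` are illustrative only. Real models produce vectors with hundreds or thousands of dimensions, and cosine similarity is one common choice of measure rather than the only one.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical embeddings, hand-written for illustration (assumption:
# a real embedding model would generate these from the input text).
toy_embeddings = {
    "cat": [0.9, 0.1, 0.0],
    "kitten": [0.8, 0.2, 0.1],
    "invoice": [0.0, 0.1, 0.9],
}


def nearest(query, embeddings):
    """Return the key, other than the query, with the most similar vector."""
    q = embeddings[query]
    scored = [
        (cosine_similarity(q, vec), key)
        for key, vec in embeddings.items()
        if key != query
    ]
    return max(scored)[1]


print(nearest("cat", toy_embeddings))  # prints "kitten"
```

The search itself is only a handful of multiplications and additions per stored vector, which is one reason the embedding side of such a workflow tends to be cheap compared with the LLM that reads the retrieved results.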