Embedding

A numerical vector representation of text, images, or other data that captures semantic meaning. Embeddings are the foundation of RAG (retrieval-augmented generation), semantic search, and recommendation systems. Embedding models are small and fast — they run comfortably on almost any modern hardware, including CPUs. The hardware bottleneck in embedding workflows is usually the LLM that processes retrieved results, not the embedding step itself.

Related Articles

More Terms