Embedding

A numerical vector representation of text, images, or other data that captures semantic meaning: items with similar meaning map to vectors that lie close together. Embeddings are the foundation of RAG (retrieval-augmented generation), semantic search, and recommendation systems. Embedding models are typically small and fast compared with LLMs, and they run comfortably on almost any modern hardware, including CPUs. The hardware bottleneck in embedding workflows is usually the LLM that processes retrieved results, not the embedding step itself.
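To make the idea of semantic search over embeddings concrete, here is a minimal sketch in Python. It assumes cosine similarity as the comparison metric, which is a common choice but not the only one. The three-dimensional vectors and the names `TOY_EMBEDDINGS` and `rank_by_similarity` are invented for illustration; a real embedding model would produce vectors with hundreds or thousands of dimensions.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical hand-written embeddings, NOT the output of a real model.
TOY_EMBEDDINGS = {
    "how to train a neural network": [0.9, 0.1, 0.0],
    "gpu memory for deep learning": [0.7, 0.3, 0.1],
    "best pasta recipes": [0.0, 0.1, 0.9],
}


def rank_by_similarity(query_vector, embeddings):
    """Sort stored texts from most to least similar to the query vector."""
    scored = [
        (text, cosine_similarity(query_vector, vector))
        for text, vector in embeddings.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical embedding of a query about machine learning.
query_vector = [0.8, 0.2, 0.0]
ranking = rank_by_similarity(query_vector, TOY_EMBEDDINGS)

for text, score in ranking:
    print(f"{score:.3f}  {text}")
```

In a RAG pipeline, the top-ranked texts from a step like this would be passed to the LLM as context, which is where most of the compute cost tends to fall.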

