Text embeddings for RAG and search - Python, Ollama, OpenAI-compatible APIs

Learn what text embeddings are, how they power RAG and semantic search, and how to call embedding APIs from Python using Ollama or an OpenAI-compatible server (for example llama.cpp). Includes persistence, retrieval, and links to chunking, vector stores, and reranking on this site.

Text embeddings for RAG and search - Python, Ollama, OpenAI-compatible APIs

Comments

Popular posts from this blog

Gitflow Workflow overview

UV - a New Python Package Project and Environment Manager. Here we provide it's short description, performance statistics, how to install it and it's main commands