Setup
Settings.llm globally applies to all LlamaIndex components that use an LLM — query engines, chat engines, agents — without further configuration.
Embeddings
RAG pipeline
Agentic query engine
X-Xenovia-Session-Id to group the full agent run in Traces.
Session tracking
Pass the same session header into both the LLM and embedding clients so the full workflow stays grouped in Traces:Handling policy blocks
When a request is blocked, LlamaIndex propagates the upstream403 as an openai.PermissionDeniedError.