Rag | knowledged.to

CompactRAG

Explains CompactRAG, a multi-hop RAG method using offline atomic QA pairs and fixed two-call inference.

Explains graph-based memory for LLM agents, including taxonomy, GAM consolidation, and hybrid retrieval.

Overview of model drift, detection, mitigation, and LLM-specific issues like knowledge staleness and provider drift.

Explains top-k retrieval in RAG, tradeoffs for choosing k, reranking patterns, and similarity thresholds.