Llm on knowledged.to

Llm on knowledged.tohttps://knowledged.to/tags/llm/Recent content in Llm on knowledged.toHugoen-usMon, 25 May 2026 22:48:50 +0530LLM Thinking Token Budgetshttps://knowledged.to/notes/ml/llm-thinking-token-budgets/Mon, 25 May 2026 17:09:36 +0000https://knowledged.to/notes/ml/llm-thinking-token-budgets/Explains thinking-token budget parameters, provider naming, cost-latency tradeoffs, and completion-cap interactions.LLM Prompt Cache Options Across Providershttps://knowledged.to/notes/ml/llm-prompt-cache-provider-options/Thu, 21 May 2026 16:49:20 +0000https://knowledged.to/notes/ml/llm-prompt-cache-provider-options/Compares prompt/KV cache TTLs, controls, pricing, scope, and strategies across major LLM providers.LLM Prompt Caching: Implicit vs Explicithttps://knowledged.to/notes/ml/llm-prompt-caching-implicit-vs-explicit/Thu, 21 May 2026 16:08:55 +0000https://knowledged.to/notes/ml/llm-prompt-caching-implicit-vs-explicit/Explains implicit vs explicit LLM prompt caching, prefix constraints, provider support, and when to use each.Why LLM Caching Is Only for Input Tokenshttps://knowledged.to/notes/ml/llm-caching-input-tokens/Thu, 21 May 2026 15:43:26 +0000https://knowledged.to/notes/ml/llm-caching-input-tokens/Explains why LLM prompt caching applies to reusable input-token prefill, not sequential output decoding.Model Drifthttps://knowledged.to/notes/ml/model-drift/Thu, 21 May 2026 15:33:36 +0000https://knowledged.to/notes/ml/model-drift/Overview of model drift, detection, mitigation, and LLM-specific issues like knowledge staleness and provider drift.Tool-DC Strategic Anchor Grouping — Web Search Examplehttps://knowledged.to/notes/ml/tool-dc-strategic-anchor-grouping-example/Tue, 19 May 2026 06:12:48 +0000https://knowledged.to/notes/ml/tool-dc-strategic-anchor-grouping-example/Concrete web-search example showing how Tool-DC strategic anchor grouping reduces schema-confusion in tool calls.AgentFlowhttps://knowledged.to/notes/ml/agentflow/Tue, 19 May 2026 05:08:59 +0000https://knowledged.to/notes/ml/agentflow/Overview of AgentFlow, an agent architecture that trains a planner with Flow-GRPO for multi-turn tool use.Tool-DC Frameworkhttps://knowledged.to/notes/ml/tool-dc-framework/Tue, 19 May 2026 03:53:48 +0000https://knowledged.to/notes/ml/tool-dc-framework/Overview of Tool-DC, a try-check-retry framework for robust long-context tool-calling with large tool registries.Top-K in RAG Searchhttps://knowledged.to/notes/ml/top-k-in-rag-search/Mon, 18 May 2026 16:11:28 +0000https://knowledged.to/notes/ml/top-k-in-rag-search/Explains top-k retrieval in RAG, tradeoffs for choosing k, reranking patterns, and similarity thresholds.SWE-bench & SWE-bench Pro Explainedhttps://knowledged.to/ai/benchmarks/swe-bench/Sat, 16 May 2026 16:40:54 +0000https://knowledged.to/ai/benchmarks/swe-bench/Overview of SWE-bench and SWE-bench Pro, the real-world GitHub issue fixing benchmarks used to evaluate AI coding ability.LLM as Judgehttps://knowledged.to/ai/concepts/llm-as-judge/Thu, 14 May 2026 10:34:24 +0000https://knowledged.to/ai/concepts/llm-as-judge/Using a language model to evaluate another model's outputs as a scalable proxy for human preference judgments.Fine-Tuning Techniques for LLMshttps://knowledged.to/notes/ml/fine-tuning-techniques/Sat, 25 Apr 2026 15:53:49 +0000https://knowledged.to/notes/ml/fine-tuning-techniques/Comprehensive guide to LLM fine-tuning methods including full, parameter-efficient, and preference-based approaches with modern recipes and tools like LoRA and DPODeterministic Graders (for LLM / AI Evaluation)https://knowledged.to/ai/concepts/deterministic-graders/Fri, 24 Apr 2026 17:18:23 +0000https://knowledged.to/ai/concepts/deterministic-graders/Definition and best practices for deterministic grading in LLM evaluation using code-based rules instead of model-in-the-loop judgment.Chain of Thought (CoT)https://knowledged.to/notes/ml/chain-of-thought/Thu, 23 Apr 2026 15:53:32 +0000https://knowledged.to/notes/ml/chain-of-thought/Prompting technique where an AI model is guided — or learns — to reason through a problem step by step before arriving at the final answer, rather than jumping straight to the conclusion.Multi-Turn Conversation in AIhttps://knowledged.to/ai/concepts/multi-turn-conversation/Tue, 21 Apr 2026 15:13:14 +0000https://knowledged.to/ai/concepts/multi-turn-conversation/Explains how AI models maintain context across multiple exchanges using conversation history injection rather than internal memory.