<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Llm on knowledged.to</title><link>https://knowledged.to/tags/llm/</link><description>Recent content in Llm on knowledged.to</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Mon, 25 May 2026 22:48:50 +0530</lastBuildDate><atom:link href="https://knowledged.to/tags/llm/index.xml" rel="self" type="application/rss+xml"/><item><title>LLM Thinking Token Budgets</title><link>https://knowledged.to/notes/ml/llm-thinking-token-budgets/</link><pubDate>Mon, 25 May 2026 17:09:36 +0000</pubDate><guid>https://knowledged.to/notes/ml/llm-thinking-token-budgets/</guid><description>Explains thinking-token budget parameters, provider naming, cost-latency tradeoffs, and completion-cap interactions.</description></item><item><title>LLM Prompt Cache Options Across Providers</title><link>https://knowledged.to/notes/ml/llm-prompt-cache-provider-options/</link><pubDate>Thu, 21 May 2026 16:49:20 +0000</pubDate><guid>https://knowledged.to/notes/ml/llm-prompt-cache-provider-options/</guid><description>Compares prompt/KV cache TTLs, controls, pricing, scope, and strategies across major LLM providers.</description></item><item><title>LLM Prompt Caching: Implicit vs Explicit</title><link>https://knowledged.to/notes/ml/llm-prompt-caching-implicit-vs-explicit/</link><pubDate>Thu, 21 May 2026 16:08:55 +0000</pubDate><guid>https://knowledged.to/notes/ml/llm-prompt-caching-implicit-vs-explicit/</guid><description>Explains implicit vs explicit LLM prompt caching, prefix constraints, provider support, and when to use each.</description></item><item><title>Why LLM Caching Is Only for Input Tokens</title><link>https://knowledged.to/notes/ml/llm-caching-input-tokens/</link><pubDate>Thu, 21 May 2026 15:43:26 +0000</pubDate><guid>https://knowledged.to/notes/ml/llm-caching-input-tokens/</guid><description>Explains why LLM prompt caching applies to reusable input-token prefill, not sequential output decoding.</description></item><item><title>Model Drift</title><link>https://knowledged.to/notes/ml/model-drift/</link><pubDate>Thu, 21 May 2026 15:33:36 +0000</pubDate><guid>https://knowledged.to/notes/ml/model-drift/</guid><description>Overview of model drift, detection, mitigation, and LLM-specific issues like knowledge staleness and provider drift.</description></item><item><title>Tool-DC Strategic Anchor Grouping — Web Search Example</title><link>https://knowledged.to/notes/ml/tool-dc-strategic-anchor-grouping-example/</link><pubDate>Tue, 19 May 2026 06:12:48 +0000</pubDate><guid>https://knowledged.to/notes/ml/tool-dc-strategic-anchor-grouping-example/</guid><description>Concrete web-search example showing how Tool-DC strategic anchor grouping reduces schema-confusion in tool calls.</description></item><item><title>AgentFlow</title><link>https://knowledged.to/notes/ml/agentflow/</link><pubDate>Tue, 19 May 2026 05:08:59 +0000</pubDate><guid>https://knowledged.to/notes/ml/agentflow/</guid><description>Overview of AgentFlow, an agent architecture that trains a planner with Flow-GRPO for multi-turn tool use.</description></item><item><title>Tool-DC Framework</title><link>https://knowledged.to/notes/ml/tool-dc-framework/</link><pubDate>Tue, 19 May 2026 03:53:48 +0000</pubDate><guid>https://knowledged.to/notes/ml/tool-dc-framework/</guid><description>Overview of Tool-DC, a try-check-retry framework for robust long-context tool-calling with large tool registries.</description></item><item><title>Top-K in RAG Search</title><link>https://knowledged.to/notes/ml/top-k-in-rag-search/</link><pubDate>Mon, 18 May 2026 16:11:28 +0000</pubDate><guid>https://knowledged.to/notes/ml/top-k-in-rag-search/</guid><description>Explains top-k retrieval in RAG, tradeoffs for choosing k, reranking patterns, and similarity thresholds.</description></item><item><title>SWE-bench &amp; SWE-bench Pro Explained</title><link>https://knowledged.to/ai/benchmarks/swe-bench/</link><pubDate>Sat, 16 May 2026 16:40:54 +0000</pubDate><guid>https://knowledged.to/ai/benchmarks/swe-bench/</guid><description>Overview of SWE-bench and SWE-bench Pro, the real-world GitHub issue fixing benchmarks used to evaluate AI coding ability.</description></item><item><title>LLM as Judge</title><link>https://knowledged.to/ai/concepts/llm-as-judge/</link><pubDate>Thu, 14 May 2026 10:34:24 +0000</pubDate><guid>https://knowledged.to/ai/concepts/llm-as-judge/</guid><description>Using a language model to evaluate another model&amp;#39;s outputs as a scalable proxy for human preference judgments.</description></item><item><title>Fine-Tuning Techniques for LLMs</title><link>https://knowledged.to/notes/ml/fine-tuning-techniques/</link><pubDate>Sat, 25 Apr 2026 15:53:49 +0000</pubDate><guid>https://knowledged.to/notes/ml/fine-tuning-techniques/</guid><description>Comprehensive guide to LLM fine-tuning methods including full, parameter-efficient, and preference-based approaches with modern recipes and tools like LoRA and DPO</description></item><item><title>Deterministic Graders (for LLM / AI Evaluation)</title><link>https://knowledged.to/ai/concepts/deterministic-graders/</link><pubDate>Fri, 24 Apr 2026 17:18:23 +0000</pubDate><guid>https://knowledged.to/ai/concepts/deterministic-graders/</guid><description>Definition and best practices for deterministic grading in LLM evaluation using code-based rules instead of model-in-the-loop judgment.</description></item><item><title>Chain of Thought (CoT)</title><link>https://knowledged.to/notes/ml/chain-of-thought/</link><pubDate>Thu, 23 Apr 2026 15:53:32 +0000</pubDate><guid>https://knowledged.to/notes/ml/chain-of-thought/</guid><description>Prompting technique where an AI model is guided — or learns — to reason through a problem step by step before arriving at the final answer, rather than jumping straight to the conclusion.</description></item><item><title>Multi-Turn Conversation in AI</title><link>https://knowledged.to/ai/concepts/multi-turn-conversation/</link><pubDate>Tue, 21 Apr 2026 15:13:14 +0000</pubDate><guid>https://knowledged.to/ai/concepts/multi-turn-conversation/</guid><description>Explains how AI models maintain context across multiple exchanges using conversation history injection rather than internal memory.</description></item></channel></rss>