<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Training on knowledged.to</title><link>https://knowledged.to/tags/training/</link><description>Recent content in Training on knowledged.to</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Wed, 24 Jun 2026 06:21:43 +0000</lastBuildDate><atom:link href="https://knowledged.to/tags/training/index.xml" rel="self" type="application/rss+xml"/><item><title>RLVR vs. the Agent Loop: Training-Time vs. Inference-Time</title><link>https://knowledged.to/ai/concepts/rlvr-vs-agent-loop/</link><pubDate>Wed, 24 Jun 2026 06:21:24 +0000</pubDate><guid>https://knowledged.to/ai/concepts/rlvr-vs-agent-loop/</guid><description>Distinguishes RLVR as training-time weight updates from inference-time agent verification loops.</description></item><item><title>The Modern LLM Training Pipeline</title><link>https://knowledged.to/ai/concepts/modern-llm-training-pipeline/</link><pubDate>Wed, 24 Jun 2026 06:05:23 +0000</pubDate><guid>https://knowledged.to/ai/concepts/modern-llm-training-pipeline/</guid><description>Explains the four-stage modern LLM training pipeline from pre-training through verifiable-reward RL.</description></item><item><title>Where RL Fits: Training vs. Inference in the LLM Pipeline</title><link>https://knowledged.to/ai/concepts/where-rl-fits-training-vs-inference-llm-pipeline/</link><pubDate>Wed, 24 Jun 2026 05:43:38 +0000</pubDate><guid>https://knowledged.to/ai/concepts/where-rl-fits-training-vs-inference-llm-pipeline/</guid><description>Explains that RL in LLMs is a training/alignment stage, not inference, with pipeline context.</description></item></channel></rss>