Rlhf on knowledged.to

Rlhf on knowledged.tohttps://knowledged.to/tags/rlhf/Recent content in Rlhf on knowledged.toHugoen-usThu, 21 May 2026 21:04:12 +0530Model Drifthttps://knowledged.to/notes/ml/model-drift/Thu, 21 May 2026 15:33:36 +0000https://knowledged.to/notes/ml/model-drift/Overview of model drift, detection, mitigation, and LLM-specific issues like knowledge staleness and provider drift.PPO — Proximal Policy Optimizationhttps://knowledged.to/notes/ml/ppo-proximal-policy-optimization/Tue, 19 May 2026 17:18:44 +0000https://knowledged.to/notes/ml/ppo-proximal-policy-optimization/Overview of PPO, the clipped policy-gradient RL algorithm used in RLHF for InstructGPT and original ChatGPT.GRPO — Group Relative Policy Optimizationhttps://knowledged.to/notes/ml/grpo-group-relative-policy-optimization/Tue, 19 May 2026 17:17:58 +0000https://knowledged.to/notes/ml/grpo-group-relative-policy-optimization/Critic-free RL algorithm that replaces PPO's value model with group-relative rewards for LLM fine-tuning.LLM as Judgehttps://knowledged.to/ai/concepts/llm-as-judge/Thu, 14 May 2026 10:34:24 +0000https://knowledged.to/ai/concepts/llm-as-judge/Using a language model to evaluate another model's outputs as a scalable proxy for human preference judgments.Fine-Tuning Techniques for LLMshttps://knowledged.to/notes/ml/fine-tuning-techniques/Sat, 25 Apr 2026 15:53:49 +0000https://knowledged.to/notes/ml/fine-tuning-techniques/Comprehensive guide to LLM fine-tuning methods including full, parameter-efficient, and preference-based approaches with modern recipes and tools like LoRA and DPO