Grpo on knowledged.to

Grpo on knowledged.tohttps://knowledged.to/tags/grpo/Recent content in Grpo on knowledged.toHugoen-usTue, 19 May 2026 22:48:31 +0530GRPO — Group Relative Policy Optimizationhttps://knowledged.to/notes/ml/grpo-group-relative-policy-optimization/Tue, 19 May 2026 17:17:58 +0000https://knowledged.to/notes/ml/grpo-group-relative-policy-optimization/Critic-free RL algorithm that replaces PPO's value model with group-relative rewards for LLM fine-tuning.Fine-Tuning Techniques for LLMshttps://knowledged.to/notes/ml/fine-tuning-techniques/Sat, 25 Apr 2026 15:53:49 +0000https://knowledged.to/notes/ml/fine-tuning-techniques/Comprehensive guide to LLM fine-tuning methods including full, parameter-efficient, and preference-based approaches with modern recipes and tools like LoRA and DPO