Notes

RLHF and DPO: Aligning AI to Human Preferences

Comparison of RLHF and DPO alignment techniques, covering their pipelines, strengths, weaknesses, and where each is used in practice

Overview of instruction tuning, how it works, dataset construction, and variants like RLHF, RLAIF, and DPO

Explanation of perplexity as a language model evaluation metric, including the formula, intuition, caveats, and relationship to cross-entropy loss

Overview of model quantization techniques, precision levels, and trade-offs for reducing neural network memory and improving inference speed

Quick reference for common gcloud commands including authentication, project setup, and GKE cluster configuration

Quick reference for kubectl port-forward command syntax