Judge

LLM as Judge

Using a language model to evaluate another model's outputs as a scalable proxy for human preference judgments.

May 14, 2026 · 2 min

© 2026 knowledged.to · Powered by Knowledged, Hugo & PaperMod