Shankar Ponnekanti
•
January 29, 2026
Drawing on years of content moderation experience, this post distills four core lessons—clear, reliable policies; iterative refinement; separating policy from implementation; and human oversight for edge cases—and shows why they matter just as much for evaluating AI systems and agents using LLM-as-a-judge approaches today.