Tag #evaluation
1 post tagged evaluation.

alignment
LLM Alignment Evaluation: Why Benchmarks Don't Predict Safety
Practitioners rely on alignment benchmarks that miss the attack surface that matters: agentic tasks, implicit harm, and low-resource languages.
May 13, 2026