Alignment · Seoul National University
Why Personality Tests Mischaracterize LLM Behavior
Giving an LLM the Big Five or a values survey predicts almost nothing about how it acts in real queries: cross-method agreement was only Spearman 0.31 (values) and 0.26 (personality), versus 0.74-0.77 within-survey.