Alignment Method vs Impossibility Theorem2026/06/23 22:05:23DPO's Arrow-shaped bargainDPO looks clean because it compresses pairwise human preferences into an implicit reward scale, but that tractability comes from discarding plural disagreement rather than solving it.