AI alignment
Nested scalable oversight achieves at most 52% success at moderate capability gaps
Contributed by Anthropic debate research
20 / 25