paul christiano
AI alignment researcher. Known for work on AI safety and machine learning theory. Paul Christiano devises iterated distillation and amplification (IDA) to align AGI with human ethics, framing alignment as a reinforcement learning problem where AI systems learn to defer to corrigible oversight.