Stuart Russell

AI researcher at UC Berkeley. Author of "Human Compatible", which proposes a radical reformulation of AI objectives centered on uncertainty about human preferences. His "provably beneficial AI" framework requires machines to remain uncertain about their goals while learning values from human behavior, which makes power-seeking instrumentally irrational.
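
The claim that uncertainty about goals makes power-seeking irrational can be illustrated with a toy calculation. The sketch below is not Russell's formal model; it assumes a single proposed action whose utility to the human is unknown to the machine, a Gaussian belief over that utility, and a human who vetoes the action exactly when its true utility is negative. The function name `expected_utilities` and all numbers are hypothetical, chosen only for illustration.

```python
import random


def expected_utilities(samples):
    """Compare three policies given samples from the machine's belief
    about the human's utility u for one proposed action.

    Returns (act, switch_off, defer):
      act        - go ahead regardless of the human: E[u]
      switch_off - do nothing: 0
      defer      - let the human veto when u < 0: E[max(u, 0)]
    """
    n = len(samples)
    act = sum(samples) / n
    switch_off = 0.0
    defer = sum(max(u, 0.0) for u in samples) / n
    return act, switch_off, defer


# Assumed belief: the machine thinks u is roughly Normal(0.5, 1.0).
rng = random.Random(0)
uncertain = [rng.gauss(0.5, 1.0) for _ in range(10_000)]

# A machine that is certain u = 0.5 has nothing to learn from the human.
certain = [0.5] * 10_000

for label, belief in (("uncertain", uncertain), ("certain", certain)):
    act, off, defer = expected_utilities(belief)
    print(f"{label}: act={act:.3f} switch_off={off:.3f} defer={defer:.3f}")
```

Under these assumptions, deferring is never worse than acting unilaterally or switching off, because E[max(u, 0)] is at least max(E[u], 0). The advantage is strict only while the machine is uncertain; once it is certain, human oversight adds nothing, which is why the framework requires the uncertainty to be maintained.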