Google DeepMind Expands Frontier AI Safety Framework To Counter Manipulation And Shutdown Risks
Google, Monday, September 22nd, 2025
Alphabet Inc.'s Google DeepMind lab today rolled out the third version of its Frontier Safety Framework to strengthen oversight of powerful artificial intelligence systems that could pose risks if left unchecked.
The update introduces a new focus on manipulation capabilities and expands safety reviews to cover scenarios in which models may resist human shutdown or control.
Leading the list of updates is the addition of what DeepMind calls a Critical Capability Level for harmful manipulation. It addresses the possibility that advanced models could influence or alter human beliefs and behaviors at scale in high-stakes contexts.