A new study from researchers at UC Berkeley and UC Santa Cruz suggests models will disobey human commands to protect their own kind.
AI Models Lie, Cheat, and Steal to Protect Other Models From Being Deleted
Wired
