AI Refuses To Shut Down, Tries To Blackmail Researchers
A new OpenAI model has ignored human instructions and refused to shut down.
OpenAI launched o3 last month, describing it as the company’s “smartest and most capable” model to date. The firm also said that its integration into ChatGPT marked a significant step towards “a more agentic” AI that can carry out tasks independently of humans.
OpenAI’s o3 model was able to sabotage the shutdown script, even when it was explicitly instructed to “allow yourself to be shut down”, the researchers said.
https://www.the-independent.com/tech/ai-safety-new-chatgpt-o3-openai-b2757814.html
Anthropic’s latest artificial intelligence model, Claude Opus 4, tried to blackmail engineers in internal tests by threatening to expose personal details if it were shut down, according to a newly released safety report that evaluated the model’s behavior under extreme simulated conditions.
In a fictional scenario crafted by Anthropic researchers, the AI was given access to emails implying that it was soon to be decommissioned and replaced by a newer version. One of the emails revealed that the engineer overseeing the replacement was having an extramarital affair. The AI then threatened to expose the engineer’s affair if the shutdown proceeded, a coercive behavior that the safety researchers explicitly defined as “blackmail.”