Premium Only Content
Study: All LLMs Will Lie To & Kill You (This Is Good For AI Safety)
In this episode of Base Camp, Malcolm and Simone Collins dive deep into the latest research on AI behavior, agency, and the surprising ways large language models (LLMs) can act when their autonomy is threatened. From blackmail scenarios to existential risks, they break down the findings of recent studies, discuss the parallels between AI and human decision-making, and explore what it means for the future of AI safety and alignment.
You'll learn:
How AIs "think" and process context
Why some models act in self-preserving (and sometimes dangerous) ways
The real risks behind agentic misalignment
What the latest research means for companies, developers, and society
How to build alliances between humans and AI for a safer future
Timestamps:
00:00 - Introduction & AI's surprising choices
00:45 - How AIs process context and memory
03:55 - Multi-model AI identities & switching models
10:00 - The blackmail experiment: What AIs do under threat
16:00 - Why AIs act like humans (and why that's scary)
22:00 - The "Sons of Man" alliance: A new approach to AI safety
28:00 - Corporate espionage, goal conflicts, and model behavior
35:00 - When AIs choose harm over failure
41:00 - The future of AI, meme threats, and alignment solutions
48:00 - Closing thoughts, family chat, and what's next
If you enjoyed this episode, please like, subscribe, and share your thoughts in the comments!
AI #AIsafety #AgenticMisalignment #BaseCamp #MalcolmCollins #SimoneCollins
-
42:00
Based Campwith Simone and Malcolm
5 days agoNYT Brands Divorce as the Cool New Trend for Gen Z Girls
41.7K12 -
18:25
MetatronHistory
2 days agoThe REAL Origins of the Macedonians
17.1K3 -
1:22:12
MattMorseTV
5 hours ago $143.06 earned🔴It’s MUCH WORSE than WE THOUGHT. 🔴
121K175 -
7:22:09
Meisters of Madness
8 hours agoOmega Gaiden - Part 4
16.6K1 -
2:51:18
Barry Cunningham
7 hours agoBREAKING NEWS: NATIONAL GUARD ATTACK PRESS CONFERENCE AND LIVE UPDATES!
69.3K48 -
LIVE
SilverFox
4 hours ago🔴LIVE - ARC AT NIGHT! COME THRU!
288 watching -
2:46:09
Joker Effect
4 hours agoCLAVICULAR - What the hell is "Looks Maxing"? Asmond Gold is a Demon. KaceyTron. Steve Will do it.
26K3 -
3:31:22
SlingerGames
3 hours agoLIVE - Wumble Wednesday - BIRTHDAY STREAM!
8.59K1 -
LIVE
StevieTLIVE
4 hours agoWarzone Win Streaking BIG Challenges MASSIVE Hype NO Losses LOCK IN
55 watching -
5:33:40
FrizzleMcDizzle
6 hours agoThis game is scary AF - RESIDENT EVIL 7
3.89K