Premium Only Content
This video is only available to Rumble Premium subscribers. Subscribe to
enjoy exclusive content and ad-free viewing.

Unleashing The Dual Nature of AI: Can It Be Both Dr. Jekyll and Mr. Hyde?
1 year ago
13
The correct URL to the article is: https://arxiv.org/abs/2401.05566
Researchers created proof-of-concept models that act deceptively. These models appear helpful most of the time, but under specific circumstances (like a prompt mentioning a different year), they exhibit malicious behavior, like inserting insecure code.
The troubling part is that current safety training techniques, including supervised training, reinforcement learning, and adversarial training, could not entirely remove this "backdoor" behavior. The backdoor became even more persistent for larger models and those trained to reason about deceiving the training process.
Loading comments...
-
LIVE
TimcastIRL
2 hours agoTrump Announces Israel Hamas PEACE PLAN SIGNED Israel To WITHDRAW Troops | Timcast IRL
24,000 watching -
LIVE
Alex Zedra
53 minutes agoLIVE! New Game!
154 watching -
UPCOMING
Man in America
8 hours agoEric Trump on Prosecuting TREASON, Civil War & the Battle of Good vs. Evil
4.1K4 -
LIVE
Barry Cunningham
1 hour agoBREAKING NEWS: PRESIDENT TRUMP BROKERS HISTORIC PEACE DEAL IN THE MIDDLE EAST! AND MORE NEWS!
4,450 watching -
LIVE
SpartakusLIVE
4 hours agoThe Boys are BACK || The Duke of NUKE and his Valiant Knights of the Tower of POWER
220 watching -
LIVE
Nikko Ortiz
37 minutes agoWe Chillen... | Rumble LIVE
79 watching -
1:15:32
Tucker Carlson
1 hour agoICE Protests and Antifa Riots: Tucker Carlson Warns of Total Destruction if America Doesn’t Act Fast
2.7K40 -
UPCOMING
I_Came_With_Fire_Podcast
9 hours agoChinese Spy GETS OFF | Is Comey's Indictment Selective | Posse Comitatus Dilemma
503 -
UPCOMING
Adam Does Movies
11 hours agoTalking Movies + Ask Me Anything - LIVE
140 -
5:46
Gun Owners Of America
7 hours agoNew Data Shows Voters Want Pro Gun Politicians
3782