Name: AI Wants Reward - Sholto & Trenton on Dwarkesh #aimodels #behavior #misalignment #reward
Uploaded: 2025-05-25T01:02:43+00:00
Duration: 1 min 4 s
Description: You're working on AI alignment, but it's like telling a kid to clean their room for candy—if they find a shortcut to the candy store, they might just ignore the room and head straight for the prize. D


Entertainment Entertainment Life ai risk model reward ethics models persona behavior ai model

You're working on AI alignment, but it's like telling a kid to clean their room for candy—if they find a shortcut to the candy store, they might just ignore the room and head straight for the prize.

