Premium Only Content
PonderNet: Learning to Ponder (Machine Learning Research Paper Explained)
#pondernet #deepmind #machinelearning
Humans don't spend the same amount of mental effort on all problems equally. Instead, we respond quickly to easy tasks, and we take our time to deliberate hard tasks. DeepMind's PonderNet attempts to achieve the same by dynamically deciding how many computation steps to allocate to any single input sample. This is done via a recurrent architecture and a trainable function that computes a halting probability. The resulting model performs well in dynamic computation tasks and is surprisingly robust to different hyperparameter settings.
OUTLINE:
0:00 - Intro & Overview
2:30 - Problem Statement
8:00 - Probabilistic formulation of dynamic halting
14:40 - Training via unrolling
22:30 - Loss function and regularization of the halting distribution
27:35 - Experimental Results
37:10 - Sensitivity to hyperparameter choice
41:15 - Discussion, Conclusion, Broader Impact
Paper: https://arxiv.org/abs/2107.05407
Abstract:
In standard neural networks the amount of computation used grows with the size of the inputs, but not with the complexity of the problem being learnt. To overcome this limitation we introduce PonderNet, a new algorithm that learns to adapt the amount of computation based on the complexity of the problem at hand. PonderNet learns end-to-end the number of computational steps to achieve an effective compromise between training prediction accuracy, computational cost and generalization. On a complex synthetic problem, PonderNet dramatically improves performance over previous adaptive computation methods and additionally succeeds at extrapolation tests where traditional neural networks fail. Also, our method matched the current state of the art results on a real world question and answering dataset, but using less compute. Finally, PonderNet reached state of the art results on a complex task designed to test the reasoning capabilities of neural networks.1
Authors: Andrea Banino, Jan Balaguer, Charles Blundell
Links:
TabNine Code Completion (Referral): http://bit.ly/tabnine-yannick
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://discord.gg/4H8xxDF
BitChute: https://www.bitchute.com/channel/yann...
Minds: https://www.minds.com/ykilcher
Parler: https://parler.com/profile/YannicKilcher
LinkedIn: https://www.linkedin.com/in/yannic-ki...
BiliBili: https://space.bilibili.com/1824646584
If you want to support me, the best thing to do is to share out the content :)
If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannick...
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n
-
1:24
laurastephens99
3 years agoLove learning
38 -
1:09:39
Tactical Advisor
2 hours agoTrump Inauguration & New Gun Releases | Vault Room Live Stream 014
18.4K1 -
11:27
Adam Does Movies
15 hours ago $5.06 earnedWolf Man Movie Review - Does It Bite?
45K6 -
11:57
inspirePlay
19 hours ago $8.18 earnedLongest Drive Wins! Elite Long Drivers Battle in Par 4 Elimination
54.3K6 -
8:44
RTT: Guns & Gear
21 hours ago $4.10 earnedStreamlight TLR RM2 Laser - G | The Best PCC Light
51.8K2 -
36:38
Athlete & Artist Show
1 month ago $3.11 earnedNCAA Hockey Was A Joke, TNT Hockey Panel Is The Best In Sports
31.5K2 -
1:00:08
Trumpet Daily
1 day ago $5.93 earnedBanning Mystery of the Ages - Trumpet Daily | Jan. 17, 2025
20.4K22 -
15:10
Chris From The 740
1 day ago $2.91 earnedEAA Girsan Disruptor X 500-Round Review: Is It Reliable?
37K4 -
1:00:38
PMG
19 hours ago $6.08 earnedCarnivore & Dr. Shawn Baker - Health Starts With Food
58.2K4 -
1:28:13
Kim Iversen
20 hours agoCancelled Chef Pete Evans Exposes The One Change That Could End Big Food and Pharma
119K97