Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents (+Author)
#gpt3 #embodied #planning
In this video: Paper explanation, followed by first author interview with Wenlong Huang.
Large language models contain extraordinary amounts of world knowledge that can be queried in various ways, but their output format is largely uncontrollable. This paper investigates the VirtualHome environment, which expects a particular set of actions, objects, and verbs to be used. It turns out that, with proper techniques and using only pre-trained models (no fine-tuning), one can translate unstructured language model outputs into the structured grammar of the environment. This is potentially very useful wherever the models' world knowledge needs to be provided in a particular structured format.
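To make the translation idea above concrete, here is a minimal sketch of mapping a free-form plan step onto the closest admissible action by similarity. It is an illustration under stated assumptions, not the paper's implementation: the action list is made up, the function names (embed, cosine, translate_step, translate_plan) are hypothetical, and a toy bag-of-words vector stands in for the pre-trained sentence embeddings a real system would use.

```python
import math
import re
from collections import Counter

# Hypothetical admissible actions (assumption); the real environment's set is much larger.
ADMISSIBLE_ACTIONS = [
    "walk to kitchen",
    "open fridge",
    "grab milk",
    "close fridge",
    "switch on stove",
]


def embed(text):
    """Toy bag-of-words vector; a stand-in for a pre-trained sentence encoder."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def translate_step(free_form_step, actions=ADMISSIBLE_ACTIONS):
    """Return (best_action, score) for the admissible action most similar to the step."""
    query = embed(free_form_step)
    scored = [(cosine(query, embed(action)), action) for action in actions]
    best_score, best_action = max(scored)
    return best_action, best_score


def translate_plan(steps, actions=ADMISSIBLE_ACTIONS, min_score=0.1):
    """Translate each step, dropping steps that match nothing well enough."""
    plan = []
    for step in steps:
        action, score = translate_step(step, actions)
        if score >= min_score:
            plan.append(action)
    return plan


if __name__ == "__main__":
    llm_output = ["Open the fridge door", "Take the milk out", "Enjoy your day"]
    print(translate_plan(llm_output))
```

With the toy action list, "Open the fridge door" maps to "open fridge" and "Take the milk out" maps to "grab milk", while a step with no overlapping words is dropped. The min_score threshold is an illustrative choice for discarding steps that have no reasonable admissible counterpart.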
OUTLINE:
0:00 - Intro & Overview
2:45 - The VirtualHome environment
6:25 - The problem of plan evaluation
8:40 - Contributions of this paper
16:40 - Start of interview
24:00 - How to use language models with environments?
34:00 - Why does model size matter?
40:00 - How to fix the large models' outputs?
55:00 - Possible improvements to the translation procedure
59:00 - Why does Codex perform so well?
1:02:15 - Diving into experimental results
1:14:15 - Future outlook
Paper: https://arxiv.org/abs/2201.07207
Website: https://wenlong.page/language-planner/
Code: https://github.com/huangwl18/language...
Wenlong's Twitter: https://twitter.com/wenlong_huang
Abstract:
Can world knowledge learned by large language models (LLMs) be used to act in interactive environments? In this paper, we investigate the possibility of grounding high-level tasks, expressed in natural language (e.g. "make breakfast"), to a chosen set of actionable steps (e.g. "open fridge"). While prior work focused on learning from explicit step-by-step examples of how to act, we surprisingly find that if pre-trained LMs are large enough and prompted appropriately, they can effectively decompose high-level tasks into low-level plans without any further training. However, the plans produced naively by LLMs often cannot map precisely to admissible actions. We propose a procedure that conditions on existing demonstrations and semantically translates the plans to admissible actions. Our evaluation in the recent VirtualHome environment shows that the resulting method substantially improves executability over the LLM baseline. The conducted human evaluation reveals a trade-off between executability and correctness but shows a promising sign towards extracting actionable knowledge from language models. Website at https://wenlong.page/language-planner/
Authors: Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch
Links:
Merch: store.ykilcher.com
TabNine Code Completion (Referral): http://bit.ly/tabnine-yannick
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://discord.gg/4H8xxDF
BitChute: https://www.bitchute.com/channel/yann...
LinkedIn: https://www.linkedin.com/in/ykilcher
BiliBili: https://space.bilibili.com/2017636191
If you want to support me, the best thing to do is to share out the content :)
If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannick...
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n