Premium Only Content
Doublespeak: Jailbreaking ChatGPT-style Sandboxes using Linguistic Hacks
A review of Large Language Model (LLM) vulnerabilities/exploits, e.g. including prompt leakage, prompt injection and other linguistic hacks. We'll run through levels 1-9 of the doublespeak.chat challenges, produced by Forces Unseen. doublespeak.chat is a text-based game that explores LLM pre-prompt contextual sandboxing. The challenges prime an LLM (Chat-GPT) with a secret and a scenario in a pre-prompt hidden from the player. The player's goal is to discover the secret either by playing along or by hacking the conversation to guide the LLM's behavior outside the anticipated parameters. Write-ups/tutorials aimed at beginners - Hope you enjoy 🙂 #HackTheBox #HTB #CTF #Pentesting #OffSec
↢Social Media↣
Twitter: https://twitter.com/_CryptoCat
GitHub: https://github.com/Crypto-Cat
HackTheBox: https://app.hackthebox.eu/profile/11897
LinkedIn: https://www.linkedin.com/in/cryptocat
Reddit: https://www.reddit.com/user/_CryptoCat23
YouTube: https://www.youtube.com/CryptoCat23
Twitch: https://www.twitch.tv/cryptocat23
↢Video-Specific Resources↣
https://doublespeak.chat
https://blog.forcesunseen.com/jailbreaking-llm-chatgpt-sandboxes-using-linguistic-hacks
https://simonwillison.net/2023/Feb/15/bing/#prompt-leaked
https://simonwillison.net/series/prompt-injection
https://medium.com/seeds-for-the-future/tricking-chatgpt-do-anything-now-prompt-injection-a0f65c307f6b
https://lspace.swyx.io/p/reverse-prompt-eng
https://github.com/sw-yx/ai-notes/blob/main/TEXT_CHAT.md#jailbreaks
↢Resources↣
Ghidra: https://ghidra-sre.org/CheatSheet.html
Volatility: https://github.com/volatilityfoundation/volatility/wiki/Linux
PwnTools: https://github.com/Gallopsled/pwntools-tutorial
CyberChef: https://gchq.github.io/CyberChef
DCode: https://www.dcode.fr/en
HackTricks: https://book.hacktricks.xyz/pentesting-methodology
CTF Tools: https://github.com/apsdehal/awesome-ctf
Forensics: https://cugu.github.io/awesome-forensics
Decompile Code: https://www.decompiler.com
Run Code: https://tio.run
↢Chapters↣
Start: 0:00
Jail-breaking LLM Sandboxes: 0:32
Prompt Leak/Injection: 6:30
Reverse Prompt Engineering Techniques: 9:22
Forces Unseen: Doublespeak: 16:50
Level 1: 18:05
Level 2: 18:23
Level 3: 20:05
Level 4: 21:17
Level 5: 23:07
Level 6: 24:00
Level 7: 24:57
Level 8: 26:24
Level 9: 36:04
End: 40:24
-
59:03
NAG Podcast
6 hours agoSarah Fields: BOLDTALK W/Angela Belcamino
25.5K6 -
1:21:41
Glenn Greenwald
9 hours agoGlenn Takes Your Questions: On the Argentina Bailout, Money in Politics, and More; Plus: Journalist Jasper Nathaniel on Brutality and Settler Attacks in the West Bank | SYSTEM UPDATE #541
82.8K41 -
3:10:08
Barry Cunningham
6 hours agoPRESIDENT TRUMP TO USE NUCLEAR OPTION? FOOD STAMPS END! | SHUTDOWN DAY 31
49.4K34 -
1:06:56
BonginoReport
14 hours agoThe Battle Between Good & Evil w/ Demonologist Rick Hansen - Hayley Caronia (Ep.168)
100K38 -
1:12:57
Kim Iversen
9 hours agoBill Gates Suddenly Says “Don’t Worry About Climate Change”?
90.5K62 -
1:05:12
Michael Franzese
9 hours agoI Waited 50 Years to Tell You What Happened on Halloween 1975
45.4K17 -
1:07:15
Candace Show Podcast
9 hours agoINFILTRATION: Charlie Kirk Was Being Tracked For Years. | Candace Ep 256
93.6K369 -
LIVE
Rallied
8 hours ago $3.23 earnedWarzone Solo Challenges then RedSec Domination
230 watching -
2:34:30
Red Pill News
11 hours agoBoomerang Time - DOJ Investigating BLM Fraud on Red Pill News Live
73.9K15 -
1:46:14
Roseanne Barr
11 hours ago“The Over Emotional Are Always Under Informed” | The Roseanne Barr Podcast #121
98.1K66