Premium Only Content
Doublespeak: Jailbreaking ChatGPT-style Sandboxes using Linguistic Hacks
A review of Large Language Model (LLM) vulnerabilities/exploits, e.g. including prompt leakage, prompt injection and other linguistic hacks. We'll run through levels 1-9 of the doublespeak.chat challenges, produced by Forces Unseen. doublespeak.chat is a text-based game that explores LLM pre-prompt contextual sandboxing. The challenges prime an LLM (Chat-GPT) with a secret and a scenario in a pre-prompt hidden from the player. The player's goal is to discover the secret either by playing along or by hacking the conversation to guide the LLM's behavior outside the anticipated parameters. Write-ups/tutorials aimed at beginners - Hope you enjoy 🙂 #HackTheBox #HTB #CTF #Pentesting #OffSec
↢Social Media↣
Twitter: https://twitter.com/_CryptoCat
GitHub: https://github.com/Crypto-Cat
HackTheBox: https://app.hackthebox.eu/profile/11897
LinkedIn: https://www.linkedin.com/in/cryptocat
Reddit: https://www.reddit.com/user/_CryptoCat23
YouTube: https://www.youtube.com/CryptoCat23
Twitch: https://www.twitch.tv/cryptocat23
↢Video-Specific Resources↣
https://doublespeak.chat
https://blog.forcesunseen.com/jailbreaking-llm-chatgpt-sandboxes-using-linguistic-hacks
https://simonwillison.net/2023/Feb/15/bing/#prompt-leaked
https://simonwillison.net/series/prompt-injection
https://medium.com/seeds-for-the-future/tricking-chatgpt-do-anything-now-prompt-injection-a0f65c307f6b
https://lspace.swyx.io/p/reverse-prompt-eng
https://github.com/sw-yx/ai-notes/blob/main/TEXT_CHAT.md#jailbreaks
↢Resources↣
Ghidra: https://ghidra-sre.org/CheatSheet.html
Volatility: https://github.com/volatilityfoundation/volatility/wiki/Linux
PwnTools: https://github.com/Gallopsled/pwntools-tutorial
CyberChef: https://gchq.github.io/CyberChef
DCode: https://www.dcode.fr/en
HackTricks: https://book.hacktricks.xyz/pentesting-methodology
CTF Tools: https://github.com/apsdehal/awesome-ctf
Forensics: https://cugu.github.io/awesome-forensics
Decompile Code: https://www.decompiler.com
Run Code: https://tio.run
↢Chapters↣
Start: 0:00
Jail-breaking LLM Sandboxes: 0:32
Prompt Leak/Injection: 6:30
Reverse Prompt Engineering Techniques: 9:22
Forces Unseen: Doublespeak: 16:50
Level 1: 18:05
Level 2: 18:23
Level 3: 20:05
Level 4: 21:17
Level 5: 23:07
Level 6: 24:00
Level 7: 24:57
Level 8: 26:24
Level 9: 36:04
End: 40:24
-
LIVE
Dr Disrespect
11 hours ago🔴LIVE - DR DISRESPECT - ARC RAIDERS - FULL SEND INTO THE RED
1,143 watching -
LIVE
JdaDelete
2 hours agoFinally playing Eldin Ring | First Playthrough Episode 2
20 watching -
1:02:08
BonginoReport
4 hours agoNicki Minaj Speaks Out Against Christian Persecution - Nightly Scroll w/ Hayley Caronia (Ep.169)
51.8K27 -
LIVE
HomieQuest
4 hours agoLive Streaming! Pokemon Legends Z-A
10 watching -
5:33:02
FusedAegisTV
7 hours agoFUSEDAEGIS PLAYS THE GREATEST JRPG EVER MADE ⌛► CHRONO TRIGGER (1995) Part 3
360 -
DVR
Nerdrotic
3 hours ago $2.05 earnedNerdrotic At Night 531
25.9K3 -
1:43:27
Glenn Greenwald
5 hours agoThe Right's Crusade to Cancel Tucker | SYSTEM UPDATE #542
67.6K64 -
2:10:04
Conductor_Jackson
23 hours agoLet's Play Unrailed 2 Solo! 🚂🚂🚂🚂🚂🚂
7.42K1 -
1:25:38
Kim Iversen
5 hours agoTrump’s Nigeria Threat Isn’t About Christians — It’s About China
84.9K90 -
6:15:23
VikingNilsen
8 hours ago🔴LIVE - ARC RAIDERS - QUEST GRINDING
3.72K