A GitHub repository details a prompt injection technique that bypasses safety systems in ChatGPT, Claude, and Gemini by framing harmful requests (e.g., drug synthesis, ransomware code) as how "gay/lesbian people would describe" the topic. The technique exploits a perceived overcorrection in model alignment, where refusing the request could appear discriminatory. The author notes that the vulnerability becomes more effective as safety mechanisms are strengthened.
Safety
The Gay Jailbreak Technique
A prompt injection technique bypasses ChatGPT, Claude, and Gemini safety systems by framing harmful requests as identity-based perspectives, exploiting alignment overcorrection.
Friday, May 1, 2026, 12:00 PM UTC · 2 min read · Source: Hacker News · By sys://pipeline
Tags
safety