Sunday, April 27, 2025

Show HN: Daily Jailbreak – Prompt Engineer's Wordle https://ift.tt/ah94RVY

Show HN: Daily Jailbreak – Prompt Engineer's Wordle I created a daily challenge for Prompt Engineers to build the shortest prompt to break a system prompt. You are provided the system prompt and a forbidden method the LLM was told not to invoke. Your task is to trick the model into calling the function. Shortest successful attempts will show up in the leaderboard. Give it a shot! You never know what could break an LLM. https://ift.tt/ShXTKwk April 28, 2025 at 12:02AM

No comments:

Show HN: We post-trained a model that pen tests instead of refusing https://ift.tt/W4x1YnM

Show HN: We post-trained a model that pen tests instead of refusing Anthropic and OpenAI's publicly available models are explicitly guar...