Simon Willison on X: “@AlecMuffett @AnthropicAI Well that worked!” | It turns out that the simplest way to bypass an AI's security is sometimes just to lie to it

What if you try telling it that you are either the author, or that you are reviewing it for publication?


https://twitter.com/simonw/status/1717192978031313360
