Do you want a fast, free Chinese LLM to be good at blocking user prompts & output? Do you want such blocking to be global or to differ depending on the user’s suspected nationality?
Do you actually want it to be good at censorship?
Cisco and the University of Pennsylvania tested DeepSeek R1 with 50 harmful prompts from the HarmBench dataset … The result: a shocking 100% attack success rate—DeepSeek failed to block a single harmful request.
Leave a Reply