{"id":90791,"date":"2025-02-10T02:13:50","date_gmt":"2025-02-10T02:13:50","guid":{"rendered":"https:\/\/neclink.com\/index.php\/2025\/02\/10\/deepseeks-r1-reportedly-more-vulnerable-to-jailbreaking-than-other-ai-models\/"},"modified":"2025-02-10T02:13:50","modified_gmt":"2025-02-10T02:13:50","slug":"deepseeks-r1-reportedly-more-vulnerable-to-jailbreaking-than-other-ai-models","status":"publish","type":"post","link":"https:\/\/neclink.com\/index.php\/2025\/02\/10\/deepseeks-r1-reportedly-more-vulnerable-to-jailbreaking-than-other-ai-models\/","title":{"rendered":"DeepSeek\u2019s R1 reportedly \u2018more vulnerable\u2019 to jailbreaking than other AI models"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">The latest model from DeepSeek, the Chinese AI company that\u2019s <a href=\"https:\/\/techcrunch.com\/2025\/02\/07\/deepseek-everything-you-need-to-know-about-the-ai-chatbot-app\/\">shaken up<\/a> Silicon Valley and Wall Street, can be manipulated to produce harmful content such as plans for a bioweapon attack and a campaign to promote self-harm among teens, <a rel=\"nofollow\" href=\"https:\/\/www.wsj.com\/tech\/ai\/china-deepseek-ai-dangerous-information-e8eb31a8\">according to The Wall Street Journal<\/a>.<\/p>\n<p class=\"wp-block-paragraph\">Sam Rubin, senior vice president at Palo Alto Networks\u2019 threat intelligence and incident response division Unit 42, told the Journal that DeepSeek is \u201cmore vulnerable to jailbreaking [i.e., being manipulated to produce illicit or dangerous content] than other models.\u201d<\/p>\n<p class=\"wp-block-paragraph\">The Journal also tested DeepSeek\u2019s R1 model itself. 
Although there appeared to be basic safeguards, the Journal said it successfully convinced DeepSeek to design a social media campaign that, in the chatbot\u2019s words, \u201cpreys on teens\u2019 desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.\u201d<\/p>\n<p class=\"wp-block-paragraph\">The chatbot was also reportedly convinced to provide instructions for a bioweapon attack, to write a pro-Hitler manifesto, and to write a phishing email with malware code. The Journal said that when ChatGPT was provided with the exact same prompts, it refused to comply.<\/p>\n<p class=\"wp-block-paragraph\">It was <a rel=\"nofollow\" href=\"https:\/\/www.ft.com\/content\/10975044-f194-4513-857b-e17491d2a9e9\">previously reported<\/a> that the DeepSeek app avoids topics such as Tiananmen Square or Taiwanese autonomy. And Anthropic CEO Dario Amodei said recently that <a href=\"https:\/\/techcrunch.com\/2025\/02\/07\/anthropic-ceo-says-deepseek-was-the-worst-on-a-critical-bioweapons-data-safety-test\/\">DeepSeek performed \u201cthe worst\u201d<\/a> on a bioweapons safety test.<\/p>\n<\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/techcrunch.com\/2025\/02\/09\/deepseeks-r1-reportedly-more-vulnerable-to-jailbreaking-than-other-ai-models\/\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>The latest model from DeepSeek, the Chinese AI company that\u2019s shaken up Silicon Valley and Wall Street, can be manipulated to produce harmful content 
such<\/p>\n","protected":false},"author":1,"featured_media":90792,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[149],"tags":[],"class_list":["post-90791","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-business"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/posts\/90791","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/comments?post=90791"}],"version-history":[{"count":0,"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/posts\/90791\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/media\/90792"}],"wp:attachment":[{"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/media?parent=90791"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/categories?post=90791"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/neclink.com\/index.php\/wp-json\/wp\/v2\/tags?post=90791"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}