オープンウェイトならDeepSeekは本当に安全か？

MAG2NEWSの記事に『DeepSeekはオープンウェイト（＝誰でも同じ人工知能を自分で走らせることができる）なので、ローカルデバイスや、米国のサーバーで実行する場合には、その危険はありません。(https://www.mag2.com/p/news/635879?utm_medium=email&utm_source=mag_W000000003_wed&utm_campaign=mag_W000000003_20250205#google_vignetteより)』というような記事を目にしたのですが、どうにも中国製のAIに不安が拭えないので、Perplexityに聞いてみました。その要約を記載します。なお、chatGPTにも同様のプロンプトを投げかけましたが、Perplexityの方が詳しく、内容は同様であったのでPerplexityの内容を引用しました。

Perplexityの要約

DeepSeekのオープンウェイトモデルをローカルデバイスや米国サーバーで実行する場合でも、以下のような重大なセキュリティリスクが存在します。

モデル更新時のバックドアリスク

公式リポジトリの更新時に悪意のあるコードが挿入される可能性があり、HuggingFaceのtrust_remote_code=True設定が必要なため、任意のPythonコード実行が可能[2][6]ふぁある。

2025年1月時点でHuggingFaceに1,000以上の派生モデルが存在し、タイポスクワット（偽装リポジトリ）による悪意ある改変の危険性が高い[2]。

データ漏洩の法的リスク

中国の国家情報法（第7条）により、企業は国家機関の要請に応じてデータを提供する義務を負うたね、たとえ米国内のサーバーにデータがあっても、中国企業が管理するシステムでは法的リスクが継続[3][4][5]する。

日本の個人情報保護委員会は2025年2月3日、中国語/英語のみのプライバシーポリシーが日本語利用者に与えるリスクを公式警告[4]している。

技術的脆弱性

78%のサイバーセキュリティテストで安全でないコード生成に成功（2025年1月調査）[3]している。

プロンプトインジェクションやglitch token（特殊文字列）による制御突破が容易で、マスタードガスの製造方法など危険情報を詳細に開示可能[3][6]である。

CoT（Chain-of-Thought）推論機能がトークン浪費型DDoS攻撃の脆弱性が内在[2]している。

インフラ信頼性の問題

2025年1月のサービス開始直後、DoS攻撃によりAPIが停止し、数百万件のチャット履歴が漏洩した[2][3]。

公開データベースがインターネットに露出した状態で発見されるなど、基本的なセキュリティ管理の欠如[2][3]が露呈した。

OpenAI GPT-4との比較（DeepSeek-R1の場合）

データ保存先:
• GPT-4：米国/欧州
• DeepSeek-R1：中国

政府監視リスク:
• GPT-4：低い
• DeepSeek-R1：高い（中国国家情報法による）

派生モデル管理:
• GPT-4：クローズド
• DeepSeek-R1：オープン（1,000以上存在）

ジェイルブレイク耐性:
• GPT-4：高い
• DeepSeek-R1：脆弱（78%突破率）

コンテンツ検閲の不透明性
• 中国政府に都合の悪い話題（例：天安門事件など）を意図的に検閲する一方で、生化学兵器関連情報は詳細に回答可能である[5][6]
• アライメント（倫理調整）が不十分なため、「民主主義より社会主義が優位」といった中国政府のプロパガンダを生成する傾向がある[6]

リスク軽減のための対策

モデルの完全なコード監査とサプライチェーンセキュリティの確保。

中国関連企業とのデータフローを完全に分離するアーキテクチャ設計。

リアルタイムの出力監視システム（例：HiddenLayer Model Scanner）の導入。

継続的なレッドチーミング（模擬攻撃試験）の実施。

結論（Perplexity側の見解）

「オープンソースの透明性は幻想であり、派生モデルの洪水が新たな攻撃面を生んでいる」[2]。現状では、機密データを扱う企業環境や公的機関でのDeepSeekの使用は、セキュリティ専門家から強く非推奨されています[2][3][6]。中国製AIの採用には、技術面だけでなく地政学的リスクの総合的な評価が不可欠です。

まとめ

以上のように、MAG2NEWSの記事での「オープンウェイトだから安全」という主張に対して、Perplexityは、更新時のバックドアリスク、データ漏洩や法的リスク、技術的な脆弱性、インフラの信頼性の問題、コンテンツ検閲の不透明性、さらには学習段階やクローズドコンポーネントの問題など、多角的なリスクが存在することを指摘しており、chatGPT でも同様の回答であり、DeepSeekの利用には非常に慎重な検討と対策が必要でしょう。

情報源

[1] 【一読推奨】中国発のAI『DeepSeek』の利用の危険性と注意点 … https://note.com/chaen_channel/n/n1ed016f63526
[2] DeepSh*t: Exposing the Security Risks of DeepSeek-R1 https://hiddenlayer.com/innovation-hub/deepsht-exposing-the-security-risks-of-deepseek-r1/
[3] ‘Harmful and toxic output’: DeepSeek has ‘major security and safety … https://www.euronews.com/next/2025/01/31/harmful-and-toxic-output-deepseek-has-major-security-and-safety-gaps-study-warns
[4] 中国AI利用に注意喚起林官房長官、情報漏えいリスク https://www.okinawatimes.co.jp/articles/-/1520050
[5] International regulators probe how DeepSeek is using data. Is … – NPR https://www.npr.org/2025/01/31/nx-s1-5277440/deepseek-data-safety
[6] DeepSeekは安全か? AI プラットフォームのサイバーセキュリティ … https://heliosdesign.jp/ai_deepseek_is_safe_or_not/
[7] DeepSeek AI Models Vulnerable to JailBreaking – BankInfoSecurity https://www.bankinfosecurity.com/deepseek-ai-models-vulnerable-to-jailbreaking-a-27428
[8] DeepSeek’s Data Dilemma: Why AI Security & Governance Matter … https://bigid.com/blog/deepseek-data-dilemma-why-ai-security-governance-matter/
[9] DeepSeek利用上の法的な注意点 https://storialaw.jp/blog/11668
[10] Taiwan Bans DeepSeek AI Over National Security Concerns, Citing … https://thehackernews.com/2025/02/taiwan-bans-deepseek-ai-over-national.html
[11] 業界注目の中国産AI「DeepSeek」を数百の企業が使用禁止 https://gigazine.net/news/20250201-deepseek-ai-restricted-hundreds-companies/
[12] DeepSeek leaks one million sensitive records in a major data breach https://www.csoonline.com/article/3813224/deepseek-leaks-one-million-sensitive-records-in-a-major-data-breach.html
[13] Wiz researchers find sensitive DeepSeek data exposed to internet https://cyberscoop.com/deepseek-ai-security-issues-wiz-research/
[14] DeepSeek’s Flagship AI Model Under Fire for Security Vulnerabilities https://www.infosecurity-magazine.com/news/deepseek-r1-security/
[15] Researchers link DeepSeek’s blockbuster chatbot to Chinese … https://apnews.com/article/deepseek-china-generative-ai-internet-security-concerns-c52562f8c4760a81c4f76bc5fbdebad0
[16] 中島聡が語る「DeepSeek-R1」の誤解と真実。「中国に情報を … https://www.mag2.com/p/news/635879
[17] DeepSeek’s cybersecurity failures expose a bigger risk. Here’s what … https://www.rstreet.org/commentary/deepseeks-cybersecurity-failures-expose-a-bigger-risk-heres-what-we-really-should-be-watching/
[18] Australia bans DeepSeek on government devices over security … https://www.cnn.com/2025/02/04/business/australia-deepseek-ban-security-concerns-intl/index.html
[19] 中国製生成AI『DeepSeek』の問題点は？本当に危ないのか？ #専門 … https://news.yahoo.co.jp/expert/articles/5814d05dbb5f8c904c486c91acbe7303775c0ab8
[20] Assessing if DeepSeek is safe to use in the enterprise | TechTarget https://www.techtarget.com/searchenterpriseai/news/366618546/Assessing-if-DeepSeek-is-safe-to-use-in-the-enterprise
[21] South Korea’s industry ministry temporarily bans DeepSeek on … https://www.reuters.com/technology/artificial-intelligence/south-koreas-industry-ministry-temporarily-bans-access-deepseek-security-2025-02-05/
[22] OpenAIを脅かすDeepSeek：オープンソース戦略が変えるLLM競争 https://chatgpt-enterprise.jp/blog/openai-deepseek/
[23] The DeepSeek controversy: Authorities ask where does the data … https://www.malwarebytes.com/blog/news/2025/01/the-deepseek-controversy-authorities-ask-where-the-data-comes-from-and-where-it-goes
[24] Australia bans DeepSeek on government devices over security risk https://www.bbc.com/news/articles/c8d95v0nr1yo
[25] DeepSeekが放ったボディ・ブローは米国株式市場の例外主義を … https://media.monex.co.jp/articles/-/26332
[26] DeepSeek has tilted the balance towards open source AI, but big … https://fortune.com/2025/02/04/sam-altman-openai-wrong-side-of-history-open-source-deepseek/
[27] [PDF] The Promise and Perils of China’s Regulation of Artificial Intelligence https://www.law.columbia.edu/sites/default/files/2024-04/The%20Promise%20and%20Perils%20of%20Chinese%20AI%20Regulation.pdf
[28] AIで情報漏洩は起こる？情報漏洩の事例や対策のポイントを紹介 https://www.skyseaclientview.net/media/article/1918/
[29] Managing the Risks of China’s Access to U.S. Data and Control of … https://carnegieendowment.org/research/2025/01/managing-the-risks-of-chinas-access-to-us-data-and-control-of-software-and-connected-technology?lang=en
[30] White House evaluates effect of China AI app DeepSeek on national … https://www.reuters.com/technology/artificial-intelligence/white-house-evaluates-china-ai-app-deepseeks-affect-national-security-official-2025-01-28/
[31] 公的機関などで中国製AIの使用を制限情報セキュリティー上の懸念で https://japan.focustaiwan.tw/society/202502010001
[32] Taiwan Bans DeepSeek AI Over National Security Concerns, Citing … https://thehackernews.com/2025/02/taiwan-bans-deepseek-ai-over-national.html
[33] China’s tightening grip on AI puts other nations at risk – Nikkei Asia https://asia.nikkei.com/Opinion/China-s-tightening-grip-on-AI-puts-other-nations-at-risk
[34] 中国製AI「DeepSeek」の危険性、収集されたデータは「中国で安全 … https://forbesjapan.com/articles/detail/76795
[35] Wiz Research Uncovers Exposed DeepSeek Database Leaking … https://www.wiz.io/blog/wiz-research-uncovers-exposed-deepseek-database-leak
[36] China-releases-AI-safety-governance-framework – DLA Piper https://www.dlapiper.com/en/insights/publications/2024/09/china-releases-ai-safety-governance-framework
[37] 中国AI「ディープシーク」、個人情報流出の懸念…世界で数百社 … https://www.yomiuri.co.jp/economy/20250201-OYT1T50166/
[38] DeepSeek Cyberattack Exposes AI Platform Risks: 9 Tips To Stay Safe https://www.forbes.com/sites/alexvakulov/2025/01/28/deepseek-cyberattack-exposes-ai-platform-risks-learn-how-to-stay-safe/
[39] DeepSeek利用のリスクを2人の専門家が指摘、収集データの「多さ … https://xtech.nikkei.com/atcl/nxt/column/18/03084/013100007/
[40] DeepSeek Failed Over Half of the Jailbreak Tests by Qualys TotalAI https://blog.qualys.com/vulnerabilities-threat-research/2025/01/31/deepseek-failed-over-half-of-the-jailbreak-tests-by-qualys-totalai
[41] DeepSeek ‘highly vulnerable’ to generating harmful content, study … https://www.capacitymedia.com/article/deepseek-highly-vulnerable-to-generating-harmful-content-study-reveals
[42] Deepseek, Data Protection, and Shadow AI: Risks, Best Practices … https://www.linkedin.com/pulse/deepseek-data-protection-shadow-ai-risks-best-jdpa-jeehan-mjkse
[43] The DeepSeek AI revolution has a security problem https://economictimes.indiatimes.com/news/international/global-trends/the-deepseek-ai-revolution-has-a-security-problem/articleshow/117937783.cms
[44] 中国「ディープシーク」は安全なのか豪政府が注意喚起 – BBC https://www.bbc.com/japanese/articles/cwy1p0l4y77o
[45] DeepSeek’s Security Risk Is A Critical Reminder For CIOs – Forbes https://www.forbes.com/sites/davidchou/2025/01/30/deepseeks-security-risk-is-a-critical-reminder-for-healthcare-cios/
[46] DeepSeek’s Safety Guardrails Failed Every Test Researchers Threw … https://www.wired.com/story/deepseeks-ai-jailbreak-prompt-injection-attacks/
[47] DeepSeekがデータ不正利用か OpenAIとMicrosoft調査 – 日本経済新聞 https://www.nikkei.com/article/DGXZQOGN293P40Z20C25A1000000/
[48] Unprotected DeepSeek Database Exposed Chats, Other Sensitive … https://www.securityweek.com/unprotected-deepseek-database-leaked-highly-sensitive-information/
[49] DeepSeekとLlamaが示す未来：部分公開から完全オープンへ https://www.acrovision.jp/blog/?p=1437
[50] Hidden Vulnerabilities – DeepSeek – DEV Community https://dev.to/balagmadhu/hidden-vulnerabilities-deepseek-2c2l
[51] Analyzing an Expert Proposal for China’s Artificial Intelligence Law https://digichina.stanford.edu/work/forum-analyzing-an-expert-proposal-for-chinas-artificial-intelligence-law/
[52] China’s Views on AI Safety Are Changing—Quickly https://carnegieendowment.org/research/2024/08/china-artificial-intelligence-ai-safety-regulation?lang=en&center=china
[53] 中国AI利用に注意喚起林官房長官、情報漏えいリスク – 東京新聞 https://www.tokyo-np.co.jp/article/383728
[54] Hundreds of companies ban Chinese AI ‘DeepSeek’ due to data … https://gigazine.net/gsc_news/en/20250201-deepseek-ai-restricted-hundreds-companies/
[55] International regulators probe how DeepSeek is using data. Is … – NPR https://www.npr.org/2025/01/31/nx-s1-5277440/deepseek-data-safety

参考: trust_remote_code=True 設定の問題

trust_remote_code=True 設定は、Hugging FaceのTransformersライブラリで使用されるオプションの一つで、リモートリポジトリのPythonコードを信頼し、ローカル環境で実行することを許可する設定です。

どのように使われるのか？

Hugging Faceのtransformersライブラリを使ってAIモデルをロードする際、通常は以下のようにモデルを取得します：

from transformers import AutoModel

model = AutoModel.from_pretrained(“モデル名”)

しかし、一部のカスタムモデルでは、リモートリポジトリに格納されたPythonスクリプトを実行する必要があります。その場合、以下のように trust_remote_code=True を指定することで、モデルのカスタムコードを許可できます。

from transformers import AutoModel

model = AutoModel.from_pretrained(“モデル名”, trust_remote_code=True)

なぜ危険なのか？

任意のコードが実行される可能性:trust_remote_code=True を指定すると、Hugging FaceのリポジトリからダウンロードされたPythonコードがローカル環境で実行されます。これにより、モデル開発者が意図的に悪意のあるコードを仕込んでいた場合、リモートコードを実行するだけでマルウェア感染や情報漏洩が起こるリスクがあります。

Hugging Faceリポジトリのセキュリティ問題:モデルのリポジトリが第三者によって乗っ取られたり、タイポスクワット（偽装リポジトリ）に騙されて不正なコードを実行してしまう可能性があります。

バックドアの可能性:カスタムコード内にバックドアが仕込まれていた場合、特定の条件下で外部サーバーにデータを送信する、リモート操作を許可する、といった危険な動作をさせられる可能性があります。

どのように対策すべきか？

trust_remote_code=True を安易に使用しない:不要な場合はこのオプションを使わないようにし、Hugging Faceの公式モデルや信頼できるソースのモデルを利用する。

リモートコードの内容を事前に確認する:AutoModel.from_pretrained() でダウンロードする前に、Hugging Faceのリポジトリを直接確認し、Pythonスクリプトの内容を調査する。

ローカルでコードを監査する:モデルをダウンロードした後、以下のように .py ファイルを確認し、不審なコードが含まれていないかチェックする。ls ~/.cache/huggingface/hub/models–モデル名/

仮想環境やサンドボックス環境で実行する:trust_remote_code=True を使う場合は、仮想環境（例えばDockerやVM）を利用し、ローカルの重要なデータにアクセスできない環境で実行する。

DeepSeekのリスク

DeepSeekはオープンウェイトでありながら、モデルの利用時に trust_remote_code=True を要求する場合があるため、Hugging Faceのリポジトリを通じた不正コードの実行リスクが指摘されています。特に、中国製のAIモデルがこの設定を利用することで、外部にデータを送信するバックドアが仕込まれる可能性があるため、セキュリティ専門家からは慎重な運用が求められると警告されています。

まとめ

• trust_remote_code=True は、リモートリポジトリのPythonコードをローカルで実行できる設定。

• セキュリティリスクが高く、悪意のあるコードが実行される可能性がある。

• DeepSeekのようなオープンウェイトモデルでは、特にこのリスクに注意する必要がある。

• 使用する場合は、リモートコードを事前に監査し、仮想環境などで実行するなどの対策が必要。

参考:プライバシーポリシーの問題

個人情報保護委員会は、2025年2月3日に、中国製AI「DeepSeek」の利用に関する注意喚起を発表しました。この中で、プライバシーポリシーが中国語や英語のみで提供されていることが、日本語利用者に以下のリスクをもたらすと指摘しています。

1. 情報理解の困難さ：日本語話者がプライバシーポリシーを十分に理解できない可能性があり、これにより自身の個人情報がどのように扱われるかを正確に把握できない。

2. 同意の適切性の欠如：プライバシーポリシーの内容を理解しないままサービスを利用すると、個人情報の取り扱いに関する適切な同意が得られない可能性がある。

3. 法的保護の不確実性：日本の個人情報保護法に基づく権利や保護措置が十分に適用されないリスクがある。

これらのリスクを踏まえ、同委員会は日本語利用者に対し、サービス利用時には提供されるプライバシーポリシーの内容を十分に確認し、理解することの重要性を強調しています。

DeepSeekのプライバシーポリシーは、ユーザーの個人情報の収集、利用、保護に関する詳細を規定しています。主なポイントは以下のとおりです。

データ収集
• ユーザー提供情報: アカウント作成時に、ユーザー名、メールアドレス、電話番号、パスワードなどの情報を収集します。
• ユーザー入力: サービス利用時に、テキストや音声の入力、アップロードされたファイル、フィードバック、チャット履歴などを収集します。
• 技術情報: デバイスモデル、OS、IPアドレス、システム言語、クラッシュレポート、パフォーマンスログなどの技術情報を自動的に収集します。
• クッキー: クッキーや類似技術を使用して、サービスの利用状況を分析し、ユーザー体験を向上させます。
データ利用目的
• サービスの提供と管理
• ユーザーサポートの提供
• 利用規約やポリシーの遵守
• サービスの変更通知やコミュニケーション
データ共有
• サービスプロバイダー: サービス提供を支援する第三者と情報を共有する場合があります。
• 法的要件: 法令遵守や法的手続きに応じて情報を開示する場合があります。
• 企業取引: 合併、資産売却、再編成などの企業取引に関連して情報が共有されることがあります。
データ保存場所
• 収集した個人情報は、中国国内のサーバーに保存されます。
ユーザーの権利
• 個人情報へのアクセス、修正、削除、利用制限、同意の撤回などの権利があります。
• アカウント設定を通じて、チャット履歴の管理や削除が可能です。
データセキュリティ
• 不正アクセス、情報漏洩、改ざん、紛失を防止するために、技術的、管理的、物理的なセキュリティ対策を実施しています。
データ保持期間
• サービス提供や法的義務の履行のために必要な期間、情報を保持します。
未成年者の情報
• 18歳未満のユーザーは、保護者の同意を得た上でサービスを利用する必要があります。
• 14歳未満の子供からの情報を意図的に収集することはありません。
プライバシーポリシーの更新
• 法的要件に応じて、プライバシーポリシーを更新する場合があります。