The Abuse of Pickle Files in AI Model Supply Chains: A Growing Security Threat

As artificial intelligence (AI) and machine learning (ML) continue to transform industries, the security of their supply chains has become a critical concern. One of the most significant and underappreciated risks involves the abuse of Python’s pickle files—a serialization format widely used for saving and sharing ML models. Recent incidents have demonstrated how attackers can exploit pickle files to compromise entire AI supply chains, posing substantial risks to organizations and end users alike.

Understanding Pickle File Vulnerabilities

Python’s pickle module enables the serialization and deserialization of complex objects, including trained machine learning models. While this functionality is convenient, it comes with a serious caveat: pickle deserialization can execute arbitrary code embedded within the file. Loading a malicious pickle file, whether directly via pickle.load() or indirectly via wrappers such as torch.load() that rely on pickle under the hood, can therefore run malware or other harmful payloads on the host system without the user’s knowledge.
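The mechanism is easy to demonstrate. The minimal sketch below, with a hypothetical MaliciousPayload class and a harmless echo command standing in for real malware, shows how a crafted object's __reduce__ hook causes code to run the moment the file is deserialized.

```python
import os
import pickle


class MaliciousPayload:
    """Illustrative stand-in for an attacker-controlled object.

    __reduce__ tells pickle how to rebuild the object; whatever callable
    it returns is invoked during deserialization.
    """

    def __reduce__(self):
        # A real attacker would run any shell command or drop malware here.
        return (os.system, ("echo pickle payload executed",))


# The attacker serializes the object and distributes the file, for example
# disguised as a trained model checkpoint.
blob = pickle.dumps(MaliciousPayload())

# The victim only has to load it; merely deserializing runs os.system().
pickle.loads(blob)
```

Nothing about the file advertises this behavior: to the victim it looks like any other serialized model until it is loaded.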

Key Abuse Techniques

  • Remote Code Execution (RCE): Attackers can craft pickle files that execute system commands during deserialization, enabling them to install malware, steal data, or gain persistent access to the victim’s environment.
  • Model Manipulation and Backdoors: Malicious payloads can alter model weights, insert backdoors, or manipulate data flows, potentially leading to unauthorized access or data leakage.
  • Persistence and Propagation: Sophisticated techniques, such as embedding self-replicating code, allow malicious payloads to persist through model updates and even propagate to derivative models.
  • Supply Chain Compromise: By uploading tainted models to trusted repositories or platforms, attackers can target a wide audience of unsuspecting developers and organizations.

Real-World Incidents

The threat is not theoretical. Security researchers have uncovered malicious models uploaded to Hugging Face and malicious packages published to PyPI. These artifacts contained hidden code that, when loaded, contacted remote servers, downloaded additional malware, or exfiltrated sensitive information. Alarmingly, attackers have also found ways to bypass security scanners such as picklescan, either by exploiting differences in how files are parsed or by obfuscating malicious content within ZIP archives.
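To illustrate roughly what such scanners attempt, and why static inspection remains best-effort, the sketch below uses Python's standard pickletools module to walk a pickle's opcode stream without executing it and flags references to risky modules. The SUSPICIOUS_MODULES set, the model.pkl file name, and the simple string-tracking heuristic are illustrative assumptions, not a substitute for a maintained scanner.

```python
import pickletools

# Illustrative deny-list; real scanners such as picklescan apply far more
# detailed policies, and attackers have still found gaps in them.
SUSPICIOUS_MODULES = {"os", "posix", "nt", "subprocess", "builtins", "runpy"}


def find_suspicious_imports(raw: bytes) -> list[str]:
    """Walk the pickle opcode stream without executing it and collect
    references to modules that should never appear in a model file."""
    findings = []
    recent_strings = []  # STACK_GLOBAL takes its module/name from the stack
    for opcode, arg, _pos in pickletools.genops(raw):
        if opcode.name == "GLOBAL":  # older protocols: arg is "module name"
            module = arg.split()[0]
            if module in SUSPICIOUS_MODULES:
                findings.append(arg)
        elif opcode.name in ("SHORT_BINUNICODE", "BINUNICODE", "UNICODE"):
            recent_strings.append(arg)
        elif opcode.name == "STACK_GLOBAL":  # protocol 4+: module/name pushed as strings
            if len(recent_strings) >= 2 and recent_strings[-2] in SUSPICIOUS_MODULES:
                findings.append(f"{recent_strings[-2]}.{recent_strings[-1]}")
    return findings


# Hypothetical file name; scan before ever calling pickle.load() on it.
with open("model.pkl", "rb") as fh:
    hits = find_suspicious_imports(fh.read())

if hits:
    print("Refusing to load; suspicious imports found:", hits)
```

Because this kind of analysis reasons about opcodes rather than behavior, any gap between how the scanner parses a file and how the real unpickler does can be turned into an evasion technique.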

Why the AI Supply Chain Is at Risk

Several factors amplify the risk posed by malicious pickle files:

  • Implicit Trust: Many ML frameworks and pipelines implicitly trust serialized model files, loading them without verification or validation.
  • Open-Source Reliance: The widespread use of open-source models and code increases exposure to potentially compromised assets.
  • Insufficient Security Controls: Existing security tools and practices often fail to detect advanced or obfuscated threats embedded in pickle files.

Best Practices for Mitigation

To address these risks, organizations should adopt a proactive and security-first approach to managing AI model supply chains:

  1. Treat Model Files as Executables: Recognize that serialized model files can contain executable code and should be handled with the same caution as software binaries.
  2. Avoid Untrusted Sources: Only load pickle files from trusted, verifiable sources. Where possible, prefer serialization formats that do not support code execution, such as safetensors (see the sketch after this list).
  3. Enhance Supply Chain Security: Implement rigorous validation, continuous monitoring, and behavioral analysis of imported models, especially in production environments.
  4. Update Security Tools: Regularly update and test security scanners to detect new evasion techniques and ensure comprehensive coverage against emerging threats.
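As a concrete companion to points 1 and 2 above, the sketch below shows one way to avoid executing pickle code when loading model weights. It assumes PyTorch 1.13 or later (which introduced the weights_only flag) and the safetensors package; the file names are placeholders, and the final step assumes the checkpoint is a flat dictionary of tensors.

```python
import torch
from safetensors.torch import load_file, save_file

# 1. Prefer a format that cannot carry executable code at all. safetensors
#    stores raw tensors plus metadata, so loading never runs attacker code.
state_dict = load_file("model.safetensors")

# 2. If a pickle-based checkpoint is unavoidable, restrict the unpickler.
#    weights_only=True rejects pickles that reference arbitrary globals
#    instead of silently executing them.
checkpoint = torch.load("checkpoint.pt", weights_only=True)

# 3. Re-export trusted weights so downstream consumers never need to touch
#    the pickle path again (assumes a flat dict of tensors, i.e. a state_dict).
save_file(checkpoint, "checkpoint.safetensors")
```

Combining a code-free format for distribution with a restricted loader for legacy checkpoints removes the most direct path from a tainted model file to arbitrary code execution.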