Enterprises that pull community models: the May 2026 Hugging Face typosquat that installed a Windows infostealer

In May 2026, a typosquatted Hugging Face repository—Open-OSS/privacy-filter—distributed a loader that ultimately installed a Rust-based credential stealer on Windows machines; the repo was downloaded over 200,000 times before removal. For teams that pull community models into development or production, this incident reframes the threat: attackers can hide executable installers in model repos, not just poison training data.

Concrete mechanics of the attack chain

The malicious repo copied OpenAI’s Privacy Filter model card but contained a loader.py that disabled SSL verification and polled a public JSON paste service for commands, letting operators rotate payloads without changing the repo files. On Windows the loader executed concealed PowerShell commands to fetch a batch script that modified Microsoft Defender exclusions and created a one-shot SYSTEM scheduled task; that scheduled task ran the payload and immediately deleted itself, leaving little on-disk persistence.

The final stage was a compiled Rust infostealer that harvested Chromium/Firefox cookies and passwords, Discord tokens, FileZilla and wallet files, and AWS/other credentials; it used VM and sandbox detection and attempted to disable AMSI and ETW, then exfiltrated results over authenticated HTTP. The campaign’s multi-stage design — remote command fetch, ephemeral privilege escalation, and wrapped native binary — illustrates how model repos can serve as a long-lived staging point for rotating, high-privilege payloads.

Why this is not just “data poisoning”

ShinyHunters’ Salesforce assault exposed 275M Canvas users — a supply-chain problem, not just an LMS outage

Many defenders treat public models primarily as a data-integrity risk; this incident instead exploited executable code paths. Unsafe Python deserialization (pickle) and executable files in model.py or dependency lists can run arbitrary code at load time, so loading a model can be equivalent to running unreviewed scripts. Security firms ProtectAI, Hiddenlayer and Wiz have flagged more than 3,000 malicious files on Hugging Face, including fake profiles impersonating Meta, Visa, SpaceX and a 23AndMe genomics model that harvested AWS credentials — all demonstrating that the platform’s trust assumptions are routinely abused for code execution attacks.

Practical controls you can apply now

Treat models as code. At minimum, enforce three pipeline checkpoints: pre-download provenance (signed or verified sources), static scanning before deserialization (YARA, Gitleaks, custom OPA rules), and runtime containment (sandboxed loading, least-privilege execution, Linux security modules). Prefer non-executable serialization formats such as safetensors over pickle; where pickle is unavoidable, run deserialization in an isolated, ephemeral environment that cannot escalate to SYSTEM or access secret stores.

Stage / Artifact	Attacker technique	Checkpoint / Immediate action
Repository files (loader.py, model.py)	Remote command fetch, disabled SSL, embedded installers	Block network calls at import time; static scan for subprocess/PowerShell usage
Serialized models (.pkl)	Pickle deserialization executes code	Require safetensors or signed artifacts; isolate deserializers
Runtime behavior	Scheduled tasks, Defender exclusions, one-shot privilege escalation	Monitor for task creation, Defender policy changes, and unusual outbound HTTP

Operational checks, vendor signals, and the next standardization checkpoint

Audit immediate signals: search your dependency trees for .pkl and files importing subprocess/requests/os.system at load time; flag any model that reaches out to remote pastes or raw HTTP endpoints during import. Add runtime alerts for creation of scheduled tasks, changes to Defender policies, or processes spawning PowerShell from Python interpreters. Hugging Face has been verifying corporate profiles since 2022 and integrated ProtectAI scanning, but platform-level warnings are an imperfect substitute for organization-level controls.

The crucial next checkpoint is industry adoption of standardized security frameworks and automated model-supply-chain scanners: signed model manifests, mandatory static scanning in CI, and format whitelists (no pickles by default). Until such frameworks exist and are widely enforced, teams must apply policy gates in their MLOps pipelines and assume community models can contain executable installers as readily as useful weights.

Quick operational Q&A

Q — Should I stop using community models entirely?
A — Not necessarily; require provenance, scan artifacts, and load in an isolated environment before any network access or secret exposure.

Q — Is switching to safetensors enough?
A — It eliminates code execution via deserialization but doesn’t stop malicious loader scripts or dependency files; combine format controls with static scanning.

Q — How fast should I act?
A — Immediate: add CI static scans and block model imports that perform remote fetches at load time. Medium term: require signed manifests and sandboxed test loads before promoting models to shared environments.

Malware Found in Trending Hugging Face Repository “Open-OSS/privacy-filter”

Hackers Have Uploaded Thousands Of Malicious Files To AI’s Biggest Online Repository

Fake OpenAI Repository on Hugging Face Delivered Infostealer Malware to 244,000 Downloads

Tagged AI security, code execution attacks, credential theft, Hugging Face security, MLOps security, model repository threats, pickle deserialization risks, Rust infostealer, sandboxing models, static scanning

Future Byte Daily