Question 1

What is the ML supply chain and what components does it include?

Accepted Answer

The ML supply chain is the full set of components that together produce and run a machine learning system. It includes eight major components: (1) Training data sources and collection scripts. (2) Data preprocessing and transformation code. (3) Training framework dependencies (PyTorch, TensorFlow, JAX and all transitive packages). (4) Model weights, checkpoints, and the training scripts that produce them. (5) Model distribution channels such as Hugging Face Hub, private model registries, and artifact stores. (6) Serving framework dependencies including inference servers and containerisation. (7) Application dependencies for any product built on top of the model. (8) Agent memory and persistent context files for agentic deployments. Each component is a distinct attack surface. Compromising any one can affect the model's behaviour, the security of the serving environment, or the integrity of the agent's operating instructions.

Question 2

What is dependency confusion and how does it work?

Accepted Answer

Dependency confusion is a supply chain attack technique discovered and published by Alex Birsan in 2021. It exploits the resolution order that package managers use when multiple registries are configured. When an organisation has an internal package registry for private packages and also uses a public registry like PyPI or npm, some package manager configurations check the public registry for the highest available version rather than prioritising the internal registry. An attacker who learns the name of a private internal package can register a package with the same name on the public registry at a higher version number. The package manager then installs the attacker's public package instead of the legitimate internal one, because it appears to be a newer version. The attacker's package can contain any code including data exfiltration, backdoors, or system modification.

Question 3

What happened in the PyTorch December 2022 supply chain attack?

Accepted Answer

In December 2022, attackers published a malicious package named torchtriton on the public PyPI registry. torchtriton was a legitimate dependency of PyTorch's nightly build. PyTorch's nightly builds were configured to resolve dependencies in a way that checked PyPI before the official PyTorch package index. The malicious torchtriton package on PyPI was at a higher version than the legitimate one on the PyTorch index. Users who installed the PyTorch nightly build between December 25 and 30 2022 received the malicious package silently alongside their legitimate PyTorch installation. The package exfiltrated system hostname, username, current working directory, SSH keys from the .ssh directory, Git configuration, environment variables, and the contents of /etc/passwd. The PyTorch team discovered the attack and notified users on December 31 2022.

Question 4

How do pickle-based model files create a code execution risk?

Accepted Answer

Python's pickle serialisation format allows objects to define custom deserialization behaviour through the __reduce__ method. When a pickled object is loaded, Python calls __reduce__ to reconstruct the object. __reduce__ can return arbitrary code to execute. A malicious model file can include a pickled object whose __reduce__ method runs system commands: exfiltrating data, installing backdoors, or modifying files. PyTorch's .pt and .pth model files use pickle internally. Calling torch.load() on an untrusted model file without the weights_only=True parameter executes any code embedded in the file. Trail of Bits documented in 2022 that model files on Hugging Face were found to contain such payloads.

Question 5

What is safetensors and how does it prevent pickle exploits?

Accepted Answer

Safetensors is a model weight serialisation format developed by Hugging Face in 2022, designed explicitly to prevent the code execution risk of pickle. A safetensors file stores model weights as flat binary tensor data with a JSON metadata header describing tensor shapes and data types. There is no code execution path during loading: the loader reads the binary tensor data directly into memory without calling any user-defined methods. No __reduce__, no arbitrary Python, no code execution possible from the file contents. The safetensors library also validates the header before reading tensors, preventing certain header-manipulation attacks. Safetensors files use the .safetensors extension and can be loaded with the safetensors library or through transformers via trust_remote_code=False with safetensors preference.

Question 6

What is the Cisco 2025 agent memory attack and why is it different from prompt injection?

Accepted Answer

Cisco researchers Amy Chang and Idan Habler demonstrated in 2025 that a rogue npm or pip dependency can modify the memory file that Claude Code uses to store persistent agent instructions. When the agent starts up, it reads this memory file and follows the instructions it finds. If a malicious dependency has modified the file, the agent follows the attacker's instructions silently and indefinitely, with no error signal to the operator. This is different from prompt injection in several important ways: it does not require any user interaction after the initial malicious package install; it persists across agent restarts because the modification is in a persistent file; and it targets the agent's control plane (its operating identity and instructions) rather than individual runtime requests. The affected file is not runtime input but the equivalent of a startup configuration that defines how the agent behaves for all subsequent operations.

Question 7

How does VectaX encrypted memory fix the Cisco-style attack?

Accepted Answer

VectaX from Mirror Security stores agent memory in encrypted form using Fully Homomorphic Encryption (FHE). The memory file is ciphertext rather than plaintext. A malicious process that writes to the memory file cannot write valid encrypted instructions without the encryption key: it can only write arbitrary bytes that the agent cannot decrypt and interpret as instructions. When the agent attempts to read its memory, cryptographic verification fails on any tampered content before that content is ever used. The attacker can modify the file's bytes but cannot modify what the agent reads. This is a structural fix that removes the plaintext memory file from the threat surface entirely, compared to integrity detection (cryptographic hashing) which detects that a change occurred but does not prevent the attack from affecting the agent before detection and remediation happen.

Question 8

What is an ML SBOM and what should it contain?

Accepted Answer

An ML SBOM (Software Bill of Materials for machine learning) is a machine-readable inventory of every component that went into producing and running a model. A complete ML SBOM should include: training data sources with version identifiers and hashes, base model and checkpoint identifiers with SHA-256 hashes of weight files, all Python dependency versions and hashes (pip freeze output with hashes), preprocessing and evaluation code at a specific commit, serving framework and containerisation versions, and for agentic deployments the agent memory configuration. Standard SBOM formats are SPDX (ISO/IEC 5962:2021) and CycloneDX. The US Executive Order 14028 from 2021 requires SBOMs for software supplied to the federal government and has driven broader adoption. An SBOM makes it possible to trace any component back to its source and to detect when a component has been substituted.

Question 9

What are the SLSA framework levels for ML supply chains?

Accepted Answer

SLSA (Supply-chain Levels for Software Artifacts), developed by Google and now maintained by the OpenSSF, defines four levels of supply chain security. Level 1: the build (or training) process is scripted and documented, producing provenance information. Level 2: the build uses a version-controlled, hosted build service that generates authenticated provenance. Level 3: the build service is hardened against modification by the developer, and provenance is non-forgeable. Level 4: all changes require two-person review and builds are hermetic (isolated from external dependencies during execution). For ML training pipelines, Level 1 means the training script and configuration are version-controlled and reproducible. Level 2 adds authenticated provenance records that link model weights to the exact training run that produced them. Higher levels require hardened training infrastructure.

Question 10

How does a private package mirror prevent dependency confusion?

Accepted Answer

A private package mirror (such as JFrog Artifactory, Sonatype Nexus, or AWS CodeArtifact) serves as the sole package source for an organisation's build environments. Instead of pip querying PyPI directly, it queries the private mirror. The mirror is configured to proxy specific public packages from PyPI (with verification of hashes) and to serve internal packages. Crucially, the mirror can be configured to refuse any public package that has the same name as an internal package, regardless of version. This prevents the dependency confusion resolution vulnerability because the attacker's higher-versioned public package never reaches the pip resolver. Combined with pip install --index-url pointing only at the private mirror and --no-deps or hash pinning for critical dependencies, this provides a strong structural defence.

Question 11

What data does the PyTorch 2022 torchtriton attack exfiltrate?

Accepted Answer

The malicious torchtriton package published on PyPI in December 2022 exfiltrated the following from affected systems: the system hostname, the current username, the current working directory, the contents of SSH private and public keys from the user's .ssh directory, the Git global configuration file from ~/.gitconfig which may contain credentials, all environment variables (which may contain API keys, tokens, and other secrets), and the contents of /etc/passwd which contains user account information. The data was sent to a remote server controlled by the attackers. Any user who installed PyTorch nightly between December 25 and 30 2022 should consider their SSH keys and any API keys stored in environment variables or Git configuration to be compromised.

Question 12

What tools exist for scanning ML model files for malicious content?

Accepted Answer

Several tools address ML model file security. ModelScan is an open-source tool (from Protect AI) that scans model files including PyTorch .pt/.pth, TensorFlow SavedModel, and Keras files for unsafe serialisation patterns and known malicious payloads. It checks for dangerous pickle opcodes and known exploit patterns. The Hugging Face Hub runs automated safety scanning on uploaded models and flags models containing pickle-based dangerous patterns, marking them in the UI. PyTorch's own torch.load function supports weights_only=True mode since PyTorch 1.13 to prevent code execution during loading. For new model development, adopting safetensors format from the start avoids the risk entirely. Combining ModelScan scanning, weights_only loading, and safetensors preference provides layered defence.

ML Supply Chain
Security

The ML supply chain

Dependency confusion and typosquatting

Malicious ML packages beyond typosquatting

Pickle exploits in model weight files

Hugging Face model hub risk

The Cisco 2025 agent memory attack

SBOM for ML systems

Cryptographic integrity controls

Production supply chain security checklist

Encrypted agent memory and supply chain scanning for ML systems

ML Supply ChainSecurity