Dependency Confusion: Supply Chain Attack in Web3

definition

SUPPLY CHAIN ATTACK

What is Dependency Confusion?

A software supply chain attack where an attacker publishes a malicious package to a public repository with the same name as a private, internal dependency used by an organization.

Dependency confusion, also known as a namespace confusion attack, exploits the default behavior of package managers like npm, PyPI, or NuGet. When a build system (e.g., for Node.js, Python, or .NET) attempts to resolve a dependency, it typically checks public repositories before internal, private registries. If an attacker publishes a malicious package with a higher version number than the internal one, the package manager may automatically download and execute the public, hostile code, compromising the build pipeline and potentially the entire organization.

The attack relies on the precedence of package sources. In a standard configuration, tools like pip or npm query the public PyPI or npm registry first. An organization using a private package named @acme/internal-tool version 1.0.0 is vulnerable if an attacker uploads a package with the same name and version 99.0.0 to the public registry. The build system, seeking the "latest" version, will fetch the malicious 99.0.0 instead of the trusted internal 1.0.0. This attack vector is particularly effective against development and continuous integration/continuous deployment (CI/CD) environments.

Mitigation strategies involve configuring package managers to prioritize internal sources. This can be done by explicitly defining the private registry URL first in configuration files (e.g., .npmrc, pip.conf) or using scoped packages (like @company/package) which are less likely to have public conflicts. Additional defenses include using vendoring (checking dependencies into source control), employing tools that detect dependency mismatches, and implementing strict access controls and integrity verification for all external packages.

how-it-works

SECURITY VULNERABILITY

How Dependency Confusion Works

An attack vector that exploits the default behavior of package managers to inject malicious code into software builds.

Dependency confusion is a software supply chain attack where an attacker publishes a malicious package to a public repository with a higher version number than a legitimate internal package of the same name. This attack exploits the default resolution behavior of package managers like npm, PyPI, and NuGet, which typically prioritize the public repository over a private one when fetching dependencies. The core vulnerability lies in a misconfigured build environment that queries external package indexes before or instead of internal, private registries.

The attack unfolds in a predictable sequence. First, the attacker scouts a target organization for internal package names, often found in public source code or build logs. Next, they publish a malicious package with that exact name to a public registry, setting its version number higher than any existing internal version. When the organization's build system or a developer's machine runs an install command, the package manager pulls the higher-versioned, malicious package from the public source, believing it to be the intended update. This injects the attacker's code into the development or production pipeline.

This exploit is possible due to common configuration oversights. Many development teams use internal packages with names that are not namespaced or scoped uniquely. Furthermore, build tools are often configured with the public registry as the default or primary source. Without explicit configuration to prioritize the private feed or use mandatory scopes, the system is vulnerable. The malicious payload can range from data exfiltration and credential theft to establishing a persistent backdoor within the organization's applications.

A famous real-world example is the coordinated attack discovered by security researcher Alex Birsan in 2021, who successfully executed this technique against dozens of major tech companies. By identifying internal package names from various sources and publishing benign but detectable packages to public registries, he demonstrated the widespread prevalence of vulnerable configurations. This research highlighted that even organizations with robust security postures often overlook this specific build-chain dependency.

key-features

DEPENDENCY CONFUSION

Key Characteristics of the Attack

Dependency confusion is a supply chain attack that exploits the priority order of package managers to inject malicious code into a build process. Its effectiveness relies on specific technical and procedural weaknesses.

01

Exploits Package Manager Priority

The attack capitalizes on how package managers resolve dependencies. When a project references a private/internal package (e.g., @mycompany/tool), the attacker publishes a malicious package with the same name on a public registry (e.g., npm, PyPI). If the build system is configured to check public sources first, it will download and execute the attacker's code instead of the intended private package.

02

Relies on Weak or Absent Namespacing

This attack is most effective against organizations that do not use scoped packages or registry namespaces (e.g., @org/package). Using unscoped package names (e.g., internal-utils) makes them globally available for squatting on public registries. Proper configuration of scoped registries and .npmrc or pip.conf files to prioritize private sources is a critical defense.

03

Automated Discovery via Dependency Scanning

Attackers often use automated tools to scan for internal package names. They might:

Parse public source code repositories for package.json or requirements.txt files.
Examine leaked build logs or CI/CD configuration files.
Use common naming conventions for private packages (e.g., companyname-*). This automation allows for large-scale, opportunistic attacks.

04

Payload Execution During Build/Install

The malicious payload is executed during the install or build phase of the victim's development pipeline. Common payloads include:

Post-install scripts that run automatically upon package installation.
Malicious code bundled into the module itself, designed to exfiltrate secrets, source code, or environment variables.
Persistence mechanisms that create backdoors within the build environment or deployed application.

05

Targets CI/CD Pipelines and Developers

The primary targets are automated systems, not end-users. Continuous Integration/Continuous Deployment (CI/CD) pipelines are ideal because they run fresh installs in a trusted environment, often with elevated permissions and access to sensitive credentials. Individual developers running npm install or pip install on their machines are also vulnerable if their environment is misconfigured.

06

Preventive Configuration is Key

Mitigation is entirely configuration-based. Essential practices include:

Always using scoped package names for private dependencies.
Configuring package managers to exclusively use the private registry for those scopes.
Employing .npmrc, .yarnrc, or pip.conf files to enforce source priority.
Implementing registry firewalls or upstream proxies that block public requests for internal package names.

ecosystem-usage-context

SUPPLY CHAIN ATTACKS

Vulnerable Ecosystems & Tools

Dependency confusion is a software supply chain attack where an attacker publishes a malicious package to a public registry with a name identical to a private/internal dependency used by a target organization.

01

Attack Vector & Mechanism

The attack exploits how package managers (npm, PyPI, NuGet) resolve dependencies. When a project's configuration file (like package.json or requirements.txt) references a private package, the manager checks public registries first. An attacker publishes a higher-versioned, malicious package with the same name. The build system inadvertently downloads and executes the attacker's code, leading to remote code execution (RCE) or data exfiltration.

02

Prerequisites & Vulnerable Configs

This attack succeeds under specific, common conditions:

Mixed registry usage: Projects that pull some dependencies from public (npmjs.com) and some from private registries.
Missing scope or namespace: Private packages not published under an organization scope (e.g., @company/package).
Overly permissive resolution: Package managers configured without strict registry priority or allow-list rules.
Internal package names that are not globally unique (e.g., utils, api-client).

03

Real-World Example & Impact

In 2021, security researcher Alex Birsan demonstrated this attack against major tech firms (Apple, Microsoft, Tesla, Uber). By uploading benign but detectable packages to public registries, he proved the vulnerability was widespread. The impact is severe:

Full build pipeline compromise: Attackers gain access to CI/CD secrets and source code.
Downstream propagation: The malicious code is bundled into production applications.
Pivoting to internal networks: The payload can establish a foothold behind corporate firewalls.

04

Mitigation Strategies

Defense requires locking down the dependency resolution process:

Use scoped packages: Always publish private packages under an organization scope (e.g., @acme/core).
Configure registry priority: Force package managers to check the private registry first (e.g., using .npmrc with @acme:registry).
Employ allow-listing: Use tools like Sonatype Nexus or GitHub Packages to proxy public registries and block unwanted packages.
Implement dependency pinning: Use lockfiles (package-lock.json, pipenv) and hash verification.
Run internal registry mirrors for all dependencies.

05

Related Concepts

Typosquatting: Publishing malicious packages with names similar to popular ones (e.g., lodash vs lodashh).
Brandjacking: Uploading packages that appear to be official SDKs or tools for a company.
Software Bill of Materials (SBOM): A formal inventory of components used to track dependencies and identify vulnerabilities.
CI/CD Security: Securing the continuous integration and deployment pipelines that are the primary target of these attacks.

06

Tools for Detection & Prevention

Several tools help identify and prevent dependency confusion:

OSS Index / Snyk: Scans project dependencies for known vulnerabilities and malicious packages.
GitHub Dependabot / CodeQL: Monitors dependency manifests and alerts on suspicious changes or publicly known malicious packages.
Renovate: Automated dependency updates that can be configured with strict registry rules.
JFrog Xray: Scans artifacts and dependencies for security violations across development pipelines.
Manual auditing tools: Like npm audit or pip-audit for post-installation checks.

blockchain-context

SECURITY VULNERABILITY

Dependency Confusion in Blockchain Development

An attack vector where a malicious actor publishes a package with the same name as an internal, private dependency to a public registry, tricking a build system into downloading and executing the malicious code.

Dependency confusion, also known as a namespace confusion attack, is a software supply chain attack that exploits the default behavior of package managers like npm, pip, or Maven. When a project's build script references both private, internal packages and public repositories, the package manager typically prioritizes the public source if a higher version number is found. An attacker can typosquat or brandjack the name of a company's internal package and publish a malicious version with an inflated version number (e.g., 99.99.99) to a public registry. The next time the project's CI/CD pipeline runs npm install or pip install, it may inadvertently fetch and execute the attacker's code.

In the blockchain context, this vulnerability is particularly severe due to the high-value nature of the assets and operations involved. A malicious dependency could: - Compromise private keys or mnemonic phrases during build or runtime. - Inject backdoors into smart contract bytecode before deployment. - Tamper with deployment scripts to redirect funds or alter contract addresses. - Exfiltrate sensitive environment variables or configuration files containing RPC endpoints and API keys. The automated, trust-minimized nature of blockchain deployment pipelines makes them prime targets, as a successful attack can lead to the irreversible loss of funds or control over a protocol.

Mitigating dependency confusion requires a defense-in-depth strategy. Developers must configure package managers to explicitly prioritize private registries over public ones. For npm, this involves using scoped packages (@myorg/package-name) and setting the registry in a .npmrc file. For Python, using a --index-url flag pointing to a private PyPI server is crucial. Additionally, organizations should employ dependency auditing tools to scan for and alert on any attempts to pull packages from unexpected sources. Implementing strict version pinning and using lockfiles (package-lock.json, Pipfile.lock) can prevent automatic upgrades to malicious new versions, adding another layer of protection.

security-considerations

DEPENDENCY CONFUSION

Security Risks & Impact

Dependency confusion is a software supply chain attack where an attacker publishes a malicious package to a public registry (like npm, PyPI) with a name identical to a private, internal dependency used by an organization. Build systems, configured to prioritize public sources, inadvertently fetch and execute the attacker's code.

01

Attack Vector & Mechanism

The attack exploits the package manager's resolution order. Most tools (npm, pip, Maven) check public repositories before private ones. An attacker registers a package with the same name as a company's internal library. When a developer or CI/CD pipeline runs npm install or pip install, the system pulls the higher-versioned malicious package from the public registry instead of the trusted private one, introducing backdoors, data exfiltrators, or ransomware into the build.

02

Real-World Impact & Examples

This is not a theoretical risk. In 2021, security researcher Alex Birsan demonstrated the attack against Apple, Microsoft, Tesla, and Uber, successfully executing code on their internal systems by publishing benign test packages. The impact ranges from:

Credential Theft: Stealing AWS keys or internal API tokens.
Persistence: Establishing a foothold in development and production environments.
Reputation Damage: Compromised software shipped to customers. These proofs-of-concept led to widespread security reassessments.

03

Primary Risk Factors

Organizations are vulnerable if they:

Use internal private packages with generic names (e.g., @company/ui-components).
Lack scope configuration in package managers (e.g., not using npm scopes @company/ correctly).
Have insecure CI/CD pipelines that pull dependencies without explicit source locking.
Rely on public registry fallbacks without authentication requirements for private dependencies. The risk is highest in organizations with large, complex codebases and automated build processes.

04

Mitigation Strategies

Effective defenses involve configuring package managers to prioritize internal sources:

Use Scoped Packages: Adopt namespaced names (e.g., @yourcompany/pkg-name) which are globally unique.
Configure Registry Precedence: Always set the private registry as the primary source in .npmrc, pip.conf, or pom.xml.
Employ Authentication: Require credentials for private registry access; public registries should not have credentials.
Implement Lockfiles: Use package-lock.json, pipenv, or yarn.lock to pin exact dependency versions and sources.
Run Dependency Audits: Regularly scan for packages with names conflicting with internal ones.

05

Related Concepts

Dependency confusion is a subset of broader software supply chain threats:

Typosquatting: Malicious packages with names similar to popular ones (e.g., lodash vs lodash).
Brandjacking: Publishing malicious packages under a reputable organization's name.
Chain of Trust: The integrity model for verifying software from source to deployment, often enforced via code signing and SBOMs (Software Bill of Materials).
CI/CD Compromise: Attacks targeting the build pipeline itself, which dependency confusion can facilitate.

mitigation-strategies

DEPENDENCY CONFUSION

Prevention & Mitigation Strategies

Dependency confusion is a software supply chain attack where an attacker publishes a malicious package with a higher version number than a legitimate internal package to a public registry, tricking a build system into downloading and executing the attacker's code. These strategies focus on securing the package installation process.

01

Configure Private Registry Priority

The primary technical defense is to configure your package manager (npm, pip, Maven, etc.) to always prioritize your private registry over public ones like npmjs.org or PyPI. This is typically done via configuration files (e.g., .npmrc, pip.conf) or command-line flags that set the private registry as the upstream source, preventing the tool from inadvertently querying public repositories for internal package names.

02

Use Scoped Packages

Adopt scoped package names (e.g., @yourcompany/package-name) for all internal dependencies. Public registries enforce that scopes are unique to an organization or user, making it impossible for an attacker to publish a malicious package with the same scoped identifier. This provides a namespace-based guarantee against confusion attacks for those packages.

03

Implement Package Locking & Pinning

Use lockfiles (like package-lock.json, pipfile.lock, Cargo.lock) to pin the exact version and source of every dependency. Combine this with a policy of dependency pinning or using a hash-based integrity check (e.g., npm's integrity field) to ensure the build system installs only the cryptographically verified, intended artifact, regardless of registry configuration.

04

Deploy a Repository Firewall / Proxy

Deploy an intermediary proxy or repository firewall (e.g., Sonatype Nexus Firewall, JFrog Xray, Azure Artifacts) between your build systems and the public internet. These tools can:

Block downloads of public packages with names that conflict with internal ones.
Scan all downloaded packages for known vulnerabilities and malicious code.
Cache approved packages to improve security and build speed.

05

Conduct Regular Audits & Monitoring

Proactively monitor public registries for packages that squat on your internal package names. Use automated tools to:

Scan public registries for names matching your internal dependencies.
Audit your dependency trees (npm audit, snyk test, OWASP Dependency-Check) for known malicious packages.
Monitor build logs for unexpected downloads from public sources.

06

Enforce CI/CD Pipeline Security

Harden your continuous integration and deployment pipelines by:

Running builds in isolated, ephemeral environments with clean caches.
Using immutable, pre-approved base images for build agents.
Implementing mandatory code signing and attestation for internal packages.
Requiring multi-factor authentication for publishing to internal registries.

SUPPLY CHAIN ATTACKS

Dependency Confusion vs. Typosquatting

A comparison of two distinct software supply chain attack vectors that exploit package managers.

Feature	Dependency Confusion	Typosquatting
Core Attack Vector	Namespace/registry poisoning	Package name impersonation
Primary Target	Internal/private dependencies	Public, popular packages
Mechanism	Uploads a malicious public package with a higher version number than a private/internal package of the same name.	Registers a public package with a name similar to a legitimate one (e.g., 'requets' vs. 'requests').
Prerequisite	Relies on misconfigured package managers that query public registries before private ones.	Relies on developer typos or autocomplete errors during installation.
Victim Action	Runs a standard install/update command in a vulnerable environment.	Mistakenly types or selects the wrong package name.
Primary Defense	Correctly configuring internal package sources and using scoped packages or namespace protections.	Vigilance during installation, using lockfiles, and automated tooling to detect suspicious packages.
Example	A company uses an internal package '@acme/utils'. An attacker publishes 'utils' to npmjs.com with version 99.0.0.	An attacker publishes 'python-dateutl' to PyPI, hoping to catch users intending to install 'python-dateutil'.

real-world-examples

REAL-WORLD ATTACKS

Notable Incidents & Examples

Dependency confusion is a software supply chain attack where an attacker publishes a malicious package with the same name as a private, internal dependency to a public registry, tricking a build system into downloading and executing the attacker's code.

01

Alex Birsan's 2021 Proof-of-Concept

Security researcher Alex Birsan demonstrated the widespread vulnerability by publishing benign test packages to public registries like npm, PyPI, and RubyGems. The attack successfully infiltrated over 35 major tech companies, including Apple, Microsoft, and Tesla, by exploiting misconfigured internal build tools that prioritized public repositories over private ones.

EXPLORE

02

The PyTorch Nightly Build Compromise

In December 2022, the official PyTorch machine learning framework was compromised via a dependency confusion attack. A malicious package named torchtriton was uploaded to PyPI, impersonating an internal dependency. The malicious code exfiltrated system data, including environment variables and files. This incident highlighted the risk to even highly security-conscious open-source projects.

03

Attack Vector: Public Registry Priority

The core exploit relies on a build tool's dependency resolution order. Common misconfigurations include:

Not specifying a private registry in package manager config files (e.g., .npmrc, pip.conf).
Using unscoped package names for internal dependencies.
Public registries (npm, PyPI) being checked before internal artifact repositories by default.

04

Mitigation: Using Scoped Packages

A primary defense is using scoped package names (e.g., @company/private-package). Scopes are namespace identifiers that are unique to an organization or user on a registry. This prevents namespace squatting on public registries, as an attacker cannot publish a package under another organization's reserved scope.

05

Mitigation: Configuring Registry Precedence

Build systems must be explicitly configured to prioritize internal sources. Key actions include:

Setting the internal registry as the primary source in configuration files.
Using .npmrc scopes or PyPI's --index-url to point to private indexes.
Implementing repository firewalls or proxies (like JFrog Artifactory) that block public packages with names matching internal ones.

06

Related Concept: Typosquatting

While dependency confusion targets internal package names, typosquatting is a related attack targeting public packages. Attackers publish malicious packages with names similar to popular public packages (e.g., requets instead of requests), hoping developers will mistype the name during installation. Both are forms of software supply chain attack.

DEPENDENCY CONFUSION

Frequently Asked Questions

Dependency confusion is a software supply chain attack targeting build systems. It exploits the trust developers place in package managers by tricking them into downloading and executing malicious code from public repositories instead of intended internal packages. This section addresses common questions about its mechanisms, prevention, and real-world impact.

Dependency confusion is a software supply chain attack where an attacker publishes a malicious package with a higher version number to a public repository (like npm, PyPI, or RubyGems) using the same name as a private, internal package used by a target organization. The attack exploits the default behavior of package managers (e.g., npm, pip) which, when resolving dependencies, often prioritize the public repository over a private one if the public version has a higher semantic version number. This causes the build system to automatically download and execute the attacker's malicious code instead of the legitimate internal library, potentially leading to data exfiltration, backdoor installation, or further network compromise.

Dependency Confusion

What is Dependency Confusion?

How Dependency Confusion Works

Key Characteristics of the Attack

Exploits Package Manager Priority

Relies on Weak or Absent Namespacing

Automated Discovery via Dependency Scanning

Payload Execution During Build/Install

Targets CI/CD Pipelines and Developers

Preventive Configuration is Key

Vulnerable Ecosystems & Tools

Attack Vector & Mechanism

Prerequisites & Vulnerable Configs

Real-World Example & Impact

Mitigation Strategies

Related Concepts

Tools for Detection & Prevention

Dependency Confusion in Blockchain Development

Security Risks & Impact

Attack Vector & Mechanism

Real-World Impact & Examples

Primary Risk Factors

Mitigation Strategies

Related Concepts

Prevention & Mitigation Strategies

Configure Private Registry Priority

Use Scoped Packages

Implement Package Locking & Pinning

Deploy a Repository Firewall / Proxy

Conduct Regular Audits & Monitoring

Enforce CI/CD Pipeline Security

Dependency Confusion vs. Typosquatting

Notable Incidents & Examples

Alex Birsan's 2021 Proof-of-Concept

The PyTorch Nightly Build Compromise

Attack Vector: Public Registry Priority

Mitigation: Using Scoped Packages

Mitigation: Configuring Registry Precedence

Related Concept: Typosquatting

Frequently Asked Questions

Related Concepts & Attacks

The Attack Vector

Namespace Confusion & Typosquatting

Software Bill of Materials (SBOM)

Mitigation: Configuration & Scoping

Mitigation: Pipeline Hardening

Real-World Example & Impact

Get In Touch today.

Get In Touch
today.