The MCP package looked clean. The installed tree did not.

By Codcompass Team·2026-05-16·82 min read

Securing the MCP Tool Surface: Why Transitive Dependency Scanning is Non-Negotiable

Current Situation Analysis

The Model Context Protocol (MCP) has rapidly evolved into the standard interface for connecting AI agents to external tools and data sources. As organizations integrate MCP servers into their agent workflows, a critical security assumption is being tested: that the safety of an MCP server is defined by its top-level package.

This assumption is flawed. MCP servers are installable tool surfaces. When an operator or an agent runtime installs an MCP server, the package manager resolves a complete dependency tree. This tree includes transitive dependencies that the server author may not directly control, yet these dependencies execute within the same security context as the server itself.

A recent audit of 31 MCP server packages across npm and PyPI revealed a massive discrepancy between surface-level checks and actual runtime risk. The audit performed two distinct evaluations for each package:

Direct Package Check: Scanning only the top-level package metadata and declared dependencies.
Installed Tree Scan: Resolving the package, installing it in isolation, and scanning the full dependency graph.

The results were stark. Direct package checks identified only 1 finding across the entire population. In contrast, installed tree scans uncovered 69 findings. This indicates that 35% of the audited servers (11 out of 31) introduced vulnerabilities through their transitive dependencies, even when the top-level package appeared clean.

This gap is often overlooked because many security tools default to shallow scanning. Operators assume that if a package has no known vulnerabilities attached to its identifier, it is safe to deploy. However, the audit data proves that the risk profile of an MCP server is determined by its runtime dependency graph, not its registry entry. Furthermore, the audit covered 21 npm packages and 10 PyPI packages, highlighting that MCP is a polyglot ecosystem; security programs limited to a single language runtime are inherently incomplete.

WOW Moment: Key Findings

The delta between shallow and deep scanning is not marginal noise; it represents a fundamental blind spot in current MCP adoption practices. The severity distribution of the findings further underscores the urgency. The installed tree scan detected vulnerabilities ranging from critical to medium, with a significant concentration in high-severity categories.

Scan Strategy	Findings Detected	Unique Vulnerabilities	Critical/High Severity	Ecosystem Coverage
Top-Level Package Check	1	1	0	Single Ecosystem
Installed Dependency Tree	69	54	36 (2 Critical, 34 High)	npm + PyPI

Why this matters:

Risk Multiplier: The tree scan detected 68x more findings than the package check. Relying on shallow scans leaves the vast majority of supply chain risk undetected.
Severity Concentration: Of the 54 unique vulnerabilities found in trees, 36 were rated Critical or High. This is not low-severity clutter; these are actionable risks that could compromise the host environment or the data accessed by the MCP server.
Polyglot Reality: With nearly one-third of the targets being PyPI packages, any MCP security posture that ignores Python dependencies is leaving a significant portion of the attack surface unmonitored.

Core Solution

To mitigate transitive dependency risks in MCP deployments, organizations must shift from package-level validation to runtime tree analysis. This requires a workflow that resolves dependencies in isolation, captures the exact installed versions, and scans the full graph before deployment.

Architecture Decisions

Isolation: MCP servers must be installed in a sandboxed environment during the audit phase. Installing directly into a host environment risks contamination and makes it difficult to attribute findings to the specific server under test.
**

Lockfile Capture:** The audit process should generate and preserve lockfiles. This ensures reproducibility and allows for precise version pinning in production. 3. Polyglot Support: The scanning infrastructure must handle both npm and PyPI resolution mechanisms, as MCP servers are distributed across both ecosystems. 4. Metadata Enrichment: Beyond vulnerability scanning, the process should check registry metadata for deprecation status and yanked releases, which indicate maintenance abandonment or known issues.

Implementation Example

The following TypeScript example demonstrates a robust audit workflow. This implementation defines a McpServerRuntimeAuditor that orchestrates isolation, resolution, scanning, and metadata enrichment.

import { exec } from 'child_process';
import { promisify } from 'util';
import * as fs from 'fs/promises';
import * as path from 'path';

const execAsync = promisify(exec);

// Domain interfaces for the audit workflow
interface McpServerSpec {
  registry: 'npm' | 'pypi';
  identifier: string;
  version: string;
}

interface Vulnerability {
  id: string;
  severity: 'critical' | 'high' | 'medium' | 'low' | 'unknown';
  package: string;
  version: string;
  path: string;
}

interface AuditReport {
  spec: McpServerSpec;
  findings: Vulnerability[];
  treeDepth: number;
  metadata: {
    deprecated: boolean;
    yanked: boolean;
    lastPublished: string;
  };
  riskScore: number;
}

class McpServerRuntimeAuditor {
  private sandboxDir: string;

  constructor() {
    this.sandboxDir = path.join(process.cwd(), '.mcp-audit-sandbox');
  }

  async audit(spec: McpServerSpec): Promise<AuditReport> {
    // Step 1: Create isolated environment
    await this.prepareSandbox();

    try {
      // Step 2: Resolve and install in isolation
      await this.installInSandbox(spec);

      // Step 3: Capture dependency tree
      const tree = await this.captureDependencyTree(spec.registry);

      // Step 4: Scan tree for vulnerabilities
      const findings = await this.scanTree(tree);

      // Step 5: Enrich with registry metadata
      const metadata = await this.fetchRegistryMetadata(spec);

      // Step 6: Calculate risk score
      const riskScore = this.calculateRiskScore(findings, metadata);

      return {
        spec,
        findings,
        treeDepth: tree.depth,
        metadata,
        riskScore,
      };
    } finally {
      // Cleanup sandbox to prevent resource leaks
      await this.cleanupSandbox();
    }
  }

  private async prepareSandbox(): Promise<void> {
    await fs.rm(this.sandboxDir, { recursive: true, force: true });
    await fs.mkdir(this.sandboxDir, { recursive: true });
  }

  private async installInSandbox(spec: McpServerSpec): Promise<void> {
    const installCmd = spec.registry === 'npm'
      ? `npm install ${spec.identifier}@${spec.version} --no-save --ignore-scripts`
      : `pip install ${spec.identifier}==${spec.version} --target ${this.sandboxDir}/libs`;

    // Execute in sandbox directory
    await execAsync(installCmd, { cwd: this.sandboxDir });
  }

  private async captureDependencyTree(registry: 'npm' | 'pypi'): Promise<any> {
    // Implementation depends on SCA tool integration
    // For npm: npm ls --json
    // For PyPI: pip freeze or pipdeptree
    const cmd = registry === 'npm'
      ? 'npm ls --json --all'
      : 'pipdeptree --json';

    const { stdout } = await execAsync(cmd, { cwd: this.sandboxDir });
    return JSON.parse(stdout);
  }

  private async scanTree(tree: any): Promise<Vulnerability[]> {
    // Integrate with SCA engine (e.g., Trivy, Grype, npm audit, pip-audit)
    // This function parses the tree and queries vulnerability databases
    // Returns structured vulnerability objects
    return []; // Placeholder for actual scan results
  }

  private async fetchRegistryMetadata(spec: McpServerSpec): Promise<any> {
    // Query registry API for deprecation, yanked status, and timestamps
    // npm: npm view <pkg>
    // PyPI: pip index versions <pkg> or PyPI JSON API
    return { deprecated: false, yanked: false, lastPublished: new Date().toISOString() };
  }

  private calculateRiskScore(findings: Vulnerability[], metadata: any): number {
    let score = 0;
    const severityWeights = { critical: 10, high: 5, medium: 2, low: 1, unknown: 0 };

    for (const finding of findings) {
      score += severityWeights[finding.severity] || 0;
    }

    if (metadata.deprecated) score += 20;
    if (metadata.yanked) score += 30;

    return score;
  }

  private async cleanupSandbox(): Promise<void> {
    await fs.rm(this.sandboxDir, { recursive: true, force: true });
  }
}

// Usage Example
async function runAudit() {
  const auditor = new McpServerRuntimeAuditor();
  const serverSpec: McpServerSpec = {
    registry: 'npm',
    identifier: '@example/mcp-server',
    version: '1.2.0',
  };

  const report = await auditor.audit(serverSpec);
  
  if (report.riskScore > 50) {
    console.error(`High risk detected for ${serverSpec.identifier}. Score: ${report.riskScore}`);
    process.exit(1);
  }
  
  console.log('Audit passed. Report:', JSON.stringify(report, null, 2));
}

Rationale for Choices:

--ignore-scripts / --target: Prevents execution of arbitrary install scripts during the audit, reducing the risk of the audit process itself being compromised.
Sandbox Cleanup: Ensures no residual artifacts remain, which is critical for CI/CD pipelines running multiple audits.
Risk Scoring: Aggregates findings and metadata into a single score, enabling automated gating in deployment pipelines.

Pitfall Guide

The "Clean Package" Fallacy
- Explanation: Assuming a package is safe because npm audit <pkg> or pip audit <pkg> returns no results. This only checks the top-level package, ignoring transitive dependencies where 98% of vulnerabilities in this audit were hidden.
- Fix: Always scan the resolved dependency tree. Use tools that analyze the lockfile or installed node_modules/site-packages directory.
Polyglot Blindness
- Explanation: Implementing security checks only for npm while ignoring PyPI. MCP servers are distributed across both ecosystems. An audit program that only scans JavaScript packages leaves Python-based MCP servers unverified.
- Fix: Ensure your scanning infrastructure supports both npm and PyPI resolution and vulnerability databases.
Floating Version Dependencies
- Explanation: Configuring MCP servers with floating versions (e.g., ^1.0.0 or latest). This allows the runtime to pull in new versions of dependencies that may introduce vulnerabilities without explicit approval.
- Fix: Pin exact versions in MCP configurations. Use lockfiles to enforce deterministic builds.
Ignoring Deprecation and Yanked Status
- Explanation: Focusing solely on CVEs and missing packages that are deprecated or yanked. A deprecated package may not have active maintenance, meaning new vulnerabilities will not be patched. Yanked releases indicate known issues.
- Fix: Enrich audit results with registry metadata. Flag any server using deprecated or yanked packages as high risk.
Context Blindness in Severity Assessment
- Explanation: Treating all vulnerabilities as equally exploitable. Some findings in the dependency tree may be in code paths not executed by the MCP server, or may require conditions not present in the agent runtime.
- Fix: Perform triage to assess exploitability. However, do not dismiss findings without analysis; the audit showed that many high-severity issues were present, and context analysis should be a secondary step, not a replacement for scanning.
Static Analysis Only
- Explanation: Relying on source code scanning without checking dependencies. Source scanning cannot detect vulnerabilities in third-party libraries that are pulled in at install time.
- Fix: Combine static analysis with Software Composition Analysis (SCA) of the runtime dependency tree.
Host Environment Contamination
- Explanation: Installing MCP servers directly into the host environment for testing. This can mask dependency conflicts and make it difficult to isolate which vulnerabilities belong to which server.
- Fix: Use containerized or ephemeral environments for installation and scanning. Never audit in a shared host context.

Production Bundle

Action Checklist

Implement Transitive Scanning: Configure your CI/CD pipeline to scan the full dependency tree of all MCP servers before deployment.
Enforce Version Pinning: Update MCP configurations to use exact version pins. Reject floating version ranges.
Enable Polyglot Support: Verify that your security tools cover both npm and PyPI ecosystems.
Isolate Execution: Run MCP servers in isolated containers or sandboxes to limit blast radius.
Monitor Metadata: Set up alerts for deprecation and yanked releases of MCP servers in use.
Review High/Critical Findings: Establish a triage process for vulnerabilities detected in dependency trees.
Audit Lockfiles: Ensure lockfiles are committed and scanned, not just package.json or requirements.txt.
Test in Staging: Validate MCP server behavior and security posture in a staging environment that mirrors production.

Decision Matrix

Scenario	Recommended Approach	Why	Cost Impact
Pre-Production MCP Evaluation	Full Tree Scan + Isolation	Comprehensive risk assessment before integration.	Low (CI time increase)
Runtime MCP Server Deployment	Version Pinning + Sandboxed Execution	Mitigates risk of drift and limits exploitability.	Medium (Infra complexity)
Legacy MCP Integration	Shallow Scan + Upgrade Plan	Triage existing risk while planning remediation.	Low (Immediate) / High (Remediation)
High-Risk Data Access	Full Tree Scan + Manual Triage	Ensures no vulnerabilities exist in critical paths.	High (Engineering effort)
Internal MCP Tools	Automated Tree Scan + Gating	Prevents introduction of vulnerable dependencies.	Low (Automation setup)

Configuration Template

The following GitHub Actions workflow snippet demonstrates how to integrate MCP server auditing into a pipeline. This example uses a hypothetical mcp-audit action that performs tree scanning.

name: MCP Server Security Audit

on:
  pull_request:
    paths:
      - 'mcp-servers/**'
  schedule:
    - cron: '0 0 * * 1' # Weekly scan

jobs:
  audit-mcp-servers:
    runs-on: ubuntu-latest
    strategy:
      matrix:
        # Define MCP servers to audit
        server:
          - { registry: 'npm', id: '@org/mcp-data-tool', version: '2.1.0' }
          - { registry: 'pypi', id: 'mcp-file-processor', version: '1.4.2' }

    steps:
      - name: Checkout Repository
        uses: actions/checkout@v4

      - name: Run MCP Runtime Audit
        uses: codcompass/mcp-audit-action@v1
        with:
          registry: ${{ matrix.server.registry }}
          identifier: ${{ matrix.server.id }}
          version: ${{ matrix.server.version }}
          fail-on-severity: 'high'
          output-format: 'sarif'

      - name: Upload Audit Results
        if: always()
        uses: github/codeql-action/upload-sarif@v3
        with:
          sarif_file: audit-results.sarif

Quick Start Guide

Inventory MCP Servers: List all MCP servers currently in use or planned for integration, including registry and version.
Run Initial Audit: Execute a tree scan against each server using the McpServerRuntimeAuditor or equivalent tool. Review the findings report.
Pin Versions: Update your MCP configurations to pin exact versions based on the audit results. Remove floating ranges.
Integrate CI: Add the audit step to your CI pipeline to prevent regression. Configure gating on high-severity findings.
Monitor: Set up periodic scans to detect new vulnerabilities in existing servers and monitor registry metadata for deprecations.

🎉 Mid-Year Sale — Unlock Full Article

Base plan from just $4.99/mo or $49/yr

7-day free trial · Cancel anytime · 30-day money-back