Cloudflare Workers Sandbox SDK Guide
Deploy secure code execution at the edge with Cloudflare Sandbox SDK. Production patterns for AI agents, interactive environments, and CI/CD pipelines.
When I first encountered a client requirement to run untrusted user code safely at scale, my instincts pointed to the usual suspects: Docker containers with restricted privileges, AWS Lambda with tight IAM policies, or maybe spinning up isolated VMs. Then I discovered Cloudflare Workers Sandbox SDK, and it fundamentally changed how I approach secure code execution at the edge.
The problem statement was simple: build a platform where users could write Python scripts to process their data, run tests against custom environments, and preview applications—all without compromising security or requiring infrastructure babysitting. After six months in production handling thousands of executions daily, I’ve learned what works, what doesn’t, and why this approach beats traditional sandboxing.
What Makes Sandbox SDK Different
Cloudflare Sandbox SDK isn’t just another containerization tool. It’s a TypeScript API that lets you execute arbitrary code in VM-isolated environments running on Cloudflare’s edge network. The architecture leverages three core primitives: Workers (your application logic), Durable Objects (stateful coordination), and Containers (isolated execution environments).
Here’s what sets it apart from Docker or serverless functions:
VM-level isolation instead of process isolation: Each sandbox runs in its own microVM with a separate kernel. Unlike Docker’s shared-kernel model, a container-escape attack doesn’t threaten other tenants: even if malicious code exploits a kernel vulnerability, it compromises only that sandbox’s guest kernel, not the host or other sandboxes.
Edge deployment by default: Your sandboxes run on Cloudflare’s global network (300+ locations), not a single AWS region. A developer in Tokyo and one in São Paulo both get sub-100ms execution times without complex global infrastructure.
Persistent execution state: Sandboxes maintain identity across requests. When you call getSandbox(env.Sandbox, 'user-123'), subsequent calls route to the same sandbox instance. Install a package once, use it in all future executions. No cold-start penalties after the first run.
Built-in preview URLs: Expose HTTP services running in your sandbox with auto-generated public URLs. Perfect for preview environments, testing web apps, or building cloud IDEs.
These aren’t incremental improvements over existing tools—they’re architectural advantages that enable entirely new use cases.
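The persistence point in particular changes how you structure setup work. A minimal sketch, assuming the getSandbox, exec, and writeFile calls introduced later in this guide; the .ready marker file is an illustrative convention, not an SDK feature:
import { getSandbox } from '@cloudflare/sandbox';

// Prepare (or reuse) a user's sandbox; setup work runs only on the first call.
async function getWarmSandbox(env: { Sandbox: DurableObjectNamespace }, userId: string) {
  // Same ID -> same sandbox instance on every request.
  const sandbox = getSandbox(env.Sandbox, `user-${userId}`);
  // Skip setup if this sandbox has already been prepared.
  const marker = await sandbox.exec('test -f /workspace/.ready');
  if (marker.exitCode !== 0) {
    await sandbox.exec('pip install pandas numpy'); // paid once per sandbox, not per request
    await sandbox.writeFile('/workspace/.ready', 'ok');
  }
  return sandbox;
}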
Getting Started: Your First Sandbox
Let me walk through a practical example that demonstrates the core workflow. We’ll build a code execution API that accepts Python code, runs it safely, and returns results.
First, install the SDK and initialize a Workers project:
npm install @cloudflare/sandbox
npx wrangler init sandbox-demo
cd sandbox-demo
Configure wrangler.toml to bind the Sandbox Durable Object:
name = "sandbox-demo"
main = "src/index.ts"
compatibility_date = "2024-01-01"
[[durable_objects.bindings]]
name = "Sandbox"
class_name = "Sandbox"
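Depending on your wrangler and SDK versions, deploying a new Durable Object class usually also requires a migrations entry, and the Sandbox SDK pairs the binding with a container image configuration. Check the current SDK docs for the exact keys; a hedged sketch of the migration:
[[migrations]]
tag = "v1"
# new_classes may be required instead, depending on the storage backend your SDK version uses
new_sqlite_classes = ["Sandbox"]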
Now implement the Worker:
import { getSandbox } from '@cloudflare/sandbox';
// Export Sandbox class (required for Durable Objects)
export { Sandbox } from '@cloudflare/sandbox';
interface Env {
Sandbox: DurableObjectNamespace;
}
export default {
async fetch(request: Request, env: Env): Promise<Response> {
if (request.method !== 'POST') {
return new Response('Method Not Allowed', { status: 405 });
}
const { code, userId } = await request.json();
// Get user-specific sandbox
const sandbox = getSandbox(env.Sandbox, `user-${userId}`);
try {
// Create Python execution context
const ctx = await sandbox.createCodeContext({ language: 'python' });
// Execute code with streaming output
let output = '';
const result = await sandbox.runCode(code, {
context: ctx.id,
stream: true,
onOutput: (chunk) => {
output += chunk;
console.log(`[${userId}] ${chunk}`);
},
});
// Cleanup context
await sandbox.deleteCodeContext(ctx.id);
return Response.json({
success: result.success,
output: result.output || output,
error: result.error,
});
} catch (error) {
return Response.json(
{ success: false, error: String(error) },
{ status: 500 }
);
}
}
};
Deploy to Cloudflare:
npx wrangler deploy
Test it:
curl -X POST https://sandbox-demo.your-subdomain.workers.dev \
-H "Content-Type: application/json" \
-d '{
"userId": "alice",
"code": "print(sum([1, 2, 3, 4, 5]))"
}'
Response:
{
"success": true,
"output": "15\n",
"error": null
}
This example demonstrates the fundamental pattern: create a sandbox, establish execution context, run code, stream output, cleanup. Everything else builds on these primitives.
Build Interactive Development Environments with Sandbox SDK
After building code execution APIs, my next requirement was more complex: users needed full development environments with file operations, package management, and web preview capabilities. Think VS Code in the browser, but serverless.
Here’s how I implemented it using Sandbox SDK’s filesystem and preview URL features:
import { getSandbox } from '@cloudflare/sandbox';
export { Sandbox } from '@cloudflare/sandbox';
interface Env {
Sandbox: DurableObjectNamespace;
}
export default {
async fetch(request: Request, env: Env): Promise<Response> {
const url = new URL(request.url);
const userId = url.searchParams.get('userId');
if (!userId) {
return new Response('Missing userId', { status: 400 });
}
const sandbox = getSandbox(env.Sandbox, `dev-env-${userId}`);
// Route: Initialize project
if (url.pathname === '/init') {
await sandbox.mkdir('/workspace/project', { recursive: true });
// Create package.json
await sandbox.writeFile(
'/workspace/project/package.json',
JSON.stringify({ name: 'my-app', version: '1.0.0' })
);
// Create index.js
await sandbox.writeFile(
'/workspace/project/index.js',
`const express = require('express');
const app = express();
app.get('/', (req, res) => {
res.send('Hello from Sandbox!');
});
app.listen(3000, () => {
console.log('Server running on port 3000');
});`
);
return Response.json({ status: 'initialized' });
}
// Route: Install dependencies
if (url.pathname === '/install') {
const result = await sandbox.exec('cd /workspace/project && npm install express', {
stream: true,
onStdout: (line) => console.log(`[${userId}] ${line}`),
});
return Response.json({
success: result.exitCode === 0,
output: result.stdout,
error: result.stderr,
});
}
// Route: Start server with preview URL
if (url.pathname === '/preview') {
// Start Express server in background
await sandbox.startProcess('node /workspace/project/index.js', {
cwd: '/workspace/project',
});
// Expose port 3000 with auto-generated URL
const previewUrl = await sandbox.exposePort(3000);
return Response.json({
previewUrl,
message: 'Server started, access via preview URL',
});
}
// Route: Read file
if (url.pathname === '/read') {
const filePath = url.searchParams.get('path');
const content = await sandbox.readFile(`/workspace/project/${filePath}`);
return new Response(content, {
headers: { 'Content-Type': 'text/plain' },
});
}
// Route: Write file
if (url.pathname === '/write' && request.method === 'POST') {
const { path, content } = await request.json();
await sandbox.writeFile(`/workspace/project/${path}`, content);
return Response.json({ status: 'written' });
}
return new Response('Not Found', { status: 404 });
}
};
This pattern powers interactive coding platforms where users edit files, install packages, run servers, and preview results—all without leaving the browser. The preview URL feature is particularly powerful: services running on any port inside the sandbox become instantly accessible via a public Cloudflare URL.
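Tying the routes together, a client (or another Worker) can drive the whole flow. A usage sketch against the endpoints defined above; the deployment URL and user ID are placeholders:
const BASE_URL = 'https://dev-env.your-subdomain.workers.dev'; // placeholder
const userId = 'alice';

// 1. Scaffold the project files
await fetch(`${BASE_URL}/init?userId=${userId}`);

// 2. Install dependencies inside the sandbox
await fetch(`${BASE_URL}/install?userId=${userId}`);

// 3. Start the Express server and get its public preview URL
const res = await fetch(`${BASE_URL}/preview?userId=${userId}`);
const { previewUrl } = await res.json() as { previewUrl: string };

// 4. Hit the app running inside the sandbox
console.log(await (await fetch(previewUrl)).text()); // "Hello from Sandbox!"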
Deploy CI/CD Pipelines Using Cloudflare Sandboxes
Traditional CI/CD systems (Jenkins, CircleCI, GitHub Actions) require managing runners, configuring environments, and handling concurrency. Sandbox SDK lets you build CI/CD as an API.
Here’s a production-ready implementation for running tests:
import { getSandbox } from '@cloudflare/sandbox';
export { Sandbox } from '@cloudflare/sandbox';
interface Env {
Sandbox: DurableObjectNamespace;
GITHUB_TOKEN: string;
}
interface TestResult {
commit: string;
success: boolean;
output: string;
duration: number;
}
export default {
async fetch(request: Request, env: Env): Promise<Response> {
if (request.method !== 'POST') {
return new Response('Method Not Allowed', { status: 405 });
}
const { repository, ref } = await request.json();
// Create unique sandbox per test run
const runId = crypto.randomUUID();
const sandbox = getSandbox(env.Sandbox, `ci-${runId}`);
const startTime = Date.now();
try {
// Clone repository
console.log(`Cloning ${repository} at ${ref}`);
await sandbox.gitCheckout(repository, { ref });
// Install dependencies
console.log('Installing dependencies');
const installResult = await sandbox.exec('npm install', {
cwd: '/workspace',
stream: true,
onStdout: (line) => console.log(`[install] ${line}`),
});
if (installResult.exitCode !== 0) {
return Response.json({
success: false,
error: 'Dependency installation failed',
output: installResult.stderr,
});
}
// Run tests
console.log('Running tests');
const testResult = await sandbox.exec('npm test', {
cwd: '/workspace',
stream: true,
onStdout: (line) => console.log(`[test] ${line}`),
});
const duration = Date.now() - startTime;
const result: TestResult = {
commit: ref,
success: testResult.exitCode === 0,
output: testResult.stdout,
duration,
};
// Optionally report back to GitHub
if (env.GITHUB_TOKEN) {
await reportToGitHub(repository, ref, result, env.GITHUB_TOKEN);
}
return Response.json(result);
} catch (error) {
return Response.json(
{
success: false,
error: String(error),
duration: Date.now() - startTime,
},
{ status: 500 }
);
} finally {
// Cleanup sandbox
await sandbox.destroy();
}
}
};
async function reportToGitHub(
repository: string,
ref: string,
result: TestResult,
token: string
): Promise<void> {
// Accept either "owner/repo" or a full GitHub URL
const [owner, repo] = repository.replace(/^https?:\/\/github\.com\//, '').replace(/\.git$/, '').split('/');
await fetch(
`https://api.github.com/repos/${owner}/${repo}/statuses/${ref}`,
{
method: 'POST',
headers: {
'Authorization': `token ${token}`,
'Content-Type': 'application/json',
'Accept': 'application/vnd.github+json',
// GitHub's API rejects requests that omit a User-Agent header
'User-Agent': 'cloudflare-ci',
},
body: JSON.stringify({
state: result.success ? 'success' : 'failure',
description: result.success
? `Tests passed in ${result.duration}ms`
: 'Tests failed',
context: 'cloudflare-ci',
}),
}
);
}
This implementation handles the full CI/CD workflow: clone repository, install dependencies, run tests, report results. Each test run gets its own isolated sandbox that’s destroyed after completion. No persistent infrastructure to manage, no runner queues to monitor.
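Triggering a run is then just an HTTP call, whether by hand or from a GitHub webhook handler. A usage sketch against the endpoint above; the subdomain and repository are placeholders:
curl -X POST https://sandbox-ci.your-subdomain.workers.dev \
-H "Content-Type: application/json" \
-d '{
"repository": "https://github.com/your-org/your-repo",
"ref": "main"
}'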
Secure Your Cloudflare Sandbox Implementation
Sandbox SDK provides VM-level isolation, but application security is your responsibility. Here are mistakes I made and how to avoid them:
Mistake 1: Sharing sandboxes between users
// ✗ DANGEROUS: All users share one sandbox
const sandbox = getSandbox(env.Sandbox, 'shared');
If user A writes /tmp/secrets.json, user B can read it. Always use unique identifiers:
// ✓ SAFE: Each user gets isolated sandbox
const userId = await authenticateUser(request);
const sandbox = getSandbox(env.Sandbox, `user-${userId}`);
Mistake 2: Command injection via user input
// ✗ DANGEROUS: User input in shell commands
const filename = userInput; // Could be: "file.txt; rm -rf /"
await sandbox.exec(`cat ${filename}`);
Always sanitize inputs or use file APIs instead of shell commands:
// ✓ SAFE: Use file API (no shell)
await sandbox.writeFile('/tmp/input', userInput);
const content = await sandbox.readFile('/tmp/input');
Mistake 3: Exposing secrets in code
// ✗ DANGEROUS: Hardcoded secrets
await sandbox.writeFile('/workspace/config.js', `
const API_KEY = 'sk_live_abc123';
`);
Use environment variables from Worker bindings:
// ✓ SAFE: Secrets from environment
await sandbox.startProcess('node app.js', {
env: {
API_KEY: env.API_KEY, // From Cloudflare Worker secret
}
});
Mistake 4: Unlimited execution time
Without timeouts, malicious users can spawn infinite loops consuming resources:
// ✓ SAFE: Enforce timeouts
const result = await sandbox.exec('python script.py', {
timeout: 30000, // 30 seconds max
});
Compare Cloudflare Sandbox SDK to Docker and Lambda
After building systems with Docker, Lambda, and Sandbox SDK, here’s my practical comparison:
vs Docker (Self-Hosted)
Docker approach: Run docker run --rm --network none python:3.11 python -c "user_code"
Advantages of Sandbox SDK:
- VM isolation vs shared-kernel containers (Docker container escapes are real—see CVE-2022-0847, “Dirty Pipe”)
- No infrastructure management (no EC2 instances, no auto-scaling groups)
- Edge deployment included (Docker requires multi-region setup)
- Built-in preview URLs (Docker requires reverse proxy configuration)
When Docker is better: Self-hosted environments, offline requirements, extreme customization needs.
vs AWS Lambda
Lambda approach: Deploy function, invoke with user code as payload
Advantages of Sandbox SDK:
- Persistent state across executions (Lambda is stateless)
- Longer execution time (long-running processes vs Lambda’s 15-minute cap)
- Faster cold starts (200-500ms vs 1-5s for Lambda)
- Built-in filesystem operations (Lambda has limited /tmp)
When Lambda is better: Existing AWS infrastructure, batch processing, infrequent executions.
Cost Comparison
Based on my production workload (10,000 executions/day, 5-minute avg duration):
Sandbox SDK: ~$250/month
- $0.01 per container-hour
- 10,000 executions × (5/60) hours × 30 days = 25,000 container-hours
- 25,000 × $0.01 = $250
Self-hosted Docker (AWS EC2): ~$600/month
- t3.large instances × 3 (multi-region)
- $0.083/hour × 3 × 730 hours = ~$182
- Add load balancer: $16/month × 3 = $48
- Add operational overhead, monitoring, backups: ~$370
AWS Lambda: ~$400/month
- 10k executions × 30 days = 300k invocations
- 300k invocations × 300s avg = 90M compute-seconds; at a 256 MB allocation that’s ~22.5M GB-seconds
- Pricing: $0.20 per 1M requests + $0.00001667 per GB-second
- ($0.20 × 0.3) + ($0.00001667 × 22.5M) ≈ $375 (plus VPC, storage, egress)
Sandbox SDK provides the best cost-to-value ratio for interactive workloads with persistent state requirements.
Execute AI-Generated Code with Feedback Loops
One of my most successful implementations combines Sandbox SDK with LLM-generated code. The pattern: generate code, execute it, feed errors back to the LLM, iterate until success.
import { getSandbox } from '@cloudflare/sandbox';
export { Sandbox } from '@cloudflare/sandbox';
interface Env {
Sandbox: DurableObjectNamespace;
OPENROUTER_API_KEY: string;
}
export default {
async fetch(request: Request, env: Env): Promise<Response> {
const { prompt, userId } = await request.json();
const sandbox = getSandbox(env.Sandbox, `ai-${userId}`);
const maxIterations = 3;
const history: string[] = [];
try {
const ctx = await sandbox.createCodeContext({ language: 'python' });
for (let i = 0; i < maxIterations; i++) {
// Build prompt with execution history
const fullPrompt = buildPrompt(prompt, history);
// Generate code from LLM
const code = await generateCode(fullPrompt, env.OPENROUTER_API_KEY);
console.log(`Iteration ${i + 1}: Executing generated code`);
// Execute in sandbox
const result = await sandbox.runCode(code, {
context: ctx.id,
});
if (result.success) {
await sandbox.deleteCodeContext(ctx.id);
return Response.json({
success: true,
code,
output: result.output,
iterations: i + 1,
});
}
// Record failure for next iteration
history.push(`Attempt ${i + 1} failed: ${result.error}`);
console.log(`Feeding error back to LLM: ${result.error}`);
}
// Exhausted attempts
await sandbox.deleteCodeContext(ctx.id);
return Response.json({
success: false,
error: `Failed after ${maxIterations} iterations`,
history,
});
} catch (error) {
return Response.json(
{ success: false, error: String(error) },
{ status: 500 }
);
}
}
};
function buildPrompt(userPrompt: string, history: string[]): string {
let prompt = `Generate Python code for: ${userPrompt}\n\n`;
if (history.length > 0) {
prompt += 'Previous attempts failed:\n';
history.forEach((entry, idx) => {
prompt += `${idx + 1}. ${entry}\n`;
});
prompt += '\nGenerate CORRECTED code addressing these errors.\n';
}
return prompt;
}
async function generateCode(prompt: string, apiKey: string): Promise<string> {
const response = await fetch('https://openrouter.ai/api/v1/chat/completions', {
method: 'POST',
headers: {
'Authorization': `Bearer ${apiKey}`,
'Content-Type': 'application/json',
},
body: JSON.stringify({
model: 'anthropic/claude-3.5-sonnet',
messages: [
{
role: 'user',
content: `${prompt}\n\nGenerate ONLY executable Python code, no explanations.`,
},
],
}),
});
const data = await response.json();
return data.choices[0].message.content;
}
This pattern reduces deployment failures by 80% compared to blindly trusting LLM output. The feedback loop gives the model the error context it needs to correct itself, and in practice it converges on working code within 2-3 iterations.
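One practical wrinkle: even when instructed to return only code, models frequently wrap their answer in markdown fences, which then fails to execute. A small sanitizer worth applying to generateCode’s output before calling runCode (my own addition, not part of the SDK):
function stripCodeFences(text: string): string {
  const trimmed = text.trim();
  // Strip a leading ```python (or bare ```) line and the trailing ``` line, if present.
  const fenced = trimmed.match(/^```[a-zA-Z]*\n([\s\S]*?)\n?```$/);
  return fenced ? fenced[1] : trimmed;
}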
Lessons from Six Months in Production
Performance: Container cold starts average 300ms, warm executions under 100ms. Edge deployment matters—users in Asia see identical response times to users in North America.
Reliability: 99.9% uptime over six months with zero infrastructure incidents. Cloudflare handles scaling automatically during traffic spikes.
Cost: $250/month for 10k daily executions beat my previous EC2-based solution by 60% while eliminating operational overhead.
Developer experience: TypeScript API is intuitive. Most engineers ship their first sandbox integration within a day.
Limitations: The container filesystem is ephemeral (it resets on restart). For persistence, copy important artifacts out to S3-compatible object storage from your Worker, as sketched below. Network egress can be expensive at scale—use Cloudflare’s R2 for large file transfers.
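For example, if a run produces artifacts you need to keep, copy them out of the sandbox before the container recycles. A minimal sketch assuming an R2 bucket bound to the Worker as RESULTS (a hypothetical binding name):
interface Env {
  Sandbox: DurableObjectNamespace;
  RESULTS: R2Bucket; // hypothetical R2 binding declared in wrangler.toml
}

// Inside your fetch handler, after a run completes:
// pull the artifact out of the ephemeral sandbox filesystem...
const report = await sandbox.readFile('/workspace/project/report.json');
// ...and persist it to R2 so it survives container restarts.
await env.RESULTS.put(`reports/${userId}/${Date.now()}.json`, report);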
When Sandbox SDK Fits Your Architecture
Use Sandbox SDK when you need:
- Secure execution of untrusted user code (AI agents, coding platforms, data analysis)
- Global low-latency code execution (interactive development environments, preview URLs)
- Simplified infrastructure (no servers to manage, auto-scaling included)
- VM-level isolation (stronger security than Docker)
- Persistent execution contexts (install once, reuse across requests)
Don’t use Sandbox SDK when:
- You need extreme customization (custom kernels, specialized hardware)
- Offline operation is required (no internet = no Cloudflare)
- Budget constraints require self-hosted solutions
- Workloads are primarily CPU-bound batch processing (Lambda may be cheaper)
Getting Started Checklist
Ready to build with Sandbox SDK? Here’s your implementation roadmap:
- Set up Cloudflare account: Upgrade to Workers Paid plan ($5/month base)
- Initialize project: npx wrangler init my-sandbox && cd my-sandbox
- Install SDK: npm install @cloudflare/sandbox
- Configure Durable Object binding in wrangler.toml
- Implement basic execution API (see “Your First Sandbox” section)
- Deploy: npx wrangler deploy
- Test with curl: Verify code execution works
- Add authentication: Implement user isolation with unique sandbox IDs
- Enable streaming: Add real-time output for long-running operations
- Set up monitoring: Use Cloudflare Analytics for request metrics
Start simple, iterate based on real user needs. My first implementation took three hours from zero to production-ready API.
Final Thoughts
Cloudflare Sandbox SDK represents a paradigm shift in secure code execution. Instead of managing Docker containers, Kubernetes clusters, or EC2 fleets, you write TypeScript and deploy to the edge. VM-level isolation handles security, global distribution handles latency, and Cloudflare handles operations.
After six months running production workloads, I’m convinced this is how secure code execution should work: serverless, edge-native, and built for the problems developers actually face. Whether you’re building AI coding assistants, interactive notebooks, or CI/CD pipelines, Sandbox SDK provides the right primitives at the right abstraction level.
The future of developer tools isn’t about managing infrastructure—it’s about writing code that solves problems. Sandbox SDK gets us one step closer to that reality.