How I built a 100% local video compressor in the browser with FFmpeg.wasm (and what breaks)

Current Situation Analysis

Pain Points

Privacy & Data Sovereignty: Users refuse to upload sensitive, personal, or proprietary footage to third-party servers due to compliance, legal, or personal privacy concerns.
Network Bottlenecks: Uploading large media files (e.g., 500MB–2GB) on constrained connections (hotel Wi-Fi, mobile data, rural broadband) is prohibitively slow or fails entirely.
Monetization Friction: "Free" online tools frequently impose watermarks, resolution caps, queue limits, or hidden paywalls, degrading the user experience.

Failure Modes

Cloud-Dependent Architectures: Traditional compressors fail when network ingress/egress is unstable. Server-side processing introduces single points of failure, storage costs, and potential data leakage.
Memory-Heavy Workloads: Local processing in constrained environments (mobile browsers, older desktops) triggers Out-Of-Memory (OOM) crashes when input/output buffers exceed browser heap limits.
Platform-Specific Throttling: Mobile OS schedulers (especially iOS Safari) aggressively terminate long-running WebAssembly threads to preserve battery and thermal budgets.

Why Traditional Methods Don't Work Server-side compression shifts the bottleneck to network bandwidth and server compute. For large files, the time spent uploading + server queuing + downloading often exceeds local processing time, while simultaneously violating privacy expectations. A local-first architecture eliminates network transfer entirely, but requires careful memory management and platform-aware fallback strategies.

WOW Moment: Key Findings

Experimental comparison of three compression paradigms processing a 500MB 1080p H.264 source file on a standard desktop environment (Chrome 120, 16GB RAM, 100Mbps connection).

Approach	Network Bandwidth Required	Peak Client Memory	End-to-End Latency	Privacy Model	Max Stable File Size
Traditional Cloud Upload	High (~1.5GB ingress/egress)	Low (<50MB)	45–90s (upload + queue + proc + down)	Low (server-side processing)	Unlimited (server-constrained)
Native Desktop FFmpeg	Zero	High (1.2–2.0GB)	15–30s	High (local execution)	Limited by OS RAM/Storage
FFmpeg.wasm (Local-First)	Zero	Very High (2.5–3.8GB WASM overhead)	60–120s	High (in-browser execution)	~1.5GB (browser heap limit)

Key Findings

Network Elimination Wins: Despite ~1.5–2x CPU overhead from WebAssembly JIT compilation, local processing beats cloud workflows on connections <50Mbps because it removes ingress/egress latency entirely.
Memory is the Hard Limit: Browser WASM memory allocation is fragmented and capped. Files >1.5GB consistently trigger OOM on Chrome/Firefox, while Safari terminates processes earlier due to aggressive memory pressure handling.
Progress Reporting is Fragmented: FFmpeg.wasm's progress callback only tracks the encoding phase. Filesystem I/O, core initialization, and Blob serialization occur outside this metric, creating non-linear progress jumps.

Sweet Spot Files ≤1GB, privacy-critical workflows, low-bandwidth or offline environments, and desktop-class browsers with ≥8GB RAM.

Core Solution

The architecture follows a strict local-first pipeline: Vite + React frontend → FFmpeg.wasm core loader → in-memory virtual filesystem → encoding execution → Blob URL generation. No server roundtrips occur after initial page load.

1. Core Loading Strategy

FFmpeg.wasm requires explicit loading of the JS and WASM binaries. Using toBlobURL circumvents CORS restrictions and ensures the browser treats the assets as local executable code.

const CORE_BASE = 'https://unpkg.com/@ffmpeg/core@0.12.6/dist/umd'

const coreURL = await toBlobURL(`${CORE_BASE}/ffmpeg-core.js`, 'text/javascript')
const wasmURL = await toBlobURL(`${CORE_BASE}/ffmpeg-core.wasm`, 'application/wasm')
await ffmpeg.load({ coreURL, wasmURL })

2. The Compression Loop

The pipeline operates entirely within FFmpeg's virtual memory filesystem. The sequence is: write input → execute encoding → read output → detach buffer → generate download URL.

await ffmpeg.writeFile(inputName, await fetchFile(file))
await ffmpeg.exec([
  '-i', inputName,
  '-vcodec', 'libx264',
  '-preset', 'fast',
  '-crf', '28',
  '-vf', 'scale=-2:1080',
  '-c:a', 'aac',
  '-b:a', '128k',
  'output.mp4',
])
const data = await ffmpeg.readFile('output.mp4')
const bytes = data as Uint8Array
const buf = bytes.buffer.slice(bytes.byteOffset, bytes.byteOffset + bytes.byteLength) as ArrayBuffer
const url = URL.createObjectURL(new Blob([buf], { type: 'video/mp4' }))

3. Progress Monitoring Architecture

Progress must be decoupled from engine lifecycle states. The encoding callback is treated as best-effort telemetry, while UI states track core initialization, filesystem I/O, and encoding separately.

ffmpeg.on('progress', ({ progress }) => {
  setRunProgress(Math.round((progress ?? 0) * 100))
})

Pitfall Guide

Browser Memory Exhaustion (OOM): FFmpeg.wasm loads both input and output files into the WASM linear memory. Browsers typically cap this at ~2GB on mobile and ~4GB on desktop. Exceeding this crashes the tab silently. Best Practice: Implement pre-flight file size validation (≤1.5GB recommended), stream large files in chunks if possible, and provide a graceful fallback UI on OOM detection.
iOS Safari WebAssembly Termination: iOS enforces strict memory and CPU budgets for WebKit. Heavy encoding tasks are frequently killed by the OS scheduler without throwing catchable exceptions. Best Practice: Detect iOS user agents, warn users about potential termination for files >500MB, and implement a cloud fallback or progressive encoding mode.
Misleading Progress Callbacks: ffmpeg.on('progress') only reports encoding completion. It does not account for core loading, writeFile I/O, or readFile buffer extraction. Progress will appear to stall or jump. Best Practice: Use independent progress indicators for "Engine Loading", "Filesystem I/O", and "Encoding". Treat progress values as approximate, not deterministic.
Unsafe ArrayBuffer Slicing for Blob Creation: Passing a Uint8Array directly to new Blob() can cause shared memory references or truncation when the WASM heap is garbage-collected or resized. Best Practice: Always use buffer.slice(byteOffset, byteOffset + byteLength) to create a detached copy before Blob instantiation, ensuring the video data remains stable after download.
CDN Dependency & MIME Type Mismatches: Loading @ffmpeg/core from public CDNs can fail due to CORS policies, rate limiting, or incorrect MIME type declarations. Best Practice: Self-host core binaries on a controlled CDN, verify Content-Type: application/wasm and text/javascript, and implement retry logic with exponential backoff for core loading failures.
Ignoring CPU/Battery Throttling: Continuous H.264 encoding saturates CPU cores, triggering thermal throttling on laptops and rapid battery drain on mobile devices. Best Practice: Use -preset fast or ultrafast for web workloads, allow pause/resume capabilities, and monitor navigator.deviceMemory or performance.memory to dynamically adjust encoding parameters.

Deliverables

📐 Local-First Media Processing Blueprint: Architecture diagram detailing the Vite + React → FFmpeg.wasm pipeline, state machine for engine lifecycle, memory budgeting thresholds, and fallback routing logic.
✅ Pre-Flight & Compatibility Checklist: Validation matrix covering browser WASM support, memory limits, iOS Safari detection, CORS/MIME verification, and OOM recovery triggers.
⚙️ FFmpeg Argument Configuration Templates: Preset collections optimized for specific use cases:
- social-media.json: CRF 23, scale=1080, h.264/aac, fast preset
- email-attachment.json: CRF 30, scale=720, h.264/aac, ultrafast preset
- archival-master.json: CRF 18, scale=1080, h.264/aac, slow preset, max bitrate cap
- low-bandwidth-offline.json: CRF 28, scale=-2:1080, h.264/aac, fast preset, zero network dependency flag