Parsing hundreds of megabytes of JSON in milliseconds in React Native

By Codcompass Team·2026-05-17·8 min read

Bypassing the JavaScript Heap: Lazy JSON Parsing on React Native

Current Situation Analysis

Mobile applications frequently inherit legacy data contracts that ship massive JSON payloads. Whether it's a bulk configuration sync, a historical data dump, or an unoptimized third-party API, developers routinely encounter responses ranging from tens to hundreds of megabytes. The standard mental model for handling these responses assumes that JSON.parse is a lightweight, synchronous operation. This assumption holds true for kilobyte-scale payloads but collapses under the weight of megabyte-scale documents.

The fundamental misunderstanding lies in conflating syntax validation with memory allocation. When JSON.parse executes in a JavaScript runtime, it does not merely validate the structure. It walks the entire document and materializes a complete object graph. Every nested object, array, string, number, and boolean becomes a distinct JavaScript value. The engine must allocate hidden classes, property storage, array backing stores, and interned strings. For a 250 MB JSON file, this means constructing millions of interconnected objects, so the heap grows to a multiple of the raw text size rather than matching it. Garbage collection cycles stall, the JavaScript thread blocks (and the UI it drives stops responding), and peak memory consumption balloons: roughly 1.2 GB for the 250 MB fixture discussed below, close to 5× the raw file size.

Empirical testing against the public data_250mb.json fixture demonstrates the severity of this bottleneck. On iOS, standard parsing completes only after several seconds of CPU saturation, with the JavaScript heap spiking to approximately 1.2 GB. On Android, the runtime consistently triggers an out-of-memory condition and terminates the process before parsing finishes. The failure is not caused by slow syntax parsing; it is caused by the impossibility of materializing a gigabyte-scale object graph within the constrained memory limits of a mobile device. Teams that treat large JSON as a parsing problem miss the actual constraint: the JavaScript heap cannot sustain full-tree materialization.

WOW Moment: Key Findings

The breakthrough comes from shifting the parsing boundary from the JavaScript runtime to native memory. By leveraging SIMD-accelerated C++ parsing and exposing a lazy navigation interface, the cost model changes from building an object graph on the JavaScript heap to holding a single buffer in native memory. The document remains in native memory, and JavaScript only receives lightweight handles to specific paths.

Approach	Parse Time (iOS)	Parse Time (Android)	Peak Memory Footprint
Standard `JSON.parse`	~3–5 seconds	OOM Crash	~1.2 GB JS Heap
Native Lazy `JsonView`	~100 ms	~500 ms	~400 MB Native Buffer

This comparison reveals a fundamental architectural shift. The native approach does not eliminate memory usage; it relocates it. The raw bytes and simdjson padding reside in native memory, avoiding JavaScript heap allocation entirely. Navigation operations (getValue, atPath, asString) execute targeted C++ lookups and cross the bridge only when returning small primitives or nested handles. The JavaScript runtime never sees the full object graph unless explicitly requested. This enables deterministic performance on constrained devices and transforms an impossible operation into a sub-second workflow.

Core Solution

The implementation relies on three coordinated layers: native buffer management, hybrid object bridging, and lazy traversal. Each layer addresses a specific bottleneck in the traditional parsing pipeline.

1. Native Buffer Initialization

The process begins by loading the JSON file into a padded native buffer. The simdjson library requires padding for safe SIMD boundary reads. Instead of copying the file into a JavaScript string, the buffer is allocated in native memory. This eliminates the initial string-to-heap conversion and provides a contiguous memory region optimized for parallel parsing.
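The padding idea can be illustrated in TypeScript, even though the real allocation happens in C++. This is only a sketch: `toPaddedBuffer`, `PaddedBuffer`, and the 64-byte `PADDING_BYTES` value are assumptions made for illustration, not the library's API or a confirmed constant.

```typescript
// Illustrative only: the real buffer lives in C++ memory, not in a Uint8Array.
// PADDING_BYTES is an assumed value standing in for the parser's padding requirement.
const PADDING_BYTES = 64;

interface PaddedBuffer {
  bytes: Uint8Array; // document bytes followed by zeroed padding
  length: number;    // length of the actual JSON, excluding padding
}

function toPaddedBuffer(json: Uint8Array, padding: number = PADDING_BYTES): PaddedBuffer {
  // Allocate once, large enough for the document plus the trailing padding.
  const bytes = new Uint8Array(json.length + padding);
  // Copy the document to the front; the remaining bytes stay zero-initialized.
  bytes.set(json, 0);
  return { bytes, length: json.length };
}

const rawDocument = new TextEncoder().encode('{"version":3}');
const paddedDocument = toPaddedBuffer(rawDocument);
```

The point of the extra bytes is that a vectorized read starting near the end of the document can load a full register without touching memory outside the allocation.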

2. Hybrid Object Bridging

React Native's bridge introduces serialization overhead when transferring complex structures. To avoid this, the library uses Nitro Modules to expose a C++ HybridJsonView directly to JavaScript. The root handle returned to the JS runtime is not a plain object; it is a proxy backed by native memory. Method calls on this proxy execute in C++, perform targeted navigation, and return only the requested slice. This architecture minimizes bridge traffic and prevents accidental full-tree serialization.

3. Lazy Traversal & Explicit Materialization

Navigation methods like atPath or getValue resolve JSON pointers or dotted paths within the native buffer. Scalar extraction (asString, asNumber) converts only the matched node to a JavaScript primitive. Subtree materialization (asObject, rawJson) is deliberately expensive. It forces the engine to allocate a JavaScript object graph for the requested slice. By making materialization explicit, developers control exactly when and where heap allocation occurs.
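The difference between lazy navigation and explicit materialization can be sketched in plain TypeScript. In this stand-in the "native document" is just an in-memory value, and `LazyNodeSketch` is a hypothetical class; only the method names `asString` and `asObject` echo the ones mentioned above.

```typescript
// A pure-TypeScript stand-in for a lazy view. In the real library the document
// would sit in a native buffer; here it is an ordinary in-memory value.
type Json = null | boolean | number | string | Json[] | { [key: string]: Json };

class LazyNodeSketch {
  static materializations = 0; // counts explicit copies, for demonstration
  private readonly root: Json;
  private readonly path: string[];

  constructor(root: Json, path: string[] = []) {
    this.root = root;
    this.path = path;
  }

  // Navigation only records the path; nothing is resolved or copied yet.
  at(...segments: string[]): LazyNodeSketch {
    return new LazyNodeSketch(this.root, [...this.path, ...segments]);
  }

  // Walk the stored path on demand and return the node it points to.
  private resolve(): Json | undefined {
    let current: Json | undefined = this.root;
    for (const key of this.path) {
      if (current === null || typeof current !== 'object') return undefined;
      current = Array.isArray(current) ? current[Number(key)] : current[key];
    }
    return current;
  }

  // Scalar extraction: only the matched node becomes a JavaScript value.
  asString(): string | null {
    const value = this.resolve();
    return typeof value === 'string' ? value : null;
  }

  // Explicit, deliberately visible copy of a subtree.
  asObject(): Json | null {
    const value = this.resolve();
    if (value === undefined) return null;
    LazyNodeSketch.materializations += 1;
    return JSON.parse(JSON.stringify(value)) as Json;
  }
}

const view = new LazyNodeSketch({
  metadata: { version: '2.1' },
  items: [{ id: 'a' }, { id: 'b' }],
});
const version = view.at('metadata', 'version').asString(); // scalar only
const firstItem = view.at('items', '0').asObject();         // one explicit copy
```

Because `at` returns a new handle without touching the data, chains of navigation calls stay cheap, and the only place a copy is made is the call that asks for one.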

Implementation Example

The following TypeScript wrapper demonstrates the architecture with renamed interfaces and a production-ready structure. It abstracts the native module calls while enforcing safe resource management.

```typescript
import { NativeModules } from 'react-native';

// Native module interface (backed by Nitro + simdjson)
interface NativeFastJsonModule {
  // Loads and parses the file natively; resolves to an opaque handle ID.
  loadDocument(filePath: string): Promise<string>;
  // Resolves a path inside the native document without materializing it.
  resolvePath(handleId: string, jsonPointer: string): Promise<unknown>;
  // Converts a single matched node to a JavaScript primitive.
  extractScalar(handleId: string, jsonPointer: string, type: 'string' | 'number' | 'boolean'): Promise<unknown>;
  // Expensive: copies the subtree at the path onto the JavaScript heap.
  materializeSubtree(handleId: string, jsonPointer: string): Promise<Record<string, unknown> | unknown[]>;
  // Frees the native buffer behind the handle.
  disposeDocument(handleId: string): void;
}

const FastJsonNative = NativeModules.FastJsonModule as NativeFastJsonModule;

export class LazyJsonReader {
  private handleId: string | null = null;
  private filePath: string;

  constructor(filePath: string) {
    this.filePath = filePath;
  }

  async initialize(): Promise<void> {
    if (this.handleId) throw new Error('Document already initialized');
    this.handleId = await FastJsonNative.loadDocument(this.filePath);
  }

  // Reads one scalar; the `type` argument is tied to the generic parameter T.
  async getScalar<T extends string | number | boolean>(
    path: string,
    type: T extends string ? 'string' : T extends number ? 'number' : 'boolean'
  ): Promise<T | null> {
    this.assertReady();
    const raw = await FastJsonNative.extractScalar(this.handleId!, path, type);
    return (raw as T) ?? null;
  }

  // Materializes the subtree at `path`; avoid calling this on the root or on large arrays.
  async getSubtree(path: string): Promise<Record<string, unknown> | unknown[] | null> {
    this.assertReady();
    const raw = await FastJsonNative.materializeSubtree(this.handleId!, path);
    return (raw as Record<string, unknown> | unknown[]) ?? null;
  }

  async resolvePointer(path: string): Promise<unknown> {
    this.assertReady();
    return FastJsonNative.resolvePath(this.handleId!, path);
  }

  // Safe to call more than once; the handle is released only the first time.
  cleanup(): void {
    if (this.handleId) {
      FastJsonNative.disposeDocument(this.handleId);
      this.handleId = null;
    }
  }

  private assertReady(): void {
    if (!this.handleId) throw new Error('Call initialize() before accessing document');
  }
}
```

Architecture Rationale

Why Nitro Modules? Conventional native modules hand results back to JavaScript as converted or serialized values, which becomes costly for complex return types. Nitro generates type-safe C++/Swift/Kotlin bindings that expose hybrid objects directly to JavaScript, eliminating intermediate serialization steps.
Why simdjson? Conventional parsers walk the input one byte at a time. simdjson uses SIMD instructions to examine many bytes per instruction and reaches parsing throughput measured in gigabytes per second. It is a widely used choice for high-performance JSON parsing in C++ ecosystems.
Why explicit disposal? Native buffers are not garbage-collected by the JavaScript runtime. Without deterministic cleanup, repeated parsing operations leak native memory proportional to the file size. The disposeDocument call ensures the padded buffer is freed immediately after use.

Pitfall Guide

1. Omitting Deterministic Cleanup

Explanation: Native buffers allocated by loadDocument persist in native memory until explicitly freed. JavaScript garbage collection does not track native allocations. Forgetting to call disposeDocument causes cumulative memory leaks that eventually trigger OOM crashes. Fix: Wrap reader usage in a try/finally block or implement a resource manager that guarantees cleanup regardless of execution path.

2. Materializing the Root Node

Explanation: Calling materializeSubtree on $ or the root path forces the entire document into the JavaScript heap. This reproduces the exact OOM condition the native approach was designed to avoid. Fix: Only materialize deeply nested slices required for UI rendering. Keep navigation lazy until the final transformation step.

3. Bridge Thrashing via Fine-Grained Loops

Explanation: Each resolvePath or extractScalar call crosses the JavaScript-to-native boundary. Iterating over thousands of items with individual path calls introduces significant latency and CPU overhead. Fix: Use a wildcard resolver (the equivalent of atPathWithWildcard) to batch extractions, or materialize a parent array and iterate in JavaScript. Measure round-trip costs before choosing a strategy.
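The cost difference can be made visible with a stand-in that simply counts calls. Everything in this sketch is hypothetical: `CountingNativeStub`, `extractAllIds`, and the `/items/<index>/id` layout are illustrative assumptions, not the library's API.

```typescript
// Hypothetical stand-in for the native module. It holds the data in memory and
// counts how many times JavaScript calls into it.
class CountingNativeStub {
  crossings = 0;
  private readonly items: { id: string }[];

  constructor(items: { id: string }[]) {
    this.items = items;
  }

  // One crossing per scalar, addressed as /items/<index>/id.
  async extractScalar(pointer: string): Promise<string | null> {
    this.crossings += 1;
    const match = /^\/items\/(\d+)\/id$/.exec(pointer);
    if (!match) return null;
    const item = this.items[Number(match[1])];
    return item ? item.id : null;
  }

  // One crossing for every id, standing in for a wildcard-style resolver.
  async extractAllIds(): Promise<string[]> {
    this.crossings += 1;
    return this.items.map((item) => item.id);
  }
}

// Fine-grained loop: one call per element.
async function readIdsOneByOne(native: CountingNativeStub, count: number): Promise<string[]> {
  const ids: string[] = [];
  for (let i = 0; i < count; i += 1) {
    const id = await native.extractScalar(`/items/${i}/id`);
    if (id !== null) ids.push(id);
  }
  return ids;
}

// Batched read: a single call returns all matches.
async function readIdsBatched(native: CountingNativeStub): Promise<string[]> {
  return native.extractAllIds();
}
```

With 1,000 items the loop performs 1,000 calls and the batched version performs one. How much each call costs on a real device has to be measured, which is why the fix above recommends benchmarking first.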

4. Assuming Zero Memory Overhead

Explanation: The native approach avoids JavaScript heap allocation but still requires native memory at least equal to the file size, plus simdjson's padding and any working memory the parser needs. In the comparison above, the 250 MB fixture peaked at roughly 400 MB of native memory. On low-RAM devices, holding multiple large documents simultaneously will exhaust system memory. Fix: Implement a document pool with strict concurrency limits. Serialize access to large files and release buffers immediately after processing.
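One way to enforce such a limit is a small counting semaphore around document work. This is a minimal sketch under assumptions: `DocumentSlots` is a hypothetical helper, and the task passed to `run` is expected to open, use, and release its own document.

```typescript
// Limits how many document tasks run at once; extra tasks wait in FIFO order.
class DocumentSlots {
  private available: number;
  private readonly waiters: Array<() => void> = [];

  constructor(maxConcurrent: number) {
    this.available = maxConcurrent;
  }

  private async acquire(): Promise<void> {
    if (this.available > 0) {
      this.available -= 1;
      return;
    }
    // No free slot: park until release() hands one over.
    await new Promise<void>((resolve) => this.waiters.push(resolve));
  }

  private release(): void {
    const next = this.waiters.shift();
    if (next) {
      next(); // pass the slot directly to the next waiter
    } else {
      this.available += 1;
    }
  }

  // Runs the task once a slot is free and always returns the slot afterwards.
  async run<T>(task: () => Promise<T>): Promise<T> {
    await this.acquire();
    try {
      return await task();
    } finally {
      this.release();
    }
  }
}
```

A screen that syncs several large files could create one `new DocumentSlots(1)` and route every load through `run`, so that only one native buffer exists at a time.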

5. Applying to Small Payloads

Explanation: The overhead of native module invocation, buffer allocation, and hybrid object creation outweighs the benefits for payloads under 5 MB. Standard JSON.parse is highly optimized for small documents and executes faster with less complexity. Fix: Implement a size threshold. Use native lazy parsing only for documents exceeding 10–15 MB and fall back to standard parsing for typical API responses. For payloads between 5 and 10 MB, measure both approaches on target devices before committing to either.
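A threshold check of this kind can be a single pure function. The sketch below uses the 5 MB and 10 MB figures from this guide as defaults; the `chooseParseStrategy` name and the `'benchmark-first'` label for the middle band are assumptions added for illustration.

```typescript
type ParseStrategy = 'json-parse' | 'benchmark-first' | 'native-lazy';

const BYTES_PER_MB = 1024 * 1024;

// Routes a payload by size. The limits are parameters so they can be tuned
// after measuring on real devices.
function chooseParseStrategy(
  sizeBytes: number,
  smallLimitMb: number = 5,
  largeLimitMb: number = 10
): ParseStrategy {
  if (sizeBytes < smallLimitMb * BYTES_PER_MB) return 'json-parse';
  if (sizeBytes <= largeLimitMb * BYTES_PER_MB) return 'benchmark-first';
  return 'native-lazy';
}

const typicalApiResponse = chooseParseStrategy(200 * 1024);       // 200 KB
const bulkSync = chooseParseStrategy(250 * BYTES_PER_MB);         // 250 MB
```

The size can usually be read from the downloaded file's metadata or the response's Content-Length header before any parsing starts.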

6. Ignoring Platform Performance Variance

Explanation: iOS and Android exhibit different parsing characteristics due to CPU architecture, storage I/O speeds, and runtime configurations. Benchmarks show Android parsing times can be ~5× slower than iOS for identical payloads. Fix: Never hardcode timeout expectations. Implement platform-aware loading states and measure performance on target device tiers during QA.
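Measuring rather than assuming can start with a small timing helper that wraps any asynchronous step, such as document initialization. This is a sketch: `timeAsync` is a hypothetical helper, and it uses `Date.now()` for portability, which limits resolution to about a millisecond.

```typescript
interface TimedResult<T> {
  result: T;
  elapsedMs: number;
}

// Runs an async operation and reports how long it took, so that loading-state
// decisions can be based on numbers recorded per platform and device tier.
async function timeAsync<T>(operation: () => Promise<T>): Promise<TimedResult<T>> {
  const startedAt = Date.now();
  const result = await operation();
  return { result, elapsedMs: Date.now() - startedAt };
}
```

Logging `elapsedMs` together with the platform and device model during QA gives a per-tier picture that a single hardcoded timeout cannot.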

7. Misusing Path Syntax

Explanation: The navigation API expects specific path formats. Dotted paths like $.metadata.version work for simple traversal, but bracket notation for array indices is often unsupported in the base resolver. Wildcard patterns require dedicated methods. Fix: Validate path strings against the library's specification. Use wildcard resolvers for array iterations and standard dotted paths for object property access.
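Path validation can be done before anything reaches the native side. The sketch below converts a simple dotted path into a JSON Pointer using the RFC 6901 escaping rules and rejects bracket or wildcard syntax. `dottedPathToPointer` is a hypothetical helper, and whether a given resolver accepts pointers, dotted paths, or both should be checked against the library's specification.

```typescript
// Converts "$.a.b.0" into the JSON Pointer "/a/b/0".
// Bracket notation and wildcards are rejected rather than guessed at.
function dottedPathToPointer(path: string): string {
  // In RFC 6901 the empty string refers to the whole document.
  if (path === '$') return '';
  if (!path.startsWith('$.')) {
    throw new Error(`Unsupported path: ${path}`);
  }
  const segments = path.slice(2).split('.');
  for (const segment of segments) {
    if (segment.length === 0 || /[\[\]*]/.test(segment)) {
      throw new Error(`Unsupported segment in path: ${path}`);
    }
  }
  // Escape "~" first, then "/", as RFC 6901 requires.
  return segments
    .map((segment) => '/' + segment.replace(/~/g, '~0').replace(/\//g, '~1'))
    .join('');
}

const versionPointer = dottedPathToPointer('$.metadata.version');
const itemPointer = dottedPathToPointer('$.items.0.id');
```

Failing fast in JavaScript keeps malformed paths from turning into silent null results after a native round trip.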

Production Bundle

Action Checklist

Verify payload size: Implement a threshold check to route documents >10 MB to native lazy parsing and smaller payloads to standard JSON.parse.
Enforce deterministic cleanup: Wrap all LazyJsonReader instances in try/finally blocks to guarantee disposeDocument execution.
Audit materialization points: Search codebase for materializeSubtree or asObject calls. Ensure no root-level or large-slice materialization occurs.
Optimize bridge traffic: Replace fine-grained loop extractions with wildcard resolvers or parent-slice materialization where appropriate.
Implement memory budgeting: Limit concurrent native document handles to 1–2 per screen. Serialize access for bulk sync operations.
Add platform-aware loading states: Account for Android's ~500 ms parse time versus iOS's ~100 ms. Display appropriate progress indicators.
Benchmark on release builds: Debug native binaries introduce significant overhead. Validate performance metrics using production-optimized builds.

Decision Matrix

Scenario	Recommended Approach	Why	Cost Impact
Payload < 5 MB	Standard `JSON.parse`	Lower overhead, faster execution for small trees	Minimal JS heap, negligible CPU
Payload 10–100 MB, selective access	Native Lazy `JsonView`	Avoids full heap allocation, deterministic navigation	~File size native memory, stable JS heap
Payload > 100 MB, full transformation	Server-side sharding + SQLite	Mobile runtime cannot sustain bulk mutation	Shifts compute to backend, reduces client memory
High-frequency sync, mutable state	Binary format (MessagePack/Protobuf)	Faster decode, smaller wire size, native support	Higher initial migration cost, long-term performance gain
Low-RAM device (< 3 GB)	Stream parsing or chunked API	Prevents native buffer OOM, respects system limits	Increased network round-trips, complex state management

Configuration Template

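```typescript
// fastJsonConfig.ts
import { LazyJsonReader } from './LazyJsonReader';

export const createDocumentReader = async (
  localPath: string,
  options: { timeoutMs?: number } = {}
): Promise<LazyJsonReader> => {
  const reader = new LazyJsonReader(localPath);
  const timeoutMs = options.timeoutMs ?? 5000;
  let timer: ReturnType<typeof setTimeout> | undefined;

  const init = reader.initialize();
  const timeout = new Promise<never>((_, reject) => {
    timer = setTimeout(() => reject(new Error('Document initialization timeout')), timeoutMs);
  });

  try {
    await Promise.race([init, timeout]);
  } catch (err) {
    // If loading finishes after the timeout, free the native buffer so it is not leaked.
    init.then(() => reader.cleanup(), () => undefined);
    throw err;
  } finally {
    // Stop the timer so it cannot fire after a successful load.
    clearTimeout(timer);
  }

  return reader;
};

// Usage wrapper with automatic cleanup
export const withDocument = async <T>(
  path: string,
  handler: (reader: LazyJsonReader) => Promise<T>,
  options: { timeoutMs?: number } = {}
): Promise<T> => {
  const reader = await createDocumentReader(path, options);
  try {
    return await handler(reader);
  } finally {
    reader.cleanup();
  }
};
```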

Quick Start Guide

1. Install the native module: Add react-native-fast-json to your project and run pod install (iOS) or sync Gradle (Android). Ensure the New Architecture is enabled for Nitro compatibility.
2. Cache the payload locally: Download the JSON file to the device's document directory. Native parsing requires a file path, not an in-memory string.
3. Initialize the reader: Call createDocumentReader with the cached file path. The native buffer is allocated and parsed in the background.
4. Navigate selectively: Use getScalar or resolvePointer to extract required fields. Avoid calling getSubtree on large arrays or root nodes.
5. Release resources: Always invoke cleanup() in a finally block. Verify native memory returns to baseline using Xcode Instruments or Android Profiler.
