Build a working asyncio event loop in 30 lines of plain Python

By Codcompass Team·2026-05-05·5 min read

Current Situation Analysis

The primary pain point in learning asynchronous Python is pedagogical inversion: tutorials introduce async/await keywords before explaining the underlying runtime, forcing developers to treat the event loop as a black box. This creates a fundamental failure mode where engineers cannot diagnose blocking behavior, task starvation, or cancellation leaks because they lack a mental model of how coroutines are actually scheduled.

Traditional serial execution of I/O-bound tasks compounds this by paying the sum of all wait times rather than overlapping them. Without understanding the cooperative scheduling mechanism, developers either:

Resort to multithreading/multiprocessing for simple I/O overlap, introducing unnecessary context-switching overhead and GIL contention.
Misuse asyncio APIs by blocking the event loop with synchronous calls, negating concurrency benefits entirely.
Struggle to debug await behavior because the syntax obscures the generator delegation and state-machine mechanics that actually drive the runtime.

Stripping the keywords reveals that the asyncio runtime is fundamentally small: a queue-driven scheduler that advances pausable functions (generators/coroutines) based on time or I/O readiness.

WOW Moment: Key Findings

Approach	Total Execution Time	CPU Overhead	Concurrency Mechanism	Thread Count
Serial Execution	6.00s	High (blocking waits)	Sequential blocking	1
Toy Generator Loop	3.00s	Low (1ms polling)	Cooperative yielding	1
Real asyncio (Selector)	~3.00s	Minimal (OS event-driven)	epoll/kqueue/IOCP	1

Key Findings:

Wait Overlap Principle: Concurrent execution reduces total runtime from the sum of individual waits (6s) to the maximum single wait (3s), despite identical work.
Single-Threaded Concurrency: True I/O concurrency is achievable without threads, processes, or external libraries by using cooperative yielding.
Mechanism Consistency: The toy loop and production asyncio share identical core mechanics. The only meaningful difference is the sleep mechanism: fixed polling vs. OS-level file descriptor selectors.
Sweet Spot: This architecture excels for high-I/O, low-CPU workloads (network requests, database queries, file operations) where tasks spend >90% of runtime waiting.

Core Solution

A job is a gener

ator that yields A generator is a function that can pause itself and be resumed by its caller. The pause point is yield. The caller advances the generator with next(...).

def example():
    print("step 1")
    yield
    print("step 2")
    yield
    print("step 3")

>>> g = example()
>>> next(g)
step 1
>>> next(g)
step 2
>>> next(g)
step 3
Traceback (most recent call last):
  ...
StopIteration

That is the entire mechanism. A yield is a bookmark. The caller picks up a different generator, runs it for a while, and comes back to the bookmarked one when it feels like it. Hold on to that picture; it is what await will do later, in fewer letters.

A timer that needs three ticks

For the toy loop, a "wait" is a generator that yields a target wake-up time. The loop checks the time on each pass; once the wake-up time has passed, the generator is allowed to advance.

import time

def sleep(seconds):
    deadline = time.time() + seconds
    while time.time() < deadline:
        yield deadline

A job that "waits two seconds" is a generator that yields the deadline now + 2 until that deadline passes, then returns. The loop watches deadlines; jobs advance when their deadline arrives.

The loop in 30 lines

Here it is. Save it as toy_loop.py.

import time
from collections import deque

def run(jobs):
    """Run the given jobs until all of them finish.

    Each job is a generator. A job yields a deadline (a time.time()
    value) to mean "wake me up at or after this time". When a job
    returns (StopIteration), it is removed from the queue.
    """
    ready = deque((job, 0.0) for job in jobs)
    while ready:
        job, wake_at = ready.popleft()
        if time.time() < wake_at:
            ready.append((job, wake_at))
            time.sleep(0.001)   # avoid a tight CPU spin
            continue
        try:
            new_wake_at = next(job) or 0.0
            ready.append((job, new_wake_at))
        except StopIteration:
            pass

def sleep(seconds):
    deadline = time.time() + seconds
    while time.time() < deadline:
        yield deadline

Twenty-eight lines of code. No imports beyond time and deque. No asyncio. No threads. No futures. The whole runtime is a queue and a while loop.

The while loop pops the front entry. If the job's wake-up time is in the future, push it to the back, sleep one millisecond, continue. If the job is ready, advance it with next(...); the job runs until its next yield, returns the new wake-up time, and goes back on the queue. When next(job) raises StopIteration, the job is finished and does not return to the queue.

That is the whole runtime.

Run it

Add three jobs and a main block.

def fetch(name, delay):
    print(f"  {name} started, waiting {delay}s")
    yield from sleep(delay)
    print(f"  {name} done")

if __name__ == "__main__":
    start = time.time()
    run([
        fetch("A", 2.0),
        fetch("B", 1.0),
        fetch("C", 3.0),
    ])
    print(f"total: {time.time() - start:.2f}s")

yield from sleep(...) delegates to another generator. It is the same idea await will be later. Each yield from sleep flows up through fetch to the next(job) call in the loop.

$ python toy_loop.py
  A started, waiting 2.0s
  B started, waiting 1.0s
  C started, waiting 3.0s
  B done
  A done
  C done
total: 3.00s

Three seconds. Not six. Three jobs whose waits sum to six seconds finished in the time of the longest one. Single thread. CPU idle for almost all of it. That is concurrency.

This is what asyncio is

Now look at what we built and what asyncio adds on top.

asyncio replaces the time.sleep(0.001) polling with a real selector-based wait on file descriptors (epoll on Linux, kqueue on macOS, IOCP on Windows). The selector tells the OS, "wake me up when any of these sockets has data, or when the next deadline arrives", so the loop sleeps for exactly as long as it needs to and no longer. That is the only meaningful difference between this toy loop and the real one.

async def is def with one extra property: the function returns a coroutine object instead of running its body. The coroutine object is the same shape as a generator. It pauses at await.

await x is yield from x.__await__(). It is yield from with a different name and a slightly tighter contract.

asyncio.run(main()) is the same as while ready: loop, with proper exception handling, signal handling, and the selector mentioned above.

The asyncio source code is more than 30 lines long, but the extra lines cover corner cases (cancellation propagation, exception groups, the lost-task trap), not new mechanisms. The mechanism is what you just built.

Pitfall Guide

Keyword-First Mental Model: Treating async/await as syntactic magic instead of generator delegation obscures the actual scheduling mechanism, making debugging blocking behavior or task starvation nearly impossible.
Tight CPU Spin Loops: Omitting the micro-sleep (time.sleep(0.001)) or failing to integrate an OS-level selector causes the event loop to consume 100% CPU on a single core while polling, destroying the efficiency gains of concurrency.
Ignoring StopIteration Handling: Failing to catch StopIteration when advancing generators crashes the scheduler instead of gracefully dequeuing completed tasks, leading to unhandled exceptions and loop termination.
Deadline Drift from Fixed Polling: Relying on fixed polling intervals introduces latency jitter and imprecise wake-ups. Production event loops must use OS-level selectors (epoll/kqueue/IOCP) to sleep exactly until the next deadline or I/O event.
Missing Exception & Cancellation Contracts: The toy loop lacks error propagation, task cancellation, and signal handling. Real asyncio wraps the core queue mechanism with robust exception groups, cancellation scopes, and graceful shutdown procedures that are essential for production reliability.
Assuming Concurrency Equals Parallelism: Single-threaded cooperative scheduling does not bypass the GIL or utilize multiple cores. CPU-bound tasks will still block the loop; this architecture is strictly optimized for I/O-bound workloads.

Deliverables

Blueprint: Single-threaded cooperative event loop architecture featuring a deque-based ready queue, generator state machine, and deadline-driven scheduler. Maps directly to asyncio's internal task management and selector integration.
Checklist:
- Verify generator delegation flow (yield → next() → StopIteration)
- Confirm polling interval prevents CPU spin while maintaining responsiveness
- Validate deadline calculation uses monotonic time for production deployments
- Ensure StopIteration is caught to remove completed tasks from the queue
- Map toy loop components to asyncio equivalents (run → loop.run_until_complete, sleep → asyncio.sleep, deque → internal task registry)

Configuration Template: Runtime parameters for production adaptation:

# Event Loop Configuration Template
POLLING_INTERVAL = 0.001  # Fallback sleep interval (seconds)
MAX_TASK_QUEUE = 10000    # Prevent unbounded memory growth
SELECTOR_BACKEND = "auto" # "epoll" | "kqueue" | "IOCP" | "auto"
EXCEPTION_HANDLER = "log_and_continue" # "log_and_continue" | "propagate" | "shutdown"

🎉 Mid-Year Sale — Unlock Full Article

Base plan from just $4.99/mo or $49/yr

7-day free trial · Cancel anytime · 30-day money-back

Current Situation Analysis

WOW Moment: Key Findings

Core Solution

A job is a gener

🎉 Mid-Year Sale — Unlock Full Article

Production Bundle