Per-User Dev Environments with Fly Machines

Fly Machines are fast-launching VMs behind a simple API, enabling you to launch tightly isolated app instances in milliseconds all over the world.

One interesting use case: running isolated dev environments for your users (or robots). Fly Machines are a safe execution sandbox for even the sketchiest user-generated (or LLM-generated) code.

This blueprint explains how to use Fly Machines to securely host ephemeral development and/or execution environments, complete with dynamic subdomain routing using fly-replay.

Overview

Your architecture should include:

  • Router app(s)
    • A Fly.io app to handle requests to wildcard subdomains (*.example.com). Uses fly-replay headers to transparently redirect each request to the correct app and machine. If you have clusters of users (or robots) in different geographic regions, you can spin up a router app in multiple regions (you might also want to consider a globally distributed datastore like Upstash for Redis).
  • User apps (pre-created)
    • Dedicated per-user (or per-robot) Fly apps, each containing isolated Fly Machines. App and Machine creation is not instantaneous, so we recommend provisioning a pool of these before you need them so you can quickly assign upon request.
  • Fly Machines (with optional volumes)
    • Fast-launching VMs that can be attached to persistent Fly Volumes.

Example Architecture Diagram

Diagram showing router app directing traffic to user apps containing Fly Machines with volumes

Router app(s)

Your router app handles all incoming wildcard traffic. Its responsibility is simple:

User apps

Creating apps dynamically for each user at request time can be slow. To ensure fast provisioning:

  • Pre-create a pool of Fly apps and machines ahead of time (using the Fly Machines API or CLI).
  • Store app details (e.g., app_name: alice-123) in a datastore accessible to your router app.
  • Assign apps to users at provisioning time.

Fly Machines

You’ll want to spin up at least one Machine per user app (but apps can have as many Machines as needed). If your dev environments need persistent storage (data that should survive Machine restarts):

  • Attach Fly Volumes to each machine at creation time.
  • Keep in mind that machine restarts clear temporary filesystem state but preserve volume data.
  • Learn more about the Machines API resource and the Volumes API resource.

Pointers & Footguns

  • Machines & volumes are tied to physical hardware: hardware failures can destroy machines and attached volumes. Always persist important user data (code, config, outputs) to external storage (like Tigris Data or AWS S3).
  • Your users will break their environments: pre-create standby machines to handle hardware & runtime failures, or the inevitable user or robot poisoned environment. Pre-create standby machines that you can quickly activate in these scenarios.
  • Machine restarts reset ephemeral filesystem: the temporary Fly Machine filesystem state resets on Machine restarts, ensuring clean environments. However, volume data remains persistent, making it useful for retaining user progress or state.