Training-ready data pipeline
for robot learning.

Every teleop session - data collection or intervention - automatically outputs structured, labeled data straight into your ML pipeline.

Forge training data interface
What you get

Structured, labeled data - straight into your training loop.

No cleaning. No reformatting. Forge turns every teleop session into ML-ready episodes that plug directly into your pipeline.

ML-ready output

Structured, formatted datasets that plug directly into your training workflows. No cleaning, no reformatting - your engineers use them instantly.

Data mining & curation

Automatically identify the most valuable demonstrations and edge cases across your dataset. Surface the data that moves your model forward.

CI/CD integration

Direct integrations with your build and training infrastructure. Data flows from collection to model training without manual steps.

Task-specific annotation

Labels and metadata matched to your model architecture. Structured annotation your training code expects, out of the box.

Live demo

See Forge on your robot data.

Forge plugs into the same dataset and event bus as Proxy Cloud. Kick off an imitation or VLA training run with one command - Forge handles distributed training, eval, and rollback.

  • Search 10M+ demonstrations by outcome, joint trace, or workspace
  • Train policies on the data you just collected
  • Data pipeline and Forge walkthrough
  • No commitment - 30-minute call with a team member
Forge dataset and training pipeline
live · training pipeline