While Tesla streams acrobatic humanoids, OpenAI is brute-forcing the robot brain—100 contractors teleoperating arms 24/7 to build the largest real-world manipulation data set on Earth.
OpenAI never left robotics—it just stopped showing off. A year-long expansion inside a nondescript San Francisco finance-floor annex has ballooned into a 100-person, three-shift data factory whose sole mission is to teach off-the-shelf Franka arms how to butter toast, fold laundry, and drop a rubber duck into a cup with super-human reliability.
From Rubik’s Cube to Rubber Duck: The Six-Year Arc
- 2020: OpenAI shutters its in-hand cube solver, citing “refocus on other projects” VentureBeat
- 2024: Berkeley GELLO paper drops, showing cheap tele-op can scale; lead author hired by OpenAI in August
- February 2025: Lab opens with 20 contractors and one goal—10 000 hours of manipulation data
- December 2025: Head-count >100, second site green-lit in Richmond, CA
The pivot is strategic. Reinforcement-learning hype ran head-first into reality: reward functions don’t generalize to chaotic kitchens. Imitation learning—especially when you can bank 50 000 “good hours” of pristine motion-capture—scales like large-language models. More data, better priors, emergent dexterity.
Why Arms Beat Humanoids (For Now)
Tesla’s Optimus demos thrill X, but full-body bipeds are data-hungry divas. One 30-second motion-capture wipe-down generates ~3 MB of kinematic data; the same budget funds 12 hours of Franka gripper telemetry. OpenAI’s GELLO rigs—3-D-printed desktop joysticks—cost $73 apiece and map human wrist roll directly to 7-DOF arm joints, slashing domain-gap latency.
Contractors compete on a live leaderboard: “good hours” logged, task success rate, kinematic smoothness. Top performers clear 9 h/day, earning performance bonuses that push hourly pay past $48—far above Amazon warehouse rates and a stealth talent magnet in a tight robotics labor market.
The Supply-Chain Chess Move
Parallel to the lab, OpenAI issued a request for proposals to US manufacturers for consumer-device, robotics, and data-center partnerships. Translation: once the model is trained, the company wants American fabs ready to stamp custom actuators, torque sensors, and edge inference boards at ChatGPT scale—no repeat of the GPU shortage that kneecapped LLM roll-outs.
Developer Take-away: Data Is the New API
- Tele-op datasets will commoditize. Expect open clones of OpenAI’s protocol within 12 months; build sim-to-real pipelines now.
- Low-DOF arms first. Start-ups chasing full humanoids risk capital death; gripper-centric SKUs ship faster and iterate cheaper.
- Benchmark sprint begins. Household-task leaderboards (dishwasher loading, zipper pulling) will become the new GLUE for robotics VCs.
OpenAI isn’t promising a robot butler tomorrow; it is cornering the raw material—high-resolution manipulation data—everyone else will need the day after. If history repeats, that hoard becomes the moat that feeds an on-device model no competitor can match without paying the toll.
Stay locked to onlytrustedinfo.com for the fastest breakdown of every dataset drop, benchmark leak, and supply-chain twist as the robotics race accelerates from prototype parlor tricks to productized hardware at scale.