Amazon · Filed Apr 15, 2025 · Published Jun 11, 2026 · verified — real USPTO data

Zoox Patents a Way to Train Its Self-Driving AI on Situations It Rarely Sees

By Patentlyze Team · Updated Jun 12, 2026

A self-driving car is only as safe as the situations its AI has practiced. Zoox is patenting a method to manufacture more of those practice runs — especially for the rare, dangerous moments that almost never show up in real-world training data.

FIG. 1A — rendered from the official USPTO publication PDF.

Publication number US 2026/0162438 A1

Applicant Zoox, Inc.

Filing date Apr 15, 2025

Publication date Jun 11, 2026

Inventors Po-Jen Lai, Shuangting Liu, Francesco Papi

CPC classification 382/100

Grant likelihood Medium

Examiner CENTRAL, DOCKET (Art Unit OPAP)

Status Docketed New Case - Ready for Examination (Mar 4, 2026)

Parent application is a Continuation of 17956631 (filed 2022-09-29)

Document 20 claims

AI/ML

What Zoox's augmented training data actually does

Imagine teaching a new driver almost entirely on sunny, empty highways — they'd be dangerously underprepared for rush-hour fog or a car stopped in a crosswalk. Self-driving AI has the same problem: it learns from recorded drives, but genuinely dangerous edge cases don't come up often enough to teach the car well.

Zoox's patent describes a way to take the rare scary moments that do appear in recorded data and stretch them into a much larger set of practice examples. The system flags clips tied to tough conditions, then mathematically flips and rearranges the sensor readings — and carefully adjusts everything else in the scene so it still makes physical sense. It can also zero out specific objects in the scene, essentially teaching the AI to handle situations where a pedestrian or car partially disappears from the sensors.

The result is a richer training library built mostly from real data, not guesswork. Zoox's detection model gets many more rehearsals of the situations most likely to cause accidents.

How Zoox flips and zeroes sensor data to fake rare scenarios

The patent describes a two-step augmentation pipeline applied to multichannel sensor data — the kind of layered data structure a self-driving system builds by fusing camera, lidar, and radar readings into a single scene representation.

Step one handles geometric augmentation: the system takes the spatial coordinates in a data frame — where objects are, how they're oriented — and flips or transforms them. Think of it like mirroring a chess board mid-game: every piece moves, but the relationships between pieces stay consistent.

Step two handles non-geometric augmentation: it looks at the specific pixels or data values tied to individual detected objects and modifies or zeros them out. Zeroing means telling the model "this object's sensor return just vanished" — which mimics real sensor noise, occlusion (one object hiding behind another), or adverse weather degrading the signal.

Critically, the two augmentation types stay distinct and are applied in sequence so the final training sample remains physically coherent. The system also identifies which portions of the dataset are flagged as "significant operational conditions" — unusual or high-risk scenarios — and prioritizes those for augmentation, ensuring the trained model gets proportionally more exposure to the situations it needs most.

The output feeds directly into training a machine-learned object-detection model.

What this means for self-driving safety in edge cases

Self-driving AI failures tend to cluster around situations the training data didn't cover well — a shopping cart in the road, a cyclist in heavy rain, a truck partially blocking a sensor. Data augmentation is one of the main tools the industry uses to close those gaps without sending fleets of cars into dangerous situations just to collect footage. Zoox's approach is notable because it applies augmentation selectively to already-identified hard cases rather than randomly across the whole dataset, which should make the extra training more targeted.

For you as a future passenger or pedestrian, this is the kind of unglamorous infrastructure work that determines whether a robotaxi behaves sensibly in the one-in-a-thousand scenario that actually matters. Zoox, an Amazon subsidiary, operates driverless shuttles in Las Vegas, so improvements to edge-case detection have a direct path to real roads.

Editorial take

This is competent, serious safety engineering — not a flashy AI trick. The value here is in the details: using flagged high-risk clips as seeds for augmentation, and keeping geometric and non-geometric changes separate so the training data doesn't become physically nonsensical. It won't make headlines, but it's exactly the kind of incremental work that determines which self-driving programs actually reduce accidents over time.

Get one Big Tech patent every Sunday

Plain English, intelligent commentary, no hype. Free.

Source. Full patent text and figures from the official USPTO publication PDF.

Editorial commentary on a publicly published patent application. Not legal advice.

Zoox Patents a Way to Train Its Self-Driving AI on Situations It Rarely Sees

What Zoox's augmented training data actually does

How Zoox flips and zeroes sensor data to fake rare scenarios

What this means for self-driving safety in edge cases

More from Amazon

More in AI/ML

Get one Big Tech patent every Sunday