Nvidia · Filed Sep 22, 2025 · Published Jun 25, 2026 · verified — real USPTO data

Nvidia Patents a Depth-Judging System That Measures Object Distance From Two Camera Shots

Figuring out how far away something is has always been one of the hardest problems in computer vision. Nvidia's new patent describes a neural network that cracks it by asking a simpler yes-or-no question over and over.

Publication number US 2026/0179239 A1
Applicant NVIDIA Corporation
Filing date Sep 22, 2025
Publication date Jun 25, 2026
Inventors Orazio Gallo, Abhishek Badki, Alejandro Troccoli
CPC classification 382/103
Grant likelihood Medium
Examiner CENTRAL, DOCKET (Art Unit OPAP)
Status Docketed New Case - Ready for Examination (Mar 19, 2026)
Parent application is a Continuation of 16714359 (filed 2019-12-13)
Document 21 claims

How Nvidia's camera depth trick actually works

Imagine trying to park a car by looking at a photo instead of using your own eyes. Judging distance from a flat image is genuinely hard, even for computers. Nvidia's patent describes a system that uses one or two cameras to get around this problem.

Instead of trying to calculate an exact distance all at once, the system works by narrowing it down step by step. It transforms one camera image to match another, then asks a neural network a simple question: is this object closer or farther than a specific known distance? By repeating that question at different distances, it zeroes in on where something actually is.

Think of it like a guessing game where you keep splitting the remaining range in half. Each round of questions cuts the uncertainty down until you have a good answer. The result is a depth estimate that doesn't require expensive specialized hardware, just clever software and standard cameras.

How the neural network bisects depth to find distances

The patent describes a depth estimation system that uses a pair of images captured from one or more cameras. One image is transformed (warped or shifted) to align with the perspective of the other, a technique that lets the system compare how objects appear from slightly different viewpoints.

A neural network then acts as a binary classifier: rather than directly predicting a distance, it answers whether a given object appears to be in front of or behind a reference distance. This sidesteps the harder problem of outputting a raw number.

The system repeats this classification across a set of candidate distances (think of it as a binary search, the same divide-and-conquer logic a library uses to find a book on a shelf). Each pass narrows the window until the object's true distance is resolved to sufficient precision.

The first independent claim (claims 1-31) has been canceled in this publication, which typically signals that the patent is being revised or narrowed during examination. The core technical idea is intact, but the final enforceable scope is still being worked out with the patent office.

What this means for autonomous vehicles and robotics

Accurate depth sensing is a foundational requirement for self-driving vehicles, robots, and any camera system that needs to understand the physical world rather than just capture a picture of it. Most approaches either rely on dedicated hardware like lidar or radar, or they use stereo camera rigs with precisely calibrated spacing. A software-centric approach that works with ordinary cameras could lower the cost and complexity of building these systems.

For Nvidia, which sells the computing platforms that power many autonomous driving and robotics systems, a better depth-estimation method reinforces the value of its hardware. If your car or delivery robot runs on an Nvidia chip, this kind of patent is part of what makes the whole stack work together.

Editorial take

This is a technically sound but fairly specialized patent in a field (monocular and stereo depth estimation) that already has decades of research behind it. The binary-search framing for depth classification is an interesting twist. The cancellation of all 31 independent claims is a yellow flag worth watching: the eventual granted version may be narrower than what's described here.

Get one Big Tech patent every Sunday

Plain English, intelligent commentary, no hype. Free.

Source. Full patent text and figures from the official USPTO publication PDF.

Editorial commentary on a publicly published patent application. Not legal advice.