Disney · Filed Dec 10, 2024 · Published Jun 11, 2026 · verified — real USPTO data

Disney Patents an AI That Maps Key Points on Faces in 3D

Disney has patented a facial tracking technique that quietly sneaks a 3D model of a face into what's otherwise a flat-image recognition task — and that geometry is what makes the system train itself better.

Disney Patent: AI Face Landmark Detection Explained — figure from US 2026/0162369 A1
FIG. 1A — rendered from the official USPTO publication PDF.
Publication number US 2026/0162369 A1
Applicant DISNEY ENTERPRISES, INC.
Filing date Dec 10, 2024
Publication date Jun 11, 2026
Inventors Prashanth CHANDRAN, Gaspard ZOSS, Derek Edward BRADLEY
CPC classification 345/419
Grant likelihood Medium
Examiner GOCO, JOHN PATRICK (Art Unit 2611)
Status Docketed New Case - Ready for Examination (Jan 22, 2025)
Document 20 claims

What Disney's 3D face-mapping system actually does

Imagine you're trying to mark exactly where someone's eyes, nose, and mouth are in a photo. A normal AI would scan the flat picture and guess. Disney's approach does something different: it first builds a rough 3D model of the face — like a digital sculpture — and then uses that shape to figure out where key points should land.

Those 3D markers get projected back down onto the original flat photo, like shining a light through a sculpture onto a wall. The AI then checks how well those projected dots match the real face in the image, and uses any mistakes to correct itself during training.

The result is a facial landmark detector that's taught itself using geometry, not just pixels. For Disney, which constantly needs to track and recreate actor faces for visual effects and animated characters, a more geometrically aware AI could mean cleaner, more consistent results across wildly different lighting and angles.

How the model folds 3D geometry into 2D landmark training

The patent describes a training pipeline for facial landmark detection — the task of precisely identifying key points on a face (eye corners, lip edges, nose tip, etc.) that downstream systems use for things like face replacement, animation rigging, or expression capture.

At its core, the system uses a morphable face model (a mathematical template that can be stretched and shaped to match different faces, similar to a digital clay mold). A machine learning model takes an input image and outputs a set of coefficients — numbers that control how that template is deformed to fit the face in the image.

Once the 3D shape is estimated, specific 3D landmark positions are computed from the model's geometry — points that are defined on the surface of the 3D face. Those points are then projected (mathematically flattened) back into the 2D image plane, producing estimated 2D landmark locations.

The clever part is the training loop: the system computes a loss (an error score measuring how far off the projected 2D landmarks are from ground-truth labels), and uses that error to update the model's weights. This means the model is implicitly learning 3D face geometry as a byproduct of getting 2D landmark placement right — without needing a separate 3D supervision signal.

What this means for digital face cloning and VFX

For a studio like Disney, facial landmark accuracy is load-bearing infrastructure. Whether you're digitally de-aging an actor, animating a photorealistic character, or doing performance capture on set, the precision of face tracking determines how believable the final result looks. A system that anchors its predictions in 3D geometry is inherently more robust to tricky angles or unusual lighting — the kinds of conditions that defeat purely pixel-based detectors.

Beyond VFX, this kind of parametric landmark detection feeds into broader AI pipelines for avatar creation, real-time face filters, and synthetic media. Disney filing in this space signals the company is investing in the foundational tooling that makes those downstream applications more reliable — not just for film, but potentially for theme park experiences, streaming interactivity, and character licensing at scale.

Editorial take

This is genuinely solid foundational work, not a flashy consumer feature. Using 3D morphable model geometry to supervise 2D landmark training is a well-motivated approach — it's the kind of thing that quietly makes everything downstream work better. Disney's VFX pipeline ambitions make this a natural fit, and it's worth paying attention to as synthetic actor technology becomes more central to how blockbusters get made.

Get one Big Tech patent every Sunday

Plain English, intelligent commentary, no hype. Free.

Source. Full patent text and figures from the official USPTO publication PDF.

Editorial commentary on a publicly published patent application. Not legal advice.