Adobe · Filed Nov 22, 2024 · Published May 28, 2026 · verified — real USPTO data

Adobe Patents an AI System That Deconstructs How Light Hits a Scene

Lighting is the hardest part of photo editing to fake — and Adobe's latest patent is aimed squarely at that problem. The filing describes an AI that can look at a photo and pull out a visual map of just one aspect of how light behaves in that scene, like albedo, shading, or specularity, on demand.

Adobe Patent: AI Intrinsic Layer Decomposition Explained — figure from US 2026/0148425 A1
FIG. 1A — rendered from the official USPTO publication PDF.
Publication number US 2026/0148425 A1
Applicant ADOBE INC.
Filing date Nov 22, 2024
Publication date May 28, 2026
Inventors Yangtuanfeng Wang, Jundan Luo, Duygu Ceylan Aksit, Jae Shin Yoon, Nanxuan Zhao, Julien Olivier Victor Philip, Anna Fruehstueck
CPC classification 345/418
Grant likelihood Medium
Examiner WU, YANNA (Art Unit 2615)
Status Response to Non-Final Office Action Entered and Forwarded to Examiner (May 13, 2026)
Document 20 claims

What Adobe's intrinsic-layer AI actually separates

Imagine you take a photo of a coffee mug on a wooden table. The final image is a blend of a lot of different things happening at once: the actual color of the mug's surface, the shadows cast across it, the glossy highlights where light bounces off the glaze, and the texture of the wood underneath. These are all mixed together in a single photo, and separating them out is genuinely hard — even for professional retouchers.

Adobe's patent describes a system where you give an AI model a photo and a text prompt that tells it which of those layers you want to see — something like "show me just the shading" or "give me the albedo map." The AI then produces a clean visual output representing only that layer.

This kind of decomposition is called intrinsic image analysis, and it's been a research topic for decades. Adobe's approach uses a modern latent diffusion model (the same family of AI behind image generators like Stable Diffusion) to do it — potentially making the process faster and more accessible inside tools like Photoshop or Firefly.

How the conditional encoder extracts light modalities

The system takes two inputs: an input image and an input prompt that specifies an "intrinsic modality" — a term for one particular way a scene interacts with light. Common intrinsic modalities include:

  • Albedo — the base color of a surface with all lighting removed
  • Shading — how light and shadow fall across the scene's geometry
  • Specularity — the bright highlights from reflective surfaces
  • Normals — a map of which direction each surface is facing

A conditional image encoder processes the input image and produces what the patent calls a condition embedding — a compressed numerical representation that captures information about the requested modality. Think of it as distilling "what does the shading in this image look like?" into a vector the generative model can work with.

That condition embedding, combined with the text prompt, gets fed into a latent diffusion model (an AI image generator that operates in a compressed "latent" space rather than pixel-by-pixel, which makes it faster). The model then synthesizes a clean output image representing just the requested intrinsic layer.

The key innovation is the joint framing: rather than training separate models for each intrinsic layer, the system handles multiple modalities through a single prompt-conditioned pipeline, with the encoder doing the heavy lifting of distinguishing between them.

What this means for AI-powered photo editing tools

For anyone doing compositing, relighting, or material editing in tools like Photoshop or After Effects, being able to extract clean intrinsic layers automatically is a big deal. Right now, getting a usable albedo or shading map from a real photograph typically requires either specialized capture setups or a lot of manual work. If Adobe can make this one-click, it changes what's practical for everyday creative work.

This also slots directly into Adobe Firefly's generative editing direction — if you can cleanly separate lighting from surface color in any photo, you can relight scenes realistically, swap materials, or composite elements in a way that actually looks physically correct. That's a gap that's currently hard to close with purely generative approaches.

Editorial take

This is solid, focused research from Adobe — not a flashy demo concept, but the kind of foundational capability that makes downstream creative tools dramatically more useful. Intrinsic decomposition has been a white whale in computational photography for a long time, and framing it as a prompt-conditioned diffusion task is a genuinely clever approach. If Adobe can ship this reliably on real-world photos, it's meaningful infrastructure for Firefly.

Get one Big Tech patent every Sunday

Plain English, intelligent commentary, no hype. Free.

Source. Full patent text and figures from the official USPTO publication PDF.

Editorial commentary on a publicly published patent application. Not legal advice.