IBM · Filed Dec 20, 2024 · Published Jun 25, 2026 · verified — real USPTO data

IBM Patents Image Compression That Cuts Storage Without Sacrificing Picture Quality

IBM's new patent describes a way to compress images by doing the heavy, approximate math cheaply and the fine-tuning math carefully, mixing two very different levels of numeric precision in a single pipeline to get the best of both worlds.

IBM Patent: Mixed-Precision Matrix Sketching for Image Compression — figure from US 2026/0181136 A1
FIG. 1A — rendered from the official USPTO publication PDF.
Publication number US 2026/0181136 A1
Applicant International Business Machines Corporation
Filing date Dec 20, 2024
Publication date Jun 25, 2026
Inventors Vasileios Kalantzis, Tayfun Gokmen, Shashanka Ubaru, Lior Horesh
CPC classification 382/232
Grant likelihood Medium
Examiner CENTRAL, DOCKET (Art Unit OPAP)
Status Docketed New Case - Ready for Examination (Jan 29, 2025)
Document 20 claims

What IBM's mixed-precision image compression actually does

Imagine you're squeezing a huge photo down to a smaller file. Normally the computer does all the crunching at the same level of precision throughout, which is either slow and accurate or fast and sloppy.

IBM's patent describes a smarter split: the first, roughest pass of the math runs in a low-precision format (8-bit floating point, the kind of number-crunching used in AI chips to go fast), while the step that nails down the final quality runs in high-precision mode (64-bit). The idea is that you only need the expensive precision where it actually changes the result.

The end product is a compressed version of the image built from just the most important mathematical "ingredients," called singular values and vectors. You keep only as many of those ingredients as you need for the quality level you want, and throw the rest away. The patent claims this approach reduces both storage space and the bandwidth needed to send images around.

How the sketching and decomposition pipeline splits precision

The method is a variant of Singular Value Decomposition (SVD), a classic technique for compressing images and data by breaking a matrix (a grid of pixel values) down into a ranked list of components, then keeping only the top-k most important ones.

What makes this filing distinct is the mixed-precision sketching step. Rather than running the full SVD directly on the large input matrix, the algorithm first creates a sketching matrix (a smaller, randomized proxy that captures the rough shape of the data) and multiplies the two together using 8-bit floating-point arithmetic. That low-precision multiplication is fast and hardware-friendly, especially on modern AI accelerators.

From that product, a basis matrix Q is extracted using high-precision 64-bit math. That basis pins down the geometry of the image's important content accurately. A smaller matrix B is then derived from Q, and SVD is run on that smaller matrix instead of the full original, yielding:

  • Left singular vectors U (how image content is distributed across rows)
  • Right singular vectors V (how it's distributed across columns)
  • Singular values Σ (how important each component is)

Only the top-k components are kept, effectively discarding the least important detail and producing a compressed representation.

What this means for storage-heavy AI and cloud image pipelines

For everyday photo storage the gains here may be modest, but for large-scale data pipelines, the argument gets interesting. Data centers processing millions of images, AI training systems ingesting visual datasets, or satellite imaging workflows all pay real costs in storage and network bandwidth. A compression method that offloads the bulk of its arithmetic to cheap 8-bit hardware without sacrificing final quality could meaningfully cut those costs.

There's also a hardware angle. The 8-bit precision used in the sketching step matches exactly what modern AI accelerator chips (like those in IBM's own systems) are optimized to run fast. Designing compression algorithms around that reality, rather than treating it as a compromise, is the more interesting idea here.

Editorial take

This is a competent, incremental improvement to a decades-old compression technique rather than a new approach to image compression. The genuine contribution is the insight that you can exploit cheap AI-chip arithmetic for the noisy sketch step and reserve expensive precision only for the part that matters. That's pragmatic and worth filing. It's unlikely to change how your phone saves photos, but it could improve throughput in IBM's enterprise and cloud offerings.

Get one Big Tech patent every Sunday

Plain English, intelligent commentary, no hype. Free.

Source. Full patent text and figures from the official USPTO publication PDF.

Editorial commentary on a publicly published patent application. Not legal advice.