New Google Patents · Filed Feb 17, 2025 · Published Jun 4, 2026 · verified — real USPTO data

Google Patents an AI That Reads Your Phone Calls and Suggests What to Say Next

By Patentlyze Team · Updated Jun 5, 2026

Google has filed a patent application for a system in which an AI chatbot listens to the other person on a phone call and starts generating possible replies for you before they have finished their sentence.

FIG. 1A — rendered from the official USPTO publication PDF.

Publication number: US 2026/0155134 A1

Applicant: GOOGLE LLC

Filing date: Feb 17, 2025

Publication date: Jun 4, 2026

Inventors: Yoav Tzur, Ran El Manor, Liran Peretz

CPC classification: 704/251

Grant likelihood: Medium

Examiner: CENTRAL, DOCKET (Art Unit OPAP)

Status: Docketed New Case - Ready for Examination (Mar 12, 2025)

Parent application: Claims priority from provisional application 63727511 (filed Dec 3, 2024)

Document: 20 claims

AI/ML

What Google's mid-call suggestion chips actually do

Imagine you're on a phone call, but instead of talking yourself, an AI chatbot handles the conversation on your behalf. As the other person speaks, your screen shows a row of suggested reply buttons. Tap one and the AI says it aloud to the other person in synthesized speech.

The twist in this patent application is the timing. The AI doesn't wait for the other person to finish their sentence before it starts thinking. It begins generating reply suggestions the moment they start talking, so the chips can appear on your screen almost as soon as they stop. If the rest of the sentence changes the meaning, the AI replaces those early suggestions with new ones based on the full sentence.

This looks like an extension of Google's existing Duplex-style AI calling features, the kind that can book a restaurant reservation for you. The new wrinkle is real-time, mid-sentence prediction to make the back-and-forth feel faster and less robotic.

How the system reads partial speech to pre-generate replies

The system runs two parallel tracks against the live audio stream of a phone call.

Track 1 (early prediction): The moment the other caller starts speaking, the system feeds that initial audio fragment into the chatbot model. The model generates one or more suggestion chips (think of them as quick-reply buttons in a messaging app). Each chip carries a pre-generated response that seems plausible given the partial utterance.

Track 2 (confirmation or correction): As the rest of the sentence arrives, the system processes the subsequent audio. It then makes a go/no-go call: do the early chips still make sense for the full utterance? If yes, it renders them on the user's screen immediately. If no, it discards them and generates a fresh set based on the complete sentence.

If the user taps a chip, the chatbot speaks the corresponding suggestion aloud to the other caller via text-to-speech synthesis.
The entire loop (listen, predict, confirm, display, speak) is designed to run fast enough to feel conversational rather than laggy.
The user's client device (likely a phone or tablet) shows the chips; the remote caller's device plays back the synthesized audio response.

The key technical bet here is that the beginning of a spoken sentence is usually enough to predict the general category of reply needed, before the system knows how the sentence ends.
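The two tracks above can be sketched in a few lines of Python. This is a minimal illustration, not the method claimed in the application: it works on text transcripts instead of audio (assuming speech recognition happens upstream), and it replaces the chatbot model with a hypothetical keyword lookup. The names `classify_intent`, `generate_chips`, `handle_turn`, and `speak`, the keyword tables, and the rule "keep the early chips if the intent has not changed" are all assumptions made for this example.

```python
from dataclasses import dataclass

# Hypothetical keyword tables standing in for a real chatbot model.
INTENT_KEYWORDS = {
    "confirm_time": ("what time", "when"),
    "party_size": ("how many",),
    "decline": ("fully booked", "no availability"),
}

CHIPS_BY_INTENT = {
    "confirm_time": ["7 pm works", "Any time after 6 pm"],
    "party_size": ["Two people", "Four people"],
    "decline": ["How about tomorrow?", "Thanks anyway"],
    "unknown": ["Could you repeat that?"],
}


def classify_intent(transcript: str) -> str:
    """Guess the reply category from a (possibly partial) transcript."""
    text = transcript.lower()
    for intent, phrases in INTENT_KEYWORDS.items():
        if any(phrase in text for phrase in phrases):
            return intent
    return "unknown"


def generate_chips(transcript: str) -> list:
    """Stand-in for the model call that produces suggestion chips."""
    return list(CHIPS_BY_INTENT[classify_intent(transcript)])


@dataclass
class TurnResult:
    chips: list
    reused_early_chips: bool


def handle_turn(partial: str, full: str) -> TurnResult:
    # Track 1: speculate from the start of the utterance.
    early_intent = classify_intent(partial)
    early_chips = generate_chips(partial)

    # Track 2: once the full utterance is known, keep or replace the chips.
    # Assumed validity rule: the early chips survive if the intent is unchanged.
    if classify_intent(full) == early_intent:
        return TurnResult(chips=early_chips, reused_early_chips=True)
    return TurnResult(chips=generate_chips(full), reused_early_chips=False)


def speak(chip: str) -> str:
    """Stand-in for text-to-speech; returns a marker string instead of audio."""
    return f"[synthesized speech] {chip}"
```

Calling `handle_turn("What time", "What time would you like to come in?")` keeps the early chips, because the opening words already point to a time question. Calling `handle_turn("Sorry, we are", "Sorry, we are fully booked tonight")` discards them, because the meaning only becomes clear at the end of the sentence.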

What this means for AI-assisted phone calls

For anyone who uses, or would want to use, an AI proxy to handle routine phone calls (think: scheduling, customer service, quick info requests), this could shrink the awkward pause before each reply. Current AI calling systems often feel slow because they process the full utterance before responding. Pre-generating suggestions from partial audio lets the generation work overlap with the speaker's remaining words, so less of it is left to do once they stop talking.
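A rough back-of-the-envelope model shows why overlapping generation with speech can help. Everything here is an assumption for illustration: the function names, the simple additive timing model, and the idea of a fixed "hit rate" for early predictions are not taken from the patent application, and no timing figures are published in it.

```python
def wait_for_full_latency(generation_s: float) -> float:
    """Delay after the speaker stops when generation only starts at that point."""
    return generation_s


def speculative_latency(
    generation_s: float,
    remaining_speech_s: float,
    confirm_s: float,
    hit: bool,
) -> float:
    """Delay after the speaker stops when generation starts early.

    Assumed model: early generation runs while the speaker is still talking,
    so only the part that did not finish in time spills past the end of speech.
    """
    spill = max(0.0, generation_s - remaining_speech_s)
    if hit:
        # Early chips are confirmed and shown.
        return spill + confirm_s
    # Early chips are discarded and a fresh set is generated.
    return spill + confirm_s + generation_s


def expected_speculative_latency(
    generation_s: float,
    remaining_speech_s: float,
    confirm_s: float,
    hit_rate: float,
) -> float:
    """Average delay, weighting hits and misses by an assumed hit rate."""
    hit = speculative_latency(generation_s, remaining_speech_s, confirm_s, True)
    miss = speculative_latency(generation_s, remaining_speech_s, confirm_s, False)
    return hit_rate * hit + (1.0 - hit_rate) * miss
```

With made-up inputs of 0.8 s to generate chips, 1.5 s of speech remaining, 0.1 s to confirm, and a 70% hit rate, the expected wait falls from 0.8 s to roughly 0.34 s. In this model, when generation finishes before the speaker does, speculation pays off whenever the confirmation cost is smaller than the hit rate times the generation time.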

It also widens the group of people this technology could help. People with speech or communication disabilities who rely on AI-assisted calling stand to benefit from faster, more accurate suggestion chips. And for everyone else, it nudges AI phone assistants closer to feeling like a real, responsive conversation rather than a voice-menu maze.

Editorial take

This is a genuinely clever engineering approach to a real problem — AI phone assistants sound slow because they wait for complete sentences. Google is essentially betting on predictive pre-computation to shave off that lag, which is the right problem to be solving. The fact that it also handles the case where the early prediction is wrong (swapping in new chips) shows this isn't just an optimization hack but a more robust system design.

Source. Full patent text and figures from the official USPTO publication PDF.

Editorial commentary on a publicly published patent application. Not legal advice.
