Natural Language Processing

Natural Language Processing (NLP) research at Google focuses on algorithms that apply at scale, across languages, and across domains. Our systems are used in numerous ways across Google, impacting user experience in search, mobile, apps, ads, translate and more.

Our work spans the range of traditional NLP tasks, with general-purpose syntax and semantic algorithms underpinning more specialized systems. We are particularly interested in algorithms that scale well and can be run efficiently in a highly distributed environment.

Our syntactic systems predict part-of-speech tags for each word in a given sentence, as well as morphological features such as gender and number. They also label relationships between words, such as subject, object, modification, and others. We focus on efficient algorithms that leverage large amounts of unlabeled data, and recently have incorporated neural net technology.

On the semantic side, we identify entities in free text, label them with types (such as person, location, or organization), cluster mentions of those entities within and across documents (coreference resolution), and resolve the entities to the Knowledge Graph.

Recent work has focused on incorporating multiple sources of knowledge and information to aid with analysis of text, as well as applying frame semantics at the noun phrase, sentence, and document level.

Recent Publications

See2Refine: Vision-Language Feedback Improves LLM-Based eHMI Action Designers

Ding Xia

Xinyue Gui

Mark Colley

Fan Gao

Zhongyi Zhou

Dongyuan Li

Renhe Jiang

Takeo Igarashi

ACL 26 (2026)

ToolGrad: Efficient Tool-use Dataset Generation with Textual "Gradients"

Zhongyi Zhou

Kohei Uehara

Haoyu Zhang

Jingtao Zhou

Lin Gu

Ruofei Du

Zheng Xu

Tatsuya Harada

ACL 2026 (2026)

VaultGemma

Lynn Chua

Pasin Manurangsi

Prem Eruvbetine

Chiyuan Zhang

Thomas Mesnard

Badih Ghazi

Borja De Balle Pigem

Daogao Liu

Amer Sinha

Pritish Kamath

Yangsibo Huang

Christopher A. Choquette-Choo

George Kaissis

Tris Warkentin

Armand Joulin

Ravi Kumar

Andreas Terzis

Da Yu

Zachary Charles

Ryan McKenna

Ruibo Liu

arxiv (2025)

Towards Conversational Diagnostic AI

Tao Tu

Anil Palepu

Mike Schaekermann

Khaled Saab

Jan Freyberg

Ryutaro Tanno

Amy Wang

Brenna Li

Mohamed Amin

Nenad Tomašev

Shekoofeh Azizi

Karan Singhal

Yong Cheng

Le Hou

Albert Webson

Kavita Kulkarni

Sara Mahdavi

Christopher Semturs

Juro Gottweis

Joelle Barral

Kat Chou

Greg Corrado

Yossi Matias

Alan Karthikesalingam

Vivek Natarajan

Nature (2025) (to appear)

RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation

Aviv Slobodkin

Hagai Taitelbaum

Yonatan Bitton

Brian Gordon

Michal Sokolik

Almog Gueta

Royi Rassin

Dani Lischinski

Idan Szpektor

2025

Sufficient Context: A New Lens on Retrieval Augmented Generation Systems

Hailey Joren

Jianyi Zhang

Chun-Sung Ferng

Da-Cheng Juan

Ankur Taly

Cyrus Rashtchian

International Conference on Learning Representations (ICLR) (2025)

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Natural Language Processing

Recent Publications

Some of our teams

Join us

Google Ai

Google Cloud

Google DeepMind

Google Labs