Data Management

Google is deeply engaged in Data Management research across a variety of topics with strong connections to Google products. We are building intelligent systems to discover, annotate, and explore structured data from the Web, and to surface it creatively through Google products such as Search (e.g., structured snippets), Docs, and many others. The overarching goal is to create a wealth of structured data on the Web that maximally helps Google users consume, interact with, and explore information. Through these projects, we study various cutting-edge data management research issues, including information extraction and integration, large-scale data analysis, and effective data exploration, using a variety of techniques such as information retrieval, data mining, and machine learning.

A major research effort involves the management of structured data within the enterprise. The goal is to discover, index, monitor, and organize this type of data in order to make it easier to access high-quality datasets. This type of data carries different, and often richer, semantics than structured data on the Web, which in turn raises new opportunities and technical challenges in its management.

Furthermore, Data Management research across Google allows us to build technologies that power Google's largest businesses through scalable, reliable, fast, and general-purpose infrastructure for large-scale data processing as a service. Some examples of such technologies include F1, the database serving our ads infrastructure; Mesa, a petabyte-scale analytic data warehousing system; and Dremel, for petabyte-scale data processing with interactive response times. Dremel is available for external customers to use as part of Google Cloud’s BigQuery.
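Dremel is exposed externally through BigQuery, so a flavor of this infrastructure is available with a few lines of client code. The sketch below runs an interactive aggregation query using the google-cloud-bigquery Python client; it assumes Google Cloud credentials are already configured and uses a public sample table purely for illustration.

```python
# A minimal sketch of an interactive query against BigQuery (the externally
# available service backed by Dremel). Assumes Google Cloud credentials are
# configured in the environment; the table is a public sample dataset.
from google.cloud import bigquery

client = bigquery.Client()  # picks up the default project from the environment

query = """
    SELECT word, SUM(word_count) AS total
    FROM `bigquery-public-data.samples.shakespeare`
    GROUP BY word
    ORDER BY total DESC
    LIMIT 10
"""

# Dremel executes the scan in parallel; results stream back interactively.
for row in client.query(query).result():
    print(f"{row.word}: {row.total}")
```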

Recent Publications

Vortex is an exabyte-scale structured storage system built for streaming and batch analytics. It supports high-throughput batch and stream ingestion and, for the user, both batch-oriented and stream-based processing of the ingested data.
BigLake: BigQuery’s Evolution toward a Multi-Cloud Lakehouse
Justin Levandoski, Garrett Casto, Mingge Deng, Rushabh Desai, Thibaud Hottelier, Amir Hormati, Jeff Johnson, Dawid Kurzyniec, Prem Ramanathan, Gaurav Saxena, Vidya Shanmugam, Yuri Volobuev
SIGMOD (2024)
BigQuery’s cloud-native disaggregated architecture has allowed Google Cloud to evolve the system to meet several customer needs across the analytics and AI/ML workload spectrum. A key customer requirement for BigQuery centers on the unification of data lake and enterprise data warehousing workloads. This approach combines (1) the core data management primitives provided by an enterprise data warehouse, e.g., security, governance, common runtime metadata, performance acceleration, and ACID transactions, with (2) the flexibility of the open-source format and analytics ecosystem, along with new workload types such as AI/ML over unstructured data on object storage. In addition, there is a strong requirement to support BigQuery as a multi-cloud offering, given that cloud customers are opting for a multi-cloud footprint by default. This paper describes BigLake, an evolution of BigQuery toward a multi-cloud lakehouse that addresses these customer requirements in novel ways. We describe three main innovations in this space. We first present BigLake tables, which make open-source table formats (e.g., Apache Parquet, Iceberg) first-class citizens, providing fine-grained governance enforcement and performance acceleration over these formats to BigQuery and other open-source analytics engines. Next, we cover the design and implementation of BigLake Object tables, which allow BigQuery to integrate AI/ML for inferencing and processing over unstructured data. Finally, we present Omni, a platform for deploying BigQuery on non-GCP clouds, focusing on the infrastructure and operational innovations we made to provide an enterprise lakehouse product regardless of the cloud provider hosting the data.
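To make the BigLake tables idea concrete, the sketch below defines an external table over Parquet files on object storage via BigQuery DDL issued from the Python client. The project, dataset, connection, and bucket names are hypothetical placeholders, and the snippet is an illustration of the pattern rather than a canonical recipe.

```python
# A hedged sketch: creating a BigLake table over Parquet files in object
# storage using BigQuery DDL. All names below are hypothetical placeholders.
from google.cloud import bigquery

client = bigquery.Client(project="my-project")  # hypothetical project

ddl = """
    CREATE EXTERNAL TABLE my_dataset.sales_biglake
    WITH CONNECTION `my-project.us.my-connection`
    OPTIONS (
      format = 'PARQUET',
      uris = ['gs://my-bucket/sales/*.parquet']
    )
"""

# Once created, the table is queryable like a native table, with fine-grained
# governance enforced by BigQuery rather than left to each analytics engine.
client.query(ddl).result()
```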
Table-based reasoning with large language models (LLMs) is a promising direction for tackling many table understanding tasks, such as table-based question answering and fact verification. Compared with generic reasoning, table-based reasoning requires extracting the underlying semantics from both free-form questions and semi-structured tabular data. Chain-of-Thought and similar approaches incorporate the reasoning chain in the form of textual context, but it remains an open question how to effectively leverage tabular data in the reasoning chain. We propose the Chain-of-Table framework, in which tabular data is explicitly used in the reasoning chain as a proxy for intermediate thoughts. Specifically, we guide LLMs using in-context learning to iteratively generate operations and update the table to represent a tabular reasoning chain. LLMs can therefore dynamically plan the next operation based on the results of the previous ones. This continuous evolution of the table forms a chain that shows the reasoning process for a given tabular problem. The chain carries structured information about the intermediate results, enabling more accurate and reliable predictions. Chain-of-Table achieves new state-of-the-art performance on the WikiTQ, FeTaQA, and TabFact benchmarks across multiple LLM choices.
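The iterative plan-and-apply loop the abstract describes might look roughly like the sketch below, where `llm()` is a hypothetical stand-in for any chat-completion call and the operation set is a simplified subset chosen for illustration; this is not the authors' reference implementation.

```python
# A schematic sketch of the Chain-of-Table loop: the LLM repeatedly picks a
# table operation, we apply it, and the evolving table is fed back as context.
# `llm()` is a hypothetical stand-in for an LLM call; the operation set is a
# simplified subset for illustration.
import pandas as pd

def llm(prompt: str) -> str:
    """Hypothetical LLM call; returns e.g. 'sort_by:population' or 'answer:Paris'."""
    raise NotImplementedError

OPS = {
    "select_column": lambda df, arg: df[[c.strip() for c in arg.split(",")]],
    "sort_by":       lambda df, arg: df.sort_values(arg),
    "head":          lambda df, arg: df.head(int(arg)),
}

def chain_of_table(df: pd.DataFrame, question: str, max_steps: int = 5) -> str:
    for _ in range(max_steps):
        prompt = (f"Question: {question}\nTable:\n{df.to_markdown()}\n"
                  f"Reply with op:arg from {list(OPS)} or answer:<final answer>.")
        op, _, arg = llm(prompt).partition(":")
        if op == "answer":
            return arg            # the model judged the table now supports an answer
        df = OPS[op](df, arg)     # apply the operation; the new table extends the chain
    return llm(f"Question: {question}\nTable:\n{df.to_markdown()}\nAnswer directly.")
```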
Automatic Histograms: Leveraging Language Models for Text Dataset Exploration
Extended Abstracts of the CHI Conference on Human Factors in Computing Systems (CHI EA '24), ACM, Honolulu, HI, USA (2024), 9 pages
Making sense of unstructured text datasets is perennially difficult, yet increasingly relevant with large language models. Data practitioners often rely on dataset summaries, especially distributions of various derived features. Some features, like toxicity or topics, are relevant to many datasets, but many interesting features are domain specific: e.g., instruments and genres for a music dataset, or diseases and symptoms for a medical dataset. Accordingly, data practitioners often run custom analyses for each dataset, which is cumbersome and difficult, or use unsupervised methods. We present AutoHistograms, a visualization tool that leverages LLMs. AutoHistograms automatically identifies relevant entity-based features, visualizes their distributions, and allows the user to interactively query the dataset for new categories of entities. In a user study with data practitioners (n=10), we observed that participants were able to quickly onboard to AutoHistograms, use the tool to identify actionable insights, and conceptualize a broad range of applicable use cases. We also describe a variety of usage scenarios from different types of users to highlight how the tool can provide value in many different contexts. Finally, we present a quantitative evaluation of the tool. Together, this tool and user study contribute to the growing field of LLM-assisted sensemaking tools.
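The pipeline the abstract describes, extracting entity-based features with an LLM and then histogramming them, can be approximated with a short sketch like the one below; `llm_extract_entities()` is a hypothetical prompted-LLM call, not part of the published tool.

```python
# A rough sketch of an AutoHistograms-style pipeline: ask an LLM to tag each
# document with entities of a given category, then histogram the results.
# `llm_extract_entities` is a hypothetical prompted-LLM call, for illustration.
from collections import Counter

def llm_extract_entities(text: str, category: str) -> list[str]:
    """Hypothetical: prompt an LLM to list entities of `category` found in `text`."""
    raise NotImplementedError

def auto_histogram(docs: list[str], category: str) -> Counter:
    counts = Counter()
    for doc in docs:
        counts.update(llm_extract_entities(doc, category))
    return counts

# Usage: distribution of instruments in a (hypothetical) music dataset.
# hist = auto_histogram(music_reviews, "musical instrument")
# print(hist.most_common(10))
```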
In-path Oracles for Road Networks
Debajyoti Ghosh, Kiran Khatter, Hanan Samet
International Journal of Geo-Information, 12(7) (2023), 277
Many spatial applications benefit from the fast answering of a seemingly simple spatial query: is a point of interest (POI) "in-path" to the shortest path between a source and a destination? In-path here refers to POIs that are either on the shortest path or reachable within a bounded yet small detour from it. Answering in-path queries quickly is contingent on being able to determine whether a POI is in-path without computing shortest paths at run-time, which calls for a precomputation solution. The key technical contribution is an in-path oracle, built by precomputation, that records the pairs of sources and destinations that are in-path with respect to a given POI location. For a road network with $n$ nodes and $m$ POIs, an $O(m \times n)$-sized oracle is envisioned based on a reduction to the well-separated pair decomposition of the road network. Furthermore, the oracle can be indexed in a database using a B-tree, and hundreds of thousands of in-path queries per second can be answered. Experimental results on a real road-network POI dataset showcase the superiority of this technique over a suitable baseline: the proposed approach answers 1.5 million in-path queries per second, compared with a few hundred per second for existing approaches.
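To pin down the predicate the oracle answers, the sketch below checks the bounded-detour condition directly with networkx shortest-path distances. A real oracle precomputes this answer for source/destination pairs per POI rather than computing paths at query time, and the multiplicative $(1+\epsilon)$ detour definition used here is a common formalization assumed for illustration, not taken from the paper.

```python
# A naive reference check for the in-path predicate: POI p is "in-path" for
# (s, d) if routing via p stays within a (1 + eps) detour of the shortest
# path. The paper's oracle precomputes this; here we compute it directly with
# networkx, purely to illustrate the predicate being indexed.
import networkx as nx

def in_path(G: nx.Graph, s, d, poi, eps: float = 0.1) -> bool:
    via = (nx.shortest_path_length(G, s, poi, weight="weight")
           + nx.shortest_path_length(G, poi, d, weight="weight"))
    direct = nx.shortest_path_length(G, s, d, weight="weight")
    return via <= (1 + eps) * direct

# Tiny usage example on a toy road network.
G = nx.Graph()
G.add_weighted_edges_from([("s", "a", 1), ("a", "d", 1), ("s", "b", 1.05),
                           ("b", "d", 1.05), ("s", "d", 3)])
print(in_path(G, "s", "d", "a"))        # True: "a" lies on the shortest path
print(in_path(G, "s", "d", "b", 0.1))   # True: detour via "b" is within 10%
```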
Firestore: The NoSQL Serverless Database for the Application Developer
Ram Kesavan, David Gay, Daniel Thevessen, Jimit Shah, C. Mohan
2023 IEEE 39th International Conference on Data Engineering (ICDE), pp. 3367-3379
Recent years have seen explosive growth in web and mobile application development. Such applications typically have rapid development cycles and expect mobile-friendly features and serverless characteristics such as rapid deployment (with minimal provisioning), scalability to handle workload spikes, and convenient pay-as-you-go billing. Google’s Firestore is a NoSQL serverless database with real-time notification capability; together with the Firebase ecosystem, it greatly simplifies common app development challenges while letting application developers focus primarily on their business logic and user experience. This paper presents the Firestore architecture, how it satisfies the aforementioned requirements, and how its real-time notification system works in tandem with the Firebase client libraries to let mobile applications provide a smooth user experience even across network connectivity issues.
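As a small illustration of the real-time notification capability the abstract highlights, the sketch below writes a document and attaches a snapshot listener using the google-cloud-firestore Python client; the collection and field names are hypothetical, and configured Google Cloud credentials are assumed.

```python
# A minimal sketch of Firestore's real-time notifications: write a document,
# then listen for subsequent changes. Collection and field names are
# hypothetical; assumes Google Cloud credentials are configured.
import threading
from google.cloud import firestore

db = firestore.Client()
doc_ref = db.collection("game_rooms").document("room42")
doc_ref.set({"score": 0})

done = threading.Event()

def on_change(doc_snapshots, changes, read_time):
    # Invoked by the client library whenever the document changes,
    # including for the initial snapshot.
    for snap in doc_snapshots:
        print(f"{snap.id} -> {snap.to_dict()}")
    done.set()

watch = doc_ref.on_snapshot(on_change)  # server pushes updates to this client
doc_ref.update({"score": 1})            # triggers the listener
done.wait(timeout=10)
watch.unsubscribe()
```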