Craig Boutilier

Craig Boutilier is Principal Scientist at Google. He works on various aspects of decision making under uncertainty, with a current focus on sequential decision models: reinforcement learning, Markov decision processes, temporal models, etc.

Positions and Appointments:
He was a Professor in the Department of Computer Science at the University of Toronto (on leave) and Canada Research Chair in Adaptive Decision Making for Intelligent Systems. He received his Ph.D. in Computer Science from the University of Toronto in 1992, and worked as an Assistant and Associate Professor at the University of British Columbia from 1991 until his return to Toronto in 1999. He served as Chair of the Department of Computer Science at Toronto from 2004-2010. He was co-founder (with Tyler Lu) of Granata Decision Systems from 2012-2015, until his move to Google in 2015.

Boutilier was a consulting professor at Stanford University from 1998-2000, an adjunct professor at the University of British Columbia from 1999-2010, and a visiting professor at Brown University in 1998, at the University of Toronto in 1997-98, at Carnegie Mellon University in 2008-09, and at Université Paris-Dauphine (Paris IX) in the spring of 2011. He served on the Technical Advisory Board of CombineNet, Inc. from 2001 to 2010.

Research:
Boutilier's current research efforts focus on various aspects of decision making under uncertainty, including the use of generative models and LLMs, in areas such as: recommender systems, preference modeling and elicitation, mechanism design, game theory and multiagent decision processes, economic models, social choice, computational advertising, Markov decision processes, reinforcement learning and probabilistic inference. His research interests have spanned a wide range of topics, from knowledge representation, belief revision, default reasoning, and philosophical logic, to probabilistic reasoning, decision making under uncertainty, multiagent systems, and machine learning.

Research & Academic Service:
Boutilier is a past Editor-in-Chief of the Journal of Artificial Intelligence Research (JAIR). He was a past Associate Editor with the ACM Transactions on Economics and Computation (TEAC), the Journal of Artificial Intelligence Research (JAIR), the Journal of Machine Learning Research (JMLR), and Autonomous Agents and Multiagent Systems (AAMAS); and he has sat on the editorial/advisory boards of several other journals. Boutilier has organized several international conferences and workshops, including his work as Program Chair of the Twenty-first International Joint Conference on Artificial Intelligence (IJCAI-09) and Program Chair of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI-2000). He has also served on the conference program committees of roughly 75 leading international conferences.

He will serve as Conference Chair of the Thirty-seventh International Joint Conference on Artificial Intelligence (IJCAI-28).

Awards and Honors:
Boutilier is a Fellow of the Royal Society of Canada (RSC), the Association for Computing Machinery (ACM) and the Association for the Advancement of Artificial Intelligence (AAAI). He was the recipient of the 2018 ACM/SIGAI Autonomous Agents Research Award, He was awarded a Tier I Canada Research Chair, an Isaac Walton Killam Research Fellowship, and an IBM Faculty Award. He received the Killam Teaching Award from the University of British Columbia in 1997. He has also received a number of Best Paper awards including:

the 2009 IJCAI-JAIR Best Paper Prize (with R. Brafman, C. Domshlak, H. Hoos, D. Poole, from the Journal of Artificial Intelligence Research);
the 2014 AIJ Prominent Paper Award (with S. Sanner, from the journal Artificial Intelligence);
the 2018 NeurIPS Best Paper Award (w. T. Lu, D. Schuurmans);
the 2022 AIJ Prominent Paper Award (with I. Caragiannis, S. Haber, T. Lu, A. Procaccia and O. Sheffet, from the journal Artificial Intelligence);
the 2023 IFAAMAS Influential Paper Award (with C. Claus, from the International Foundation for Autonomous Agents and Multiagent Systems)

Research Areas

Authored Publications

ConvApparel: A Benchmark Dataset and Validation Framework for User Simulators in Conversational Recommenders

Ofer Meshi

Krisztian Balog

Sally Goldman

Avi Caciularu

Guy Tennenholtz

Jihwan Jeong

Amir Globerson

Craig Boutilier

Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (EACL-26), Rabat, Morocco (2026), pp. 5270-5304

Reinforcement Learning with Discrete Diffusion Policies for Combinatorial Action-Spaces

Haitong Ma

Ofir Nabati

Aviv Rosenberg

Bo Dai

Oran Lang

Craig Boutilier

Na Li

Shie Mannor

Lior Shani

Guy Tennenholtz

Proceedings of the 43rd International Conference on Machine Learning (ICML-26), Seoul, South Korea (2026)

Efficient, Property-Aligned Fan-Out Retrieval via RL-Compiled Diffusion

Patrick Jiang

Judith Li

Moonkyung Ryu

Lily Hu

Kun Su

Zhong Yi Wan

Liam Hebert

Hao Peng

Jiawei Han

Dima Kuzmin

Craig Boutilier

Proceedings of the 43rd International Conference on Machine Learning (ICML-26), Seoul, South Korea (2026)

Diffusion Controller: Framework, Algorithms and Parameterization

Tong Yang

Moonkyung Ryu

Chih-wei Hsu

Guy Tennenholtz

Yuejie Chi

Craig Boutilier

Bo Dai

Proceedings of the 43rd International Conference on Machine Learning (ICML-26), Seoul, South Korea (2026)

ConvApparel: A Benchmark Dataset and Validation Framework for User Simulators in Conversational Recommenders

Ofer Meshi

Krisztian Balog

Sally Goldman

Avi Caciularu

Guy Tennenholtz

Jihwan Jeong

Amir Globerson

Craig Boutilier

The 19th Conference of the European Chapter of the Association for Computational Linguistics (EACL-26), Rabat, Morocco (2026)

Efficient, Property-Aligned Fan-Out Retrieval via RL-Compiled Diffusion

Pengcheng Jiang

Judith Yue Li

Moonkyung Ryu

R. Lily Hu

Kun Su

Zhong Yi Wan

Liam Hebert

Hao Peng

Jiawei Han

Dima Kuzmin

Craig Boutilier

2026

Preference Adaptive and Sequential Text-to-Image Generation

Ofir Nabati

Guy Tennenholtz

Chih-wei Hsu

Moonkyung Ryu

Deepak Ramachandran

Yinlam Chow

Sean Li

Craig Boutilier

42nd International Conference on Machine Learning (ICML-25), Vancouver (2025), pp. 45362-45394

Synthetic Dialogue Generation for Interactive Conversational Elicitation & Recommendation (ICER)

Moonkyung Ryu

Chih-wei Hsu

Yinlam Chow

Mohammad Ghavamzadeh

Craig Boutilier

GENNEXT@SIGIR’25: The 1st Workshop on Next Generation of IR and Recommender Systems with Language Agents, Generative Models, and Conversational AI (2025)

Asking Clarifying Questions for Preference Elicitation with Large Language Models

Ali Montazer

Guy Tennenholtz

Craig Boutilier

Ofer Meshi

(2025)

Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models

Yinlam Chow

Guy Tennenholtz

Izzeddin Gur

Vincent Zhuang

Bo Dai

Aviral Kumar

Rishabh Agarwal

Sridhar Thiagarajan

Craig Boutilier

Aleksandra Faust

Proceedings of the 13th International Conference on Learning Representations (ICLR-25), Singapore (2025)

Search on Google Scholar

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Craig Boutilier

Research Areas

Join us

Google Ai

Google Cloud

Google DeepMind

Google Labs

Craig Boutilier

Research Areas

Filter by:

Publications

Years

Research Areas

Teams

Join us