Johannes von Oswald

My research is focused on AI, neural network architectures, learning algorithms, mechanistic interpretability, mesa-optimization and meta-learning as well as reinforcement learning.

Research Areas

Algorithms and Theory
Machine Intelligence

Authored Publications

MesaNet: Sequence Modelling by Locally Optimal test-Time Training

Nino Scherrer

Songlin Yang

Blaise Aguera-Arcas

Razvan Pascanu

Alexander Meulemans

Seijin Kobayashi

Yanick Schimpf

Kaitlin Maile

João Sacramento

Maximilian Schlegel

Luca Versari

Oliver Sieberling

Johannes von Oswald

Rif A. Saurous

2025

Learning Randomized Algorithms with Transformers

Johannes von Oswald

Seijin Kobayashi

Yassir Akram

Angelika Steger

2024

Weight decay induces low-rank attention layers

Seijin Kobayashi

Yassir Akram

Johannes von Oswald

2024

Transformers learn in-context by gradient descent

Johannes von Oswald

Eyvind Niklasson

Ettore Randazzo

João Sacramento

Alexander Mordvintsev

Andrey Zhmoginov

Max Vladymyrov

International Conference on Machine Learning (2023), pp. 35151-35174

Search on Google Scholar

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Johannes von Oswald

Research Areas

Join us

Google Ai

Google Cloud

Google DeepMind

Google Labs

Johannes von Oswald

Research Areas

Filter by:

Publications

Years

Research Areas

Teams

Join us