Generating Wikipedia by Summarizing Long Sequences

Peter J. Liu; Mohammad Ahmad Saleh; Etienne Pot; Ben Goodrich; Ryan Sepassi; Lukasz Kaiser; Noam Shazeer

Generating Wikipedia by Summarizing Long Sequences

Peter J. Liu

Mohammad Ahmad Saleh

Etienne Pot

Ben Goodrich

Ryan Sepassi

Lukasz Kaiser

Noam Shazeer

ICLR (2018)

Download Google Scholar

Abstract

We show that generating English Wikipedia articles can be approached as a multi-
document summarization of source documents. We use extractive summarization
to coarsely identify salient information and a neural abstractive model to generate
the article. For the abstractive model, we introduce a decoder-only architecture
that can scalably attend to very long sequences, much longer than typical encoder-
decoder architectures used in sequence transduction. We show that this model can
generate fluent, coherent multi-sentence paragraphs and even whole Wikipedia
articles. When given reference documents, we show it can extract relevant factual
information as reflected in perplexity, ROUGE scores and human evaluations.

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Generating Wikipedia by Summarizing Long Sequences

Abstract

Research Areas

Meet the teams driving innovation

Google Ai

Google Cloud

Google DeepMind

Google Labs