Break It Down: A Question Understanding Benchmark

Tomer Wolfson; Mor Geva; Ankit Gupta; Yoav Goldberg; Matt Gardner; Daniel Deutch; Jonathan Berant

Break It Down: A Question Understanding Benchmark

Tomer Wolfson, Mor Geva, Ankit Gupta, Yoav Goldberg, Matt Gardner, Daniel Deutch, Jonathan Berant

Abstract Paper Share

Question Answering TACL Paper

Session 8A: Jul 7 (12:00-13:00 GMT)

Session 9B: Jul 7 (18:00-19:00 GMT)

Abstract: Understanding natural language questions entails the ability to break down a question into the requisite steps for computing its answer. In this work, we introduce a Question Decomposition Meaning Representation (QDMR) for questions. QDMR constitutes the ordered list of steps, expressed through natural language, that are necessary for answering a question. We develop a crowdsourcing pipeline, showing that quality QDMRs can be annotated at scale, and release the Break dataset, containing over 83K pairs of questions and their QDMRs. We demonstrate the utility of QDMR by showing that (a) it can be used to improve open-domain question answering on the HotpotQA dataset, (b) it can be deterministically converted to a pseudo-SQL formal language, which can alleviate annotation in semantic parsing applications. Last, we use Break to train a sequence-to-sequence model with copying that parses questions into QDMR structures, and show that it substantially outperforms several natural baselines.

You can open the pre-recorded video in a separate window.

NOTE: The SlidesLive video may display a random order of the authors. The correct author list is shown at the top of this webpage.

Break It Down: A Question Understanding Benchmark

Tomer Wolfson, Mor Geva, Ankit Gupta, Yoav Goldberg, Matt Gardner, Daniel Deutch, Jonathan Berant

Similar Papers

Syn-QG: Syntactic and Shallow Semantic Rules for Question Generation

Kaustubh Dhole, Christopher D. Manning,

Crossing Variational Autoencoders for Answer Retrieval

Wenhao Yu, Lingfei Wu, Qingkai Zeng, Shu Tao, Yu Deng, Meng Jiang,

What Question Answering can Learn from Trivia Nerds

Jordan Boyd-Graber, Benjamin Börschinger,

Learning to Ask More: Semi-Autoregressive Sequential Question Generation under Dual-Graph Interaction

Zi Chai, Xiaojun Wan,