A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers

Shen-yun Miao, Chao-Chun Liang, Keh-Yih Su

Abstract Paper Share

Resources and Evaluation Short Paper

Session 1B: Jul 6 (06:00-07:00 GMT)
Session 2B: Jul 6 (09:00-10:00 GMT)
Abstract: We present ASDiv (Academia Sinica Diverse MWP Dataset), a diverse (in terms of both language patterns and problem types) English math word problem (MWP) corpus for evaluating the capability of various MWP solvers. Existing MWP corpora for studying AI progress remain limited either in language usage patterns or in problem types. We thus present a new English MWP corpus with 2,305 MWPs that cover more text patterns and most problem types taught in elementary school. Each MWP is annotated with its problem type and grade level (for indicating the level of difficulty). Furthermore, we propose a metric to measure the lexicon usage diversity of a given MWP corpus, and demonstrate that ASDiv is more diverse than existing corpora. Experiments show that our proposed corpus reflects the true capability of MWP solvers more faithfully.
You can open the pre-recorded video in a separate window.
NOTE: The SlidesLive video may display a random order of the authors. The correct author list is shown at the top of this webpage.

Similar Papers

Graph-to-Tree Learning for Solving Math Word Problems
Jipeng Zhang, Lei Wang, Roy Ka-Wei Lee, Yi Bin, Yan Wang, Jie Shao, Ee-Peng Lim,
A representative figure from paper main.362
A Two-Stage Masked LM Method for Term Set Expansion
Guy Kushilevitz, Shaul Markovitch, Yoav Goldberg,
A representative figure from paper main.610
TAG : Type Auxiliary Guiding for Code Comment Generation
Ruichu Cai, Zhihao Liang, Boyan Xu, zijian li, Yuexing Hao, Yao Chen,
A representative figure from paper main.27