A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers
Shen-yun Miao, Chao-Chun Liang, Keh-Yih Su
Resources and Evaluation Short Paper
Session 1B: Jul 6 (06:00-07:00 GMT)
Session 2B: Jul 6 (09:00-10:00 GMT)
Abstract:
We present ASDiv (Academia Sinica Diverse MWP Dataset), a diverse (in terms of both language patterns and problem types) English math word problem (MWP) corpus for evaluating the capability of various MWP solvers. Existing MWP corpora for studying AI progress remain limited either in language usage patterns or in problem types. We thus present a new English MWP corpus with 2,305 MWPs that cover more text patterns and most problem types taught in elementary school. Each MWP is annotated with its problem type and grade level (for indicating the level of difficulty). Furthermore, we propose a metric to measure the lexicon usage diversity of a given MWP corpus, and demonstrate that ASDiv is more diverse than existing corpora. Experiments show that our proposed corpus reflects the true capability of MWP solvers more faithfully.
Similar Papers
Graph-to-Tree Learning for Solving Math Word Problems
Jipeng Zhang, Lei Wang, Roy Ka-Wei Lee, Yi Bin, Yan Wang, Jie Shao, Ee-Peng Lim

A Two-Stage Masked LM Method for Term Set Expansion
Guy Kushilevitz, Shaul Markovitch, Yoav Goldberg

TAG : Type Auxiliary Guiding for Code Comment Generation
Ruichu Cai, Zhihao Liang, Boyan Xu, Zijian Li, Yuexing Hao, Yao Chen

Perturbation Based Learning for Structured NLP Tasks with Application to Dependency Parsing
Amichay Doitch, Ram Yazdi, Tamir Hazan, Roi Reichart
