Interactive Construction of User-Centric Dictionary for Text Analytics

Ryosuke Kohita, Issei Yoshida, Hiroshi Kanayama, Tetsuya Nasukawa


Information Retrieval and Text Mining, Long Paper

Session 1B: Jul 6 (06:00-07:00 GMT)
Session 2A: Jul 6 (08:00-09:00 GMT)
Abstract: We propose a methodology for constructing a term dictionary for text analytics through an interactive process between a human and a machine, which helps analysts create flexible dictionaries with the precise granularity that typical text analysis requires. This paper introduces the first formulation of interactive dictionary construction to address this need. To optimize the interaction, we propose a new algorithm that effectively captures an analyst's intention starting from only a small number of sample terms. Along with the algorithm, we also design an automatic evaluation framework that provides a systematic assessment of any interactive method for the dictionary creation task. Experiments using corpora and dictionaries based on real scenarios show that our algorithm outperforms baseline methods and works even with a small number of interactions.

Similar Papers

Bilingual Dictionary Based Neural Machine Translation without Using Parallel Sentences
Xiangyu Duan, Baijun Ji, Hao Jia, Min Tan, Min Zhang, Boxing Chen, Weihua Luo, Yue Zhang
Topological Sort for Sentence Ordering
Shrimai Prabhumoye, Ruslan Salakhutdinov, Alan W Black
Why Overfitting Isn't Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries
Mozhi Zhang, Yoshinari Fujinuma, Michael J. Paul, Jordan Boyd-Graber