Focusing on applications for analyzing learner language which evaluate semantic appropriateness and accuracy, we build from previous work which modeled some aspects of interaction, namely a picture description task (PDT), with the goal of integrating a spelling correction component in this context. After parsing a sentence and extracting semantic relations, a surprising number of analysis failures stem from misspellings, deviating from expected input in ways that can be modeled when the content of the interaction is known. We thus explore the use of spelling correction tools and language modeling to correct misspellings that often lead to errors
in obtaining semantic forms, and we show that such tools can significantly reduce the number of unanalyzable cases. The work is useful for any context where image descriptions or some expected content is available, but not necessarily expected linguistic forms.
Picture description task; semantic analysis; spelling correction; language modeling
Atkinson, K. (1998). GNU Aspell. http://aspell.net.
Clarkson, P. and Rosenfeld, R. (1997). Statistical language modeling using the CMU-Cambridge Toolkit. In Eurospeech, volume 97, pages 2707–2710.
Dale, R., Anisimoff, I., and Narroway, G. (2012). HOO 2012: A report on the preposition and
determiner error correction shared task. In Proceedings of the Seventh Workshop on Building
Educational Applications Using NLP, pages 54–62, Montréal.
de Marneffe, M.-C., MacCartney, B., and Manning, C. D. (2006). Generating typed dependency
parses from phrase structure parses. In Proceedings of LREC 2006, Genoa, Italy.
DeSmedt, W. (1995). Herr Kommissar: An ICALL conversation simulator for intermediate
German. In Holland, V. M., Kaplan, J., and Sams, M., editors, Intelligent Language Tutors: Theory Shaping Technology, pages 153–174. Lawrence Erlbaum, Mahwah, NJ.
Ellis, R. (2000). Task-based research and language pedagogy. Language Teaching Research,
4(3):193–220.
Fellbaum, C., editor (1998). WordNet: An Electronic Lexical Database. The MIT Press, Cambridge,
MA.
Flor, M. (2012). Four types of context for automatic spelling correction. TAL, 53(2):61–99.
Flor, M. and Futagi, Y. (2012). On using context for automatic correction of non-word
misspellings in student essays. In Proceedings of the Seventh Workshop on Building Educational
Applications Using NLP, pages 105–115. Association for Computational Linguistics.
Flor, M., Futagi, Y., Lopez, M., and Mulholland, M. (2013). Patterns of misspellings in L2 and
L1 English: A view from the ETS Spelling Corpus. In Proceedings of the Second Learner Corpus
Research Conference (LCR 2013).
Forbes-McKay, K. and Venneri, A. (2005). Detecting subtle spontaneous language decline in
early Alzheimer’s disease with a picture description task. Neurological Sciences, 26(4):243–254.
Graff, D., Kong, J., Chen, K., and Maeda, K. (2007). English Gigaword, Third Edition.
Hahn, M. and Meurers, D. (2012). Evaluating the meaning of answers to reading comprehension
questions: A semantics-based approach. In Proceedings of the 7th Workshop on Innovative
Use of NLP for Building Educational Applications (BEA7), pages 326–336, Montreal, Canada. Association for Computational Linguistics.
Heift, T. and Schulze, M. (2007). Errors and Intelligence in Computer-Assisted Language Learning:
Parsers and Pedagogues. Routledge.
Hovermale, D. (2008). SCALE: Spelling Correction Adapted for Learners of English. Pre-CALICO Workshop on “Automatic Analysis of Learner Language: Bridging Foreign Language
Teaching Needs and NLP Possibilities”. March 18-19, 2008. San Francisco, CA.
Hovermale, D. (2010). An analysis of the spelling errors of L2 English learners. In CALICO
2010 Conference, Amherst, MA, USA.
King, L. and Dickinson, M. (2013). Shallow semantic analysis of interactive learner sentences. In Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications, pages 11–21, Atlanta, Georgia.
Klein, D. and Manning, C. D. (2003). Accurate unlexicalized parsing. In Proceedings of ACL-03, Sapporo, Japan.
Lachowicz, D. (2003). Enchant. http://abisource.com/projects/enchant.
Leacock, C., Chodorow, M., Gamon, M., and Tetreault, J. (2010). Automated Grammatical Error Detection for Language Learners. Synthesis Lectures on Human Language Technologies. Morgan Claypool.
Manning, C. D., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S. J., and McClosky, D. (2014). The Stanford CoreNLP natural language processing toolkit. In Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 55–60,
Baltimore, Maryland. Association for Computational Linguistics.
Meurers, D. (2012). Natural language processing and language learning. In Chapelle, C. A., editor, Encyclopedia of Applied Linguistics. Blackwell.
Meurers, D., Ziai, R., Ott, N., and Bailey, S. (2011). Integrating parallel analysis modules to evaluate the meaning of answers to reading comprehension questions. Special Issue on Free-text Automatic Evaluation. International Journal of Continuing Engineering Education and Life-Long Learning (IJCEELL), 21(4):355–369.
Petersen, K. A. (2010). Implicit Corrective Feedback in Computer-Guided Interaction: Does Mode Matter? PhD thesis, Georgetown University, Washington, DC.
Somasundaran, S. and Chodorow, M. (2014). Automated measures of specific vocabulary knowledge from constructed responses (‘Use these words to write a sentence based on this picture’). In Proceedings of the Ninth Workshop on Innovative Use of NLP for Building Educational Applications, pages 1–11, Baltimore, Maryland.
Sproat, R., Black, A. W., Chen, S., Kumar, S., Ostendorf, M., and Richards, C. (2001). Normalization of non-standard words. Computer Speech & Language, 15(3):287–333.