This paper introduces an Intelligent Computer Assisted Language Learning system designed to provide reading input for language learners based on the syntactic complexity of their language production. The system analyzes the linguistic complexity of texts produced by the user and of texts in a pedagogic target language corpus to identify texts that are well-suited to foster acquisition. These texts provide developmental benchmarks offering an individually tailored language challenge, making ideas such as Krashen’s i+1 or Vygotsky’s Zone of Proximal Development concrete and empirically explorable in terms of a broad range of complexity measures in all dimensions of linguistic modeling.
Jan Aarts and Sylviane Granger. 1998. Tag sequences in learner corpora: a key to interlanguage grammar and discourse. In Sylviane Granger, editor, Learner English on Computer, pages 132–141. Longman, London; New York.
Haiyang Ai and Xiaofei Lu. 2013. A corpus-based comparison of syntactic complexity in NNS and NS university students’ writing. In Ana Daz-Negrillo, Nicolas Ballier, and Paul Thompson, editors, Automatic Treatment and Analysis of Learner Corpus Data, pages 249–264. John Benjamins.
Theodora Alexopoulou, Marije Michel, Akira Murakami, and Detmar Meurers. 2017. Analyzing learner language in task contexts: A study case of task-based performance in EFCAMDAT. Language Learning. Special Issue on “Language learning research at the intersection of experimental, corpusbased and computational methods: Evidence and interpretation”.
Kathleen Bardovi-Harlig and Theodora Bofman. 1989. Attainment of syntactic and morphological accuracy by advanced language learners. Studies in Second Language Acquisition, 11(1):17–34.
Bram Bult´e and Alex Housen. 2012. Defining and operationalizing l2 complexity. In Alex Housen, Folkert Kuiken, and Ineke Vedder, editors, Dimensions of L2 Performance and Proficiency, pages 21–46. John Benjamins.
Xiaobin Chen and Detmar Meurers. 2016. CTAP: A web-based tool supporting automatic complexity analysis. In Proceedings of the Workshop on Computational Linguistics for Linguistic Complexity, Osaka, Japan, December. The International Committee on Computational Linguistics.
Maria Chinkina and Detmar Meurers. 2016. Linguistically-aware information retrieval: Providing input enrichment for second language learners. In Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications, pages 188–198, San Diego, CA.
Rod Ellis and Fangyuan Yuan. 2004. The effects of planning on fluency, complexity, and accuracy in second language narrative writing. Studies in Second Language Acquisition, 26(1):59–84.
Rod Ellis. 2003. Task-based Language Learning and Teaching. Oxford University Press, Oxford, UK.
Dana R. Ferris. 1994. Lexical and syntactic features of esl writing by students at different levels of l2 proficiency. TESOL Quarterly, 28(2):414–420.
Pauline Foster and Peter Skehan. 1996. The influence of planning and task type on second language performance. Studies in Second Language Acquisition, 18(3):299–323.
Sylviane Granger and Paul Rayson. 1998. Automatic profiling of learner texts. In Sylviane Granger, editor, Learner English on Computer, pages 119–131. Longman, New York.
Leslie Grant and April Ginther. 2000. Using computer-tagged linguistic features to describe L2 writing differences. Journal of Second Language Writing, 9(2):123–145.
Kathryn Henry. 1996. Early l2 writing development: A study of autobiographical essays by universitylevel students of russian. The Modern Language Journal, 80(3):309–326.
Alex Housen, Folkert Kuiken, Jane Zuengler, and Ken Hyland. 2009. Complexity, accuracy and fluency in second language acquisition. Applied Linguistics, 30(4):461–473.
Susan Hunston. 2002. Corpora in Applied Linguistics. Cambridge University Press.
Sandra Ishikawa. 1995. Objective measurement of low-proficiency efl narrative writing. Journal of Second Language Writing, 4(1):51 – 69.
Scott Jarvis. 2013. Capturing the diversity in lexical diversity. Language Learning, 63:87–106.
Judit Kormos. 2011. Task complexity and linguistic and discourse features of narrative writing performance. Journal of Second Language Writing, 20(2):148 – 161.
Stephen D. Krashen. 1981. The fundamental pedagogical principle in second language teaching. Studia Linguistica, 35(1–2):50–70, December.
Kristopher Kyle and Scott A Crossley. 2015. Automatically assessing lexical sophistication: Indices, tools, findings, and application. TESOL Quarterly, 49(4):757–786.
Diane Larsen-Freeman. 1978. An ESL index of development. TESOL Quarterly, 12(4):439–448.
Diane Larsen-Freeman. 2006. The emergence of complexity, fluency, and accuracy in the oral and written production of five chinese learners of english. Applied Linguistics, 27(4):590–619.
Roger Levy and Galen Andrew. 2006. Tregex and tsurgeon: tools for querying and manipulating tree data structures. In 5th International Conference on Language Resources and Evaluation, Genoa, Italy.
Lexile. 2007. The Lexile FrameworkR for reading: Theoretical framework and development. Technical report, MetaMetrics, Inc., Durham, NC.
Xiaofei Lu and Haiyang Ai. 2015. Syntactic complexity in college-level English writing: Differences among writers with diverse l1 backgrounds. Journal of Second Language Writing, -:in press.
Xiaofei Lu. 2010. Automatic analysis of syntactic complexity in second language writing. International Journal of Corpus Linguistics, 15(4):474–496.
Xiaofei Lu. 2011. A corpus-based evaluation of syntactic complexity measures as indices of collegelevel esl writers’ language development. TESOL Quarterly, 45(1):36–62, March.
Christopher D. Manning, Mihai Surdeanu, John Bauer, Jenny Finkel, Steven J. Bethard, and David Mc-Closky. 2014. The Stanford CoreNLP natural language processing toolkit. In Association for Computational Linguistics (ACL) System Demonstrations, pages 55–60.
Diana Mazgutova and Judit Kormos. 2015. Syntactic and lexical development in an intensive english for academic purposes programme. Journal of Second Language Writing, 29:3–15.
Danielle A. McNamara, Arthur C. Graesser, Philip M. McCarthy, and Zhiqiang Cai. 2014. Automated evaluation of text and discourse with Coh-Metrix. Cambridge University Press, Cambridge, M.A.
Fanny Meunier and C´eline Gouverneur. 2009. New types of corpora for new educational challenges: Collecting, annotating and exploiting a corpus of textbook material. In Karin Aijmer, editor, Corpora and Language Teaching, pages 179–201. John Benjamins, Amsterdam.
Detmar Meurers and Markus Dickinson. 2017. Evidence and interpretation in language learning research: Opportunities for collaboration with computational linguistics. Language Learning, 67(2). http://dx.doi.org/10.1111/lang.12233.
Akira Murakami. 2013. Individual Variation and the Role of L1 in the L2 Development of English Grammatical Morphemes: Insights From Learner Corpora. Ph.D. thesis, University of Cambridge.
Akira Murakami. 2016. Modeling systematicity and individuality in nonlinear second language development: The case of English grammatical morphemes. Language Learning, 6(4):834–871.
Jessica Nelson, Charles Perfetti, David Liben, and Meredith Liben. 2012. Measures of text difficulty: Testing their predictive value for grade levels and student performance. Technical report, The Council of Chief State School Officers.
Catrin Norrby and Gisela H°akansson. 2007. The interaction of complexity and grammatical processability: The case of Swedish as a foreign language. International Review of Applied Linguistics in Language Teaching, 45:45–68.
Lourdes Ortega and C. Sinicrope. 2008. Novice proficiency in a foreign language: A study of task-based performance profiling on the STAMP test. Technical report, Center for Applied Second Language Studies, University of Oregon.
Lourdes Ortega. 2000. Understanding syntactic complexity: The measurement of change in the syntax of instructed L2 Spanish learners. Unpublished doctoral dissertation, University of Hawaii, Manoa, HI.
Lourdes Ortega. 2003. Syntactic complexity measures and their relationship to L2 proficiency: A research synthesis of college-level L2 writing. Applied Linguistics, 24(4):492–518.
Lourdes Ortega. 2012. A construct in search of theoretical renewal. In B. Szmrecsanyi and B. Kortmann, editors, Linguistic complexity: Second language acquisition, indigenization, contact, pages 127–155. de Gruyter, Berlin.
Lourdes Ortega. 2015. Syntactic complexity in l2 writing: Progress and expansion. Journal of Second Language Writing, 29:82–94.
Joy Reid. 1992. A computer text analysis of four cohesion devices in english discourse by native and nonnative writers. Journal of Second Language Writing, 1(2):79–107.
Norbert Schmitt and Stephen Redwood. 2011. Learner knowledge of phrasal verbs: A corpus-informed study. In Fanny Meunier, Sylvie De Cock, Gaëetanelle Gilquin, and Magali Paquot, editors, A Taste for Corpora. In Honour of Sylviane Granger, pages 173–207. John Benjamins Publishing Company, Amsterdam.
Peter Skehan. 1989. Individual Differences in Second Language Learning. Edward Arnold.
Peter Skehan. 2009. Modelling second language performance: Integrating complexity, accuracy, fluency, and lexis. Applied Linguistics, 30(4):510–532.
Susana M. Sotillo. 2000. Discourse functions and syntactic complexity in synchronous and asynchronous communication. Language Learning & Technology, 4(1):82–119.
Naoko Taguchi, William Crawford, and Danielle Zawodny Wetzel. 2013. What linguistic features are indicative of writing quality? a case of argumentative essays in a college composition program. TESOL Quarterly, 47(2):420–430.
Yukio Tono. 2004. Multiple comparisons of IL, L1 and TL corpora: the case of L2 acquisition of verb subcategorization patterns by Japanese learners of English. In Guy Aston, Silvia Bernardini, and Dominic Stewart, editors, Corpora and Language Learners, pages 45–66. John Benjamins.
Sowmya Vajjala and Detmar Meurers. 2012. On improving the accuracy of readability classification using insights from second language acquisition. In Joel Tetreault, Jill Burstein, and Claudial Leacock, editors, In Proceedings of the 7th Workshop on Innovative Use of NLP for Building Educational Applications, pages 163–173, Montral, Canada, June. Association for Computational Linguistics.
Marjolijn Verspoor, Wander Lowie, and Marijn Van Dijk. 2008. Variability in second language development from a dynamic systems perspective. The Modern Language Journal, 92(2):214–231.
Marjolijn Verspoor, Monika S. Schmid, and Xiaoyan Xu. 2012. A dynamic usage based perspective on L2 writing. Journal of Second Language Writing, 21(3):239–263.
Nina Vyatkina, Hagen Hirschmann, and Felix Golcher. 2015. Syntactic modification at early stages of L2 german writing development: A longitudinal learner corpus study. Journal of Second Language Writing, 29:28–50.
Nina Vyatkina. 2013. Specific syntactic complexity: Developmental profiling of individuals based on an annotated learner corpus. The Modern Language Journal, 97(S1):11–30.
Lev Semenovich Vygotsky. 1986. Thought and Language. MIT Press, Cambridge, MA.
Chuming Wang and Min Wang. 2015. Effect of alignment on l2 written production. Applied Linguistics, 36(5).
Denise Paige Way, Elizabeth G. Joiner, and Michael A. Seaman. 2000. Writing in the secondary foreign language classroom: The effects of prompts and tasks on novice learners of french. The Modern Language Journal, 84(2):171–184.
Kate Wolfe-Quintero, Shunji Inagaki, and Hae-Young Kim. 1998. Second Language Development in Writing: Measures of Fluency, Accuracy & Complexity. Second Language Teaching & Curriculum Center, University of Hawaii at Manoa, Honolulu.
Weiwei Yang, Xiaofei Lu, and Sara Cushing Weigle. 2015. Different topics, different discourse: Relationships among writing topic, measures of syntactic complexity, and judgments of writing quality. Journal of Second Language Writing, 28:53–67.
Hua Flora Zhong. 2016. The relationship between receptive and productive vocabulary knowledge: a perspective from vocabulary use in sentence writing. The Language Learning Journal, Advanced Access.