Learner Corpora

This is the first draft of a collection of links to learner corpora. This includes free publicly available corpora, others for pay if they are not already on jones (see JonesServer), and some which are not publicly available but you may contact the author/referent and try to obtain a copy or sample copy for your research.

Corpus

Author/Referent

Links

Notes

Cambridge Learner Corpus part of the Cambridge International Corpus (CIC)

Cambridge University Press and Cambridge ESOL

http://www.cambridge.org/elt/corpus/learner_corpus.htm

Corpus Escrito del Español L2 (CEDEL2)

Cristóbal Lozano

http://www.uam.es/proyectosinv/woslac/cedel2.htm

Corpus parlato di italiano L2

Osservatorio project

http://elearning.unistrapg.it/osservatorio/Corpora.html

English L2 - Hebrew L1 corpus

Tina Waldman

EVA spoken corpus

A. Hasselgren

http://www.hf.ntnu.no/anla/EVAdescription.htm

FRIDA (French Interlanguage Database)

Sylviane Granger

http://www.fltr.ucl.ac.be/fltr/germ/etan/cecl/Cecl-Projects/Frida/fridatext.htm

International Corpus of Learner English (ICLE)

Sylviane Granger

http://cecl.fltr.ucl.ac.be/Cecl-Projects/Icle/icle.htm

ISLE Speech Corpus

ELRA

http://catalog.elra.info/product_info.php?products_id=568

English L2 - Japanese L1 learner corpus

Asao Kojiro

http://www.eng.ritsumei.ac.jp/asao/lcorpus/

JEFLL (Japanese EFL Learner) Corpora

Yukio Tono (Meikai University, JAPAN)

http://leo.meikai.ac.jp/~tono/jefll.html

JPU Corpus

József Horváth

http://joeandco.blogspot.com/

LONGDALE - Longitudinal Database of Learner English

Sylviane Granger

http://cecl.fltr.ucl.ac.be/LONGDALE.html

Longman Learners' Corpus

Longman

http://www.pearsonlongman.com/dictionaries/corpus/learners.html

Louvain International Database of Spoken English Interlanguage (LINDSEI)

Gaëtanelle Gilquin, Claire Hugon, and Sylviane Granger

http://www.fltr.ucl.ac.be/fltr/germ/etan/cecl/Cecl-Projects/Lindsei/lindsei.htm

Multimedia Adult ESOL Learner Corpus

Adult ESOL Lab School

http://www.labschool.pdx.edu/research/methods/maelc/intro.html

PICLE - Polish sub-corpus of ICLE

Przemek Kaszubski

http://www.staff.amu.edu.pl/~przemka/picle.html

Spanish Learner Language Oral Corpus (SPLLOC)

Laura Dominguez

http://www.splloc.soton.ac.uk/

Standard Speaking Test (SST) Corpus

Communication Research Laboratory and ALC Press

http://leo.meikai.ac.jp/~tono/sst/index.html

Thai English Learner Corpus (TELC)

Assumption University, Thailand

http://iele.au.edu/

Tswana Learner English Corpus (TLEC)

Bertus van Rooy

http://ctext.nwu.ac.za/ProductsCorporaTLEC.html

VOICE (Vienna-Oxford International Corpus of English)

Vienna University and supported by Oxford University Press

http://www.univie.ac.at/Anglistik/voice/

other

MICASE Native, nearnative, and non-native

University of Michigan, English Language Institute

http://www.lsa.umich.edu/eli/micase/index.htm

TED Translanguage English Database

European Language Resources Association (ELRA) and the LDC.

http://www.elda.org/catalogue/en/speech/S0031.html http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2002S04 http://www.phonetik.uni-muenchen.de/Forschung/Publications/Lamel_ICSLP94.ps

COMPARA Portuguese-English parallel corpus

http://www.linguateca.pt/COMPARA/Welcome.html

Chinese Learner English Corpus (CLEC)

Shanghai Foreign Language Education Press

http://langbank.engl.polyu.edu.hk/corpus/clec.html

Learner Business Letters Corpus

Someya Yasumasa

http://ysomeya.hp.infoseek.co.jp

The English-Swedish Parallel Corpus

Lund University Department of English

http://www.englund.lu.se/content/view/66/127/

English-Norwegian Parallel Corpus (ENPC)

University of Oslo Department of British and American Studies

http://www.hf.uio.no/ilos/forskning/forskningsprosjekter/enpc/

HKUST(Hong Kong University of Science and Technology) Corpus of Learner English

J. Milton

TELEC Secondary Learner Corpus (TSLC)

TELEC Teachers of English Language Education Center, Department of Curriculum Studies, The University of Hong Kong

no webpages available

Corpus of Young Learner Interlanguage (CYLI)

Vrije Universiteit Brussel

no webpages available

ELFA Corpus

Tampere University

http://www.tay.fi/laitokset/kielet/engf/research/elfa/corpus.htm

Tools

Other sites

Here are some other sites with lists of learner corpora.

Thank you to Elena Cotos at Iowa State whose list of learner corpora led to the creation of this list.

LearnerCorpora (last edited 2008-11-25 18:07:01 by CharlesJochim)