The International Corpus of Crosslinguistic Interlanguage (ICCI)

 

About The Project

The project of International Corpus of Crosslinguistic Interlanguage (ICCI) is an international joint project of learner corpus initiated by Dr. Yukio Tono from Tokyo University of Foreign Studies(TUFS), Japan, in 2007 and started in 2008. Its aim is to compile corpora of young learners of English across different proficiency levels and L1 backgrounds in the world. There are currently 10 scholars from 8 countries/regions (Hong Kong, Germany, Israel, Japan, Poland, Singapore, Spain, and Taiwan) actively contributed to this project.

ICCI is one of the research projects within the framework of the G-COE program. The Global COE Program is a 5-year government-funded project for promoting research as the centre of excellence in specialized fields. Tokyo University of Foreign Studies (TUFS) has been granted as one of such leading research institutes in the field of linguistics and language education. The special theme for this COE program for TUFS is called "corpus-based linguistics and language education", in which three major disciplines, namely, field linguistics, corpus linguistics, and language education are to be closely linked to each other to train researchers who are competent in doing research on various aspects of language with the integrated perspectives of the above three fields.

Project Director:

Yukio Tono, Tokyo University of Foreign Studies(TUFS), Japan.

Members:

 

About The Data

Here is a list of transcripts to share among the the team members:

  1. Transcripts: 561 raw transcripts (updated on 19/01/2009)
  2. Plain Texts: 524 valid texts (unicode and ready for WordSmith, updated on 19/01/2009)
  3. XML (with POS): 524 XML files (updated on 19/01/2009)

Online Query

Click this link to get access to the the web-based query and Xaira-based query.