The ODIN Data
ODIN stands for the Online Database of Interlinear Text. It is a collection of interlinear glossed text (IGT) instances extracted from linguistic documents on the Web.
If you use ODIN in your study, please cite the following papers:
- William D. Lewis and Fei Xia, 2010. Developing ODIN: A Multilingual Repository of Annotated Language Data for Hundreds of the World's Languages. Journal of Literary and Linguistic Computing (LLC), 25(3):303-319.
- Fei Xia, William D. Lewis, Michael W. Goodman, Joshua Crowgey, and Emily M. Bender, 2014. Enriching ODIN. In Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), Reykjavik, Iceland.
- Version 1: The ODIN database was first released on the LINGUIST List website. That website is still available and provides a GUI search interface. We will update it periodically.
- Version 2: The database is available for download and includes the IGT instances in plain text format and in the Xigt format.
modified on 7/25/2014