The ODIN Data
Overview
ODIN stands for the Online Database of Interlinear Text. It is a collection of interlinear glossed text (IGT) instances extracted from linguistic documents on the Web.
Citation
If you use ODIN in your study, please cite the following papers:
- William D. Lewis and Fei Xia, 2010. Developing ODIN: A Multilingual Repository of Annotated Language Data for Hundreds of the World's Languages, Journal of Literary and Linguistic Computing (LLC), 25(3):303-319.
- Fei Xia, William D. Lewis, Michael W. Goodman, Joshua Crowgey, and Emily M. Bender, 2014. Enriching ODIN, in Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), Reykjavik, Iceland.
Download ODIN
- Version 1: The ODIN database was first released on the LINGUIST List website. That website is still available and provides a GUI search interface. We will update it periodically.
- Version 2: The database is available for download and includes the IGT instances in both plain text format and Xigt format.
Last modified on 7/25/2014