ODIN

The ODIN Data

Overview

ODIN stands for the Online Database of Interlinear Text. It is a collection of interlinear glossed text (IGT) instances extracted from linguistic documents on the Web.

Citation

If you use ODIN in your study, please cite the following papers:

William D. Lewis and Fei Xia, 2010. Developing ODIN: A Multilingual Repository of Annotated Language Data for Hundreds of the World's Languages, Journal of Literary and Linguistic Computing (LLC), 25(3):303-319. [pdf]
Fei Xia, William D. Lewis, Michael W. Goodman, Joshua Crowgey, and Emily M. Bender, 2014. Enriching ODIN, in Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), Reykjavik, Iceland. [pdf]

Download ODIN

Version 1: The ODIN database was first released at the linguistlist website. You can still access this website for a GUI search interface. We will update that website periodically.
Version 2: Click here to download the database, which includes the IGT instances in the plain text format and the Xigt format.

Last modified on 7/25/2014