Linguistics 575: Computational Methods in Language Documentation

Spring Quarter, 2015

Course Info

This course considers computational approaches to facilitating endangered language documentation, with a particular focus on the methods developed in the AGGREGATION Project. Term projects will center on extending AGGREGATION's inference so that it can extract additional types of information from interlinear glossed text (IGT), yielding more complete answers to the Grammar Matrix customization system questionnaire.

Prereqs: 566/567 are not required for this class, though they may provide helpful background. Programming is required. (The AGGREGATION codebase is in Python.)

Schedule of Topics and Assignments (still subject to change)

4/1 Introduction, organization
    Reading: Bender et al 2013; Bender et al 2014; AGG Phase II grant proposal (on CommonView)
4/8 Grammar Matrix, MOM
    Reading: Bender et al 2002; Bender et al 2010 (on CommonView); Wax 2014
    Due: KWLA: K and W
4/15 AGG repository (EMB out)
    Due: Check out AGG repository; run scripts
4/22 IGT, Leipzig Glossing Rules, RiPLes/intent, Xigt
    Reading: Bow et al 2003; Bickel et al 2008; Xia & Lewis 2007; Goodman et al ip (on CommonView)
    Due: Initial project proposals
4/29 Discussion: Term projects [prepare slides]
5/6 Related Work
    Reading: See RW reading assignment
5/13 Discussion: Evaluation
5/15 Evaluation plans due
5/20 Discussion
    Due: Hands dirty (i.e. start coding)
5/27 Error Analysis
    Due: Initial working prototype
6/3 Error Analysis
6/5 KWLA papers due
6/11 Final projects due 11pm


Workshops and other resources

IGT/Leipzig Glossing Rules


LinGO Grammar Matrix



