William D. Lewis

Publications and other papers of interest:

Please note: I am no longer actively updating this page. For my most recent publications, please see my site at http://research.microsoft.com/en-us/people/wilewis/.

Xia, F., Lewis, W. D., and Poon, H. (2009), ‘Language ID in the Context of Harvesting Language Data off the Web’, in Proceedings of The 12th Conference of the European Chapter of the Association of Computational Linguistics (EACL), Athens, Greece, March 2009. http://faculty.washington.edu/wlewis2/papers/EACL-XLP-2009.pdf

Lewis, W. D. & Xia, F. (2009), ‘Parsing, Projecting & Prototypes: Repurposing Linguistic Data on the Web’, in Proceedings of The 12th Conference of the European Chapter of the Association of Computational Linguistics (EACL), Athens, Greece, March 2009. http://faculty.washington.edu/wlewis2/papers/Lewis-Xia-EACL-2009.pdf

Xia, F. & Lewis, W. D. (2009), ‘Applying NLP Technologies to the Collection and Enrichment of Language Data on the Web to Aid Linguistic Research’, in Proceedings of The 12th Conference of the European Chapter of the Association of Computational Linguistics (EACL), Athens, Greece, March 2009. http://faculty.washington.edu/wlewis2/papers/xia-lewis-eacl-2009.pdf

Lewis, W. D. & Xia, F. (2008), ‘Automatically Identifying Computationally Relevant Typological Features’, in Proceedings of The Third International Joint Conference on Natural Language Processing (IJCNLP). Hyderabad, January 2008. http://faculty.washington.edu/wlewis2/papers/LewisXia-ijcnlp08t-06.pdf

Xia, F. & Lewis, W. D. (2008), ‘Repurposing Theoretical Linguistic Data for Tool Development and Search’, in Proceedings of The Third International Joint Conference on Natural Language Processing (IJCNLP). Hyderabad, January 2008. http://faculty.washington.edu/wlewis2/papers/XiaLewis-ijcnlp08d-12.pdf

Xia, F. & Lewis, W. D. (2007), ‘Multilingual Structural Projection across Interlinearized Text’, in The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007), Rochester, NY, April 22-27, 2007. http://faculty.washington.edu/wlewis2/papers/xl-naacl07-16.pdf

Lewis, W. D. (2006), ODIN: A Model for Adapting and Enriching Legacy Infrastructure, in ‘Proceedings of the e-Humanities Workshop, held in cooperation with e-Science 2006: 2nd IEEE International Conference on e-Science and Grid Computing’, Amsterdam. http://faculty.washington.edu/wlewis2/papers/ODIN-eH06.pdf

Farrar, S. O. & Lewis, W. D. (2006), ‘The GOLD Community of Practice: An Infrastructure for Linguistic Data on the Web’, Language Resources and Evaluation. http://faculty.washington.edu/wlewis2/papers/FarLew06.pdf

Jinguji, D., W. D. Lewis, E. N. Efthimiadis, J. Minor, A. Bertram, S. Eggers, J. Johanson, B. Nisonger, P. Yu, and Z. Zhou (to appear), The University of Washington’s UWCLMAQA System, in ‘Proceedings of the Text Retrieval Conference (TREC) 2006’, Gaithersburg, Maryland. http://faculty.washington.edu/wlewis2/papers/TREC06-final.pdf

Lewis, W. D. Xia, F. & Jinguji, D. (2006), ‘Enriching Language Data through Projected Structures’, in , The Workshop on Computational Linguistics for Less-studied Languages, organized by Texas Linguistics Society (TLSX), Austin, Texas, Nov 3-5, 2006. http://faculty.washington.edu/wlewis2/papers/LewisXiaJinguji06.pdf

Lewis, W. D., Farrar, S. & Langendoen, D. T. (2006), Linguistics in the Internet age: Tools and Fair Use, in ‘Proceedings of the EMELD06 Workshop on Digital Language Documentation: Tools and Standards: The State of the Art’, Lansing, MI.

Gerken, L., Wilson, R. & Lewis, W. D. (2005), ‘17 month olds can use distributional cues to form syntactic categories’, Journal of Child Language 32, 249–268.

Simons, G. F., Fitzsimons, B., Langendoen, D. T., Lewis, W. D., Farrar, S. O., Lanham, A., Basham, R. & Gonzalez, H. (2004), A model for interoperability: XML documents as an RDF database, in ‘Proceedings of the EMELD Workshop on Databases’, Detroit, MI.

http://faculty.washington.edu/wlewis2/papers/Sim-etal04a.pdf

 

Simons, G. F., Lewis, W. D., Farrar, S. O., Langendoen, D. T., Fitzsimons, B. & Gonzalez, H. (2004), The semantics of markup: Mapping legacy markup schemas to a common semantics, in ‘Proceedings of the 4th workshop on NLP and XML (NLPXML2004): held in cooperation with ACL04’, Barcelona, Spain, pp. 25–32.

http://faculty.washington.edu/wlewis2/papers/Sim-etal04b.pdf

Vigliocco, G., Vinson, D., Lewis, W. D. & Garrett, M. (2004), ‘Representing the meaning of object and action words: The featural and unitary semantic space (FUSS) hypothesis’, Cognitive Psychology 48(4), 422–488.

Lewis, W. D. (2003), Mining and migrating interlinear glossed text, in ‘Proceedings of the EMELD Workshop on Digitizing and Annotating Texts and Field Recordings’, East Lansing, MI.

http://emeld.org/workshop/2003/Lewis-paper.pdf

 

Farrar, S., Lewis, W. D., and Langendoen, D. T.  (2002a)  A common ontology for linguistic concepts, in 'Proceedings of the Knowledge Technologies Conference', Seattle, WA.

http://faculty.washington.edu/wlewis2/papers/FarLewLang02a.pdf

 

Farrar, S., Lewis, W. D. & Langendoen, D. T. (2002b), An ontology for linguistic annotation, in ‘Semantic Web Meets Language Resources: Papers from the AAAI Workshop, Technical Report WS0216’, AAAI Press, Menlo Park, CA, pp. 11–19.

http://faculty.washington.edu/wlewis2/papers/FarLewLang02b.pdf

 

Langendoen, D. T., Farrar, S. & Lewis, W. D. (2002), Bridging the markup gap: smart search engines for language researchers, in ‘Proceedings of the International Workshop on Resources and Tools in Field Linguistics’, Las Palmas, Gran Canaria, Spain.

http://faculty.washington.edu/wlewis2/papers/LangFarLew02.pdf

Lewis, W. D., Farrar, S. & Langendoen, D. T. (2001), Building a knowledge base of morphosyntactic terminology, in ‘Proceedings of the IRCS Workshop on Linguistic Databases’, University of Pennsylvania, pp. 150–156. http://www.ldc.upenn.edu/annotation/database/papers/Langendoen etal/24.2.langendoen.pdf