Publications (by research area)
1. Bridging NLP and Linguistics (The RiPLes and the AGGREGATION projects)
- Workshops:
- Fei Xia, William Lewis, and Lori Levin (eds), 2010.
Proceedings of the ACL Workshop on NLP and Linguistics: Find the Common Ground, in conjunction with ACL/EACL 2010. (Sponsored by NSF)
[workshop url]
- Developing ODIN:
- William Lewis and Fei Xia, 2010.
Developing ODIN: A Multilingual Repository of Annotated Language Data for Hundreds of the World's Languages,
Journal of Literary and Linguistic Computing (LLC), 25(3):303-319.
- Fei Xia, Carrie Lewis and William D. Lewis, 2010.
The Problems of Language Identification within Hugely Multilingual Data Sets,
Proceedings of the 7th International Conference on Language Resources and
Evaluation (LREC 2010), pages 2790-2797, Valletta, Malta, May 19-21, 2010.
- Fei Xia, Carrie Lewis, and William Lewis, 2010.
Language ID for a Thousand Languages, eLanguage,
LSA Annual Meeting Extended Abstracts, Baltimore, Maryland, Jan 7-10, 2010.
- Fei Xia, William Lewis and Hoifung Poon, 2009.
Language ID in the Context of Harvesting Language Data off the Web,
Proceedings of the 12th Conference of the European Chapter of the ACL
(EACL-2009), pages 870-878, Athens, Greece, March 30 - April 3, 2009.
- William Lewis and Fei Xia, 2009.
Parsing, Projecting and Prototypes: Repurposing Linguistic Data on the Web,
Proceedings of the EACL 2009 Demonstrations Session, pages 41-44, Athens, Greece, March 30 - April 3, 2009.
- Fei Xia and William Lewis, 2008.
Repurposing Theoretical Linguistic Data for Tool Development and Search,
Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP-2008), pages 529-536, Hyderabad, India, Jan 7-12, 2008.
- Building language profiles and comparing languages:
- Emily M. Bender, Joshua Crowgey, Michael Wayne Goodman, and Fei Xia, 2014. Learning Grammar Specifications from IGT: A Case Study of Chintang, to appear in Proceedings of the Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL), in conjunction with ACL 2014, June 26, Baltimore, Maryland, USA.
- Emily M. Bender, Michael Wayne Goodman, Joshua Crowgey, and Fei Xia, 2013. Towards Creating Precision Grammars from Interlinear Glossed Text: Inferring Large-scale Typological Properties, in Proceedings of the 7th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH 2013), in conjunction with ACL 2013, Sofia, Bulgaria.
- Emily M. Bender, Fei Xia, Joshua Crowgey, and Michael Wayne Goodman, 2013. "Towards Automatic Detection of Morphosyntactic Systems from IGT", in Proceedings of the Workshop on Exploring Data from Language Documentation.
- Ryan Georgi, Fei Xia, and Will Lewis, 2010.
Comparing Language Similarity across Genetic and Typologically-Based Groupings,
Proceedings of the 23rd International Conference on Computational Linguistics
(COLING 2010), pages 385-393, Beijing, China, August 23-27, 2010.
- Fei Xia and William Lewis, 2009.
Applying NLP Technologies to the Collection and Enrichment of Language Data on the Web to Aid Linguistic Research,
Proceedings of the EACL 2009 Workshop on Language Technology and Resources for
Cultural Heritage, Social Sciences, Humanities, and Education (LaTeCH-SHELT&R 2009), pages 51-59, Athens, Greece, 30 March 2009.
- William Lewis and Fei Xia, 2008.
Automatically Identifying Computationally Relevant Typological Features,
Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP-2008), pages 685-690, Hyderabad, India, Jan 7-12, 2008.
- Structural projection and improving POS tagging and parsing performance:
- Ryan Georgi, Fei Xia, and William D. Lewis, 2015. "Enriching Interlinear Text using Automatically Constructed Annotators", in Proceedings of the 9th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH-2015), in conjunction with ACL 2015, July 30, Beijing, China.
- Ryan Georgi, Fei Xia, and William D. Lewis, 2014. Capturing Divergence in Dependency Trees to Improve Syntactic Projection, Journal of Language Resources and Evaluation (LRE), 48(4), pp 709-739. [eprint]
- Fei Xia, William Lewis, Michael Wayne Goodman, Joshua Crowgey and Emily M. Bender, 2014. Enriching ODIN, in Proceedings of LREC 2014, Reykjavik, Iceland.
- Xuezhe Ma and Fei Xia, 2014. Unsupervised Dependency Parsing with Transferring Distribution via Parallel Guidance and Entropy Regularization, in Proceedings of ACL-2014, Baltimore, MD.
- Ryan Georgi, Fei Xia, and William D. Lewis, 2013. Enhanced and Portable Dependency Projection Algorithms Using Interlinear Glossed Text, short paper, in Proceedings of ACL, Sofia, Bulgaria, Aug 2013.
- Ryan Georgi, Fei Xia, and William D. Lewis, 2012. Improving Dependency Parsing with Interlinear Glossed Text and Syntactic Projection, short paper, in Proceedings of COLING, Mumbai, India, Dec 2012.
- Ryan Georgi, Fei Xia, and William D. Lewis. 2012.
Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms,
In Proceedings of LREC, Istanbul, Turkey, May 22-25, 2012.
- Fei Xia and William Lewis, 2007.
Multilingual Structural Projection across Interlinearized Text,
Proceedings of NAACL HLT 2007, pages 452-459, Rochester, NY, April 22-27, 2007.
- William Lewis, Fei Xia, and Daniel Jinguji, 2007.
Projecting structure onto data for resource-poor and endangered languages,
LSA Annual Meeting, Anaheim, CA, 4-7 January 2007.
- William Lewis, Fei Xia, and Dan Jinguji, 2006.
Enriching Language Data through Projected Structures,
Proceedings of the Workshop on Computational Linguistics for Less-studied Languages, Texas Linguistics Society 10 (TLSX), pages 85-98, Austin, Texas, Nov 3-5, 2006.
- Tools and packages:
- Ryan Georgi, Michael Wayne Goodman, and Fei Xia, 2016. "A Web-framework for ODIN Annotation", in Proceedings of ACL-2016 System Demonstrations, pp 31-36, Aug 7-10, Berlin, Germany.
- Fei Xia, William D. Lewis, Michael W. Goodman, Glenn Slayden, Ryan Georgi, Joshua Crowgey, and Emily Bender, 2016. "Enriching a Massively Multilingual Database of Interlinear Glossed Text", Journal of Language Resources and Evaluation (LRE), 50(2): 321-349.
- Michael Wayne Goodman, Joshua Crowgey, Fei Xia, and Emily M. Bender, 2015. "Xigt: Extensible Interlinear Glossed Text for Natural Language Processing", Journal of Language Resources and Evaluation (LRE), 49(2), pp 455-485.
- Fei Xia, Michael Wayne Goodman, Ryan Georgi, Glenn Slayden, and William D. Lewis, 2015. "Enriching, Editing, and Representing Interlinear Glossed Text", in Proceedings of the 16th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2015), April 14-20, Cairo, Egypt.
2. Treebank development
- Workshops:
- The 6th Linguistic Annotation
Workshop (The LAW VI), in conjunction with ACL-2012, Jeju, Republic of Korea, July 12-13, 2012.
- Workshop on South Asian Languages: Formal Approaches and Computational Resources, in
conjunction with the 2011 Linguistic Summer Institute, Boulder, Colorado. July
23, 2011.
- Workshop on Treebank Annotation, at NAACL 2007, Rochester, NY, April 26, 2007.
(Sponsored by NSF)
- Conversion from dependency structure to phrase structure:
- Rajesh Bhatt, Owen Rambow, and Fei Xia, 2012. Creating a Tree Adjoining Grammar from a Multilayer Treebank, in Proceedings of the 11th International Workshop on Tree Adjoining Grammars and Related Formalisms (TAG+11), pages 162-170, Paris, France, September 2012.
- Rajesh Bhatt and Fei Xia, 2012.
Challenges in Converting between Treebanks: a Case Study from the HUTB,
in Proceedings of META-RESEARCH Workshop on Advanced Treebanking, in conjunction with LREC-2012, Istanbul, Turkey.
- Rajesh Bhatt, Owen Rambow, and Fei Xia, 2011.
Linguistic Phenomena, Analyses, and Representations: Understanding Conversion between Treebanks,
In the Proc. of the IJCNLP, Chiang Mai, Thailand, Nov 9-13, 2011.
- Fei Xia, Owen Rambow, Rajesh Bhatt, Martha Palmer, and Dipti Misra Sharma, 2009.
Towards a Multi-Representational Treebank, the 7th International Workshop on Treebanks and Linguistic Theories (TLT 2009), pages 159-170, Groningen, Netherlands, Jan 23-24, 2009.
- Fei Xia and Martha Palmer, 2001.
Converting Dependency Structures to Phrase Structures,
Proceedings of the 1st Human Language Technology Conference (HLT-2001), San
Diego, Mar 18-21, 2001.
- The Hindi/Urdu Treebank Project:
- Riyaz Ahmad Bhat, Rajesh Bhatt, Annahita Farudi, Prescott Klassen, Bhuvana Narasimhan, Martha Palmer, Owen Rambow, Dipti Misra Sharma, Ashwini Vaidya, Sri Ramagurumurthy Vishnu, and Fei Xia, 2014. The Hindi/Urdu Treebank Project, to appear in the Handbook of Linguistic Annotation (edited by Nancy Ide and James Pustejovsky), Springer Press.
- Archna Bhatia, Rajesh Bhatt, Bhuvana Narasimhan, Martha Palmer, Owen Rambow,
Dipti Misra Sharma, Michael Tepper, Ashwini Vaidya, Fei Xia, 2010.
Empty Categories in a Hindi Treebank,
Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pages 1863-1870, Valletta, Malta, May 19-21, 2010.
- Martha Palmer, Rajesh Bhatt, Bhuvana Narasimhan, Owen Rambow, Dipti
Misra Sharma, and Fei Xia, 2009.
Hindi Syntax: Annotating Dependency, Lexical Predicate-Argument
Structure, and Phrase Structure,
Proceedings of the 7th International Conference on Natural Language Processing (ICON-2009), pages 259-268, Hyderabad, India, Dec 14-17, 2009.
- Rajesh Bhatt, Bhuvana Narasimhan, Martha Palmer, Owen Rambow,
Dipti Misra Sharma, and Fei Xia, 2009.
A Multi-Representational and Multi-Layered Treebank for Hindi/Urdu,
Proceedings of the Third Linguistic Annotation Workshop (LAW 2009), ACL-IJCNLP 2009, pages 186-189, Singapore, 6-7 August 2009.
- The Chinese Penn Treebank Project:
- Nianwen Xue, Fei Xia, Fu-dong Chiou, and Martha Palmer, 2005.
The Penn Chinese Treebank: Phrase Structure Annotation of a Large Corpus,
Journal of Natural Language Engineering, 11(2): 207-238, 2005.
Cambridge University Press.
- Fei Xia, Martha Palmer, Nianwen Xue, Mary Ellen Okurowski, John Kovarik,
Fu-Dong Chiou, Shizhe Huang, Tony Kroch, and Mitch Marcus, 2000.
Developing Guidelines and Ensuring Consistency for Chinese Text Annotation,
Proceedings of the 2nd International Conference on Language Resources and Evaluation (LREC-2000), Athens, Greece, May 31 - June 2, 2000.
- Fei Xia, 2000.
The Segmentation Guidelines for the Penn Chinese Treebank (3.0),
IRCS Report 00-06, University of Pennsylvania, Oct 2000.
- Fei Xia, 2000.
The Part-of-Speech Guidelines for the Penn Chinese Treebank (3.0),
IRCS Report 00-07, University of Pennsylvania, Oct 2000.
- Nianwen Xue, Fei Xia, Shizhe Huang, and Anthony Kroch, 2000.
The Bracketing Guidelines for the Penn Chinese Treebank (3.0),
IRCS Report 00-08, University of Pennsylvania, Oct 2000.
3. Bio-NLP
- Phenotype detection:
- Cosmin Adrian Bejan, Lucy Vanderwende, Fei Xia, and Meliha Yetisgen-Yildiz, 2013. Assertion modeling and its role in clinical phenotype identification, Journal of Biomedical Informatics, 46(1):68-74.
- Meliha Yetisgen-Yildiz, Cosmin A. Bejan, Lucy Vanderwende, Fei Xia, Heather L. Evans, and Mark M. Wurfel. 2013. "Automated Tools for Phenotype Extraction from Medical Records", Abstract in the 2013 AMIA Joint Summits on Translational Science.
- Michael Tepper, Heather L. Evans, Fei Xia, Meliha Yetisgen-Yildiz. 2013. Modeling Annotator Rationales with Application to Pneumonia Classification, in Proceedings of the 2013 AAAI workshop on Expanding the Boundaries of Health Informatics Using Artificial Intelligence (HIAI 2013), July 15, Bellevue, WA.
- Cosmin Adrian Bejan, Fei Xia, Lucy Vanderwende, Mark M. Wurfel, and Meliha Yetisgen-Yildiz, 2012.
Pneumonia identification using statistical feature selection,
Journal of the American Medical Informatics Association (JAMIA), 19(5): 817-823.
- Meliha Yetisgen-Yildiz, Bradford Glavan, Fei Xia, Lucy Vanderwende, and Mark Wurfel, 2011.
Extraction of Pneumonia Cases from Free-Text Intensive Care Unit Reports.
The AMIA 2011 Annual Symposium.
- Meliha Yetisgen-Yildiz, Bradford Glavan, Fei Xia, Lucy Vanderwende, and Mark Wurfel, 2011.
Identifying Patients with Pneumonia from Free-Text Intensive Care Unit Reports.
In Proc. of the ICML workshop on Learning from Unstructured Clinical Text, Bellevue, WA, July 2, 2011.
- Imre Solti, Colin R. Cooke, Fei Xia, and Mark M. Wurfel, 2010.
Peeling Away the Black Box Label: Clinical Validation of a MaxEnt Machine Learning Character N-gram Feature Set for Acute Lung Injury,
Proceedings of the 2010 AMIA Summit on Translational Bioinformatics, San Francisco, CA, March 10-12, 2010.
- Imre Solti, Colin Cooke, Fei Xia, and Mark Wurfel, 2009.
Automated classification of radiology reports for acute lung injury: Comparison of keyword and machine learning based natural language processing approaches,
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine Workshop (BIBM-2009), pages 314-319, Washington DC, November 1-4, 2009.
- Detecting critical recommendations:
- Meliha Yetisgen-Yildiz, Martin Gunn, Fei Xia, and Tom Payne, 2013. Text Processing Pipeline to Extract Recommendations from Radiology Reports, Journal of Biomedical Informatics (JBI), 46(2):354-362.
- Meliha Yetisgen-Yildiz, Martin Gunn, Fei Xia, and Tom Payne, 2011.
Automatic Identification of Critical Follow-Up Recommendation Sentences in Radiology Reports.
In Proc. of the AMIA 2011 Annual Symposium, Washington DC, Oct 22-26, 2011.
- Clinical corpus annotation:
- Prescott Klassen, Fei Xia, and Meliha Yetisgen, 2016. "Annotating and Detecting Medical Events in Clinical Notes", in Proceedings of the 10th Language Resources and Evaluation Conference (LREC 2016), May 23-28, Portoroz, Slovenia.
- Prescott Klassen, Fei Xia, Lucy Vanderwende and Meliha Yetisgen, 2014. Annotating Clinical Events in Text Snippets for Phenotype Detection, in Proceedings of LREC 2014, Reykjavik, Iceland.
- Meliha Yetisgen-Yildiz, Prescott Klassen, Lucy Vanderwende, and Fei Xia, 2014. "A New Corpus for Clinical Events with Change of State", in Proceedings of the 2014 AMIA Joint Summit on Translational Science, San Francisco, April 7-11.
- Lucy Vanderwende, Fei Xia, and Meliha Yetisgen-Yildiz, 2013. Annotating Change of State for Clinical Events, in Proceedings of the 1st Workshop on Events: Definition, Detection, Coreference, and Representation, in conjunction with NAACL-2013, Atlanta, GA.
- Fei Xia and Meliha Yetisgen-Yildiz, 2012.
Clinical corpus annotation: challenges and strategies,
in Proceedings of the third Workshop on Building and Evaluating Resources for Biomedical Text Mining, in conjunction with LREC-2012, Istanbul, Turkey.
- Ozlem Uzuner, Imre Solti, Fei Xia, and Eithon Cadag, 2010.
Community Annotation Experiment for Ground Truth Generation for the i2b2 Medication Challenge,
Journal of the American Medical Informatics Association (JAMIA), 17:519-523.
- Meliha Yetisgen-Yildiz, Imre Solti, and Fei Xia, 2010.
Using Amazon's Mechanical Turk for Annotating Medical Named Entities,
Proceedings of the AMIA 2010 Annual Symposium, Washington DC, Nov 13-17, 2010.
- Meliha Yetisgen-Yildiz, Imre Solti, Fei Xia, and Scott Halgrim, 2010.
Preliminary Experiments with Amazon's Mechanical Turk for Annotating Medical Named Entities,
Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, pages 180-183, Los Angeles, June 2010.
- Extracting medication information (the 2009 i2b2 challenge):
- Scott Halgrim, Fei Xia, Imre Solti, Eithon Cadag, Ozlem Uzuner, 2011.
A cascade of MaxEnt classifiers applied to extracting medication information from discharge summaries,
Journal of Biomedical Semantics 2011, 2 (Suppl 3):S2.
- Scott Halgrim, Fei Xia, Imre Solti, Eithon Cadag and Ozlem Uzuner, 2010.
Extracting Medication Information from Discharge Summaries,
Proceedings of the NAACL HLT 2010 Second Louhi Workshop on Text and Data Mining of Health Documents, pages 61-67, Los Angeles, June 2010.
- Scott Halgrim, Fei Xia, Imre Solti, Eithon Cadag, and Ozlem Uzuner, 2010.
Statistical Extraction of Medication Information from Clinical Records,
2010 AMIA Summit on Translational Bioinformatics, San Francisco,
CA, March 10-12, 2010.
- Fei Xia, Imre Solti, and Ozlem Uzuner, 2009.
UW Internal Annotation Guidelines for the 2009 i2b2 Challenge and UW Medication IE System, Manuscript.
- Ozlem Uzuner, Imre Solti, and Fei Xia, 2009.
The i2b2 Medication Extraction Challenge Preliminary Annotation Guidelines.
- Ozlem Uzuner, Imre Solti, and Fei Xia, 2009.
The i2b2 Medication Extraction Challenge Evaluation Metrics.
- Other Bio-NLP topics:
- Meliha Yetisgen-Yildiz, Cosmin A. Bejan, Prescott Klassen, Michael Tepper, Lucy Vanderwende, and Fei Xia, 2013. "Text Processing Tools from the University of Washington Biomedical Language Processing Group", in Proceedings of the 2013 AMIA Symposium.
- Louise Deleger, Katalin Molnar, Guergana Savova, Fei Xia, Todd Lingren, Qi Li, Keith Marsolo, Anil G. Jegga, Megan Kaiser, Laura Stoutenborough, and Imre Solti, 2013.
Large Scale Evaluation of Automated Clinical Note De-identification and its Impact on Information Extraction.
Journal of the American Medical Informatics Association (JAMIA),
20(1): 84-94.
- Michael Tepper, Fei Xia, and Meliha Yetisgen-Yildiz, 2012. Smoking Status Detection across Domains, in Proceedings of the AMIA Fall Symposium, Chicago, Illinois, November 2012.
- Michael Tepper, Daniel Capurro, Fei Xia, Lucy Vanderwende, and Meliha Yetisgen-Yildiz, 2012.
Statistical Section Segmentation in Free-Text Clinical Records.
In the Proceedings of the LREC, Istanbul, Turkey, May 22-25, 2012.
- Cuijun Wu, Fei Xia, Louise Deleger, and Imre Solti, 2011.
Statistical Machine Translation for Biomedical Text: Are We There Yet?
In the Proc. of the AMIA 2011 Annual Symposium, Washington DC, Oct 22-26, 2011.
- Imre Solti, Scott Halgrim, and Fei Xia, 2010.
Addressing the Annotation Bottleneck for Clinical Natural Language Processing: Testing the Feasibility of Domain Adaptation for Medical Text,
Proceedings of the AMIA 2010 Annual Symposium, Washington DC, Nov 13-17, 2010.
4. Domain Adaptation
- Yan Song and Fei Xia, 2014. Modern Chinese Helps Archaic Chinese Processing: Finding and Exploiting the Shared Properties, in Proceedings of LREC 2014, Reykjavik, Iceland.
- Yan Song and Fei Xia, 2013. A Common Case of Jekyll and Hyde: the Synergistic Effect of Using Divided Source Training Data for Feature Augmentation, in Proceedings of IJCNLP, Oct 14-18, Nagoya, Japan.
- Xuezhe Ma and Fei Xia, 2013. Dependency Parser Adaptation with Subtrees from Auto-Parsed Target Domain Data, short paper, in Proceedings of ACL, Sofia, Bulgaria, Aug 2013.
- Yan Song, Prescott Klassen, and Fei Xia, 2012. Entropy-based Training Data Selection for Domain Adaptation, short paper, in Proceedings of COLING, Mumbai, India, Dec 2012.
- Dong Wang and Fei Xia, 2012.
Effect of Genre Variation and Prediction of System Performance,
In Proceedings of LREC, Istanbul, Turkey, May 22-25, 2012.
- Yan Song and Fei Xia, 2012.
Using a Goodness Measurement for Domain Adaptation: A Case Study on Chinese Word Segmentation,
In Proceedings of LREC, Istanbul, Turkey, May 22-25, 2012.
5. Chinese NLP
- Workshops:
- Qing Ma and Fei Xia (eds), 2003.
Proceedings of the 2nd SIGHAN Workshop on Chinese Language Processing (SIGHAN-2003), in conjunction with ACL 2003.
[workshop proceedings (preface)]
- Martha Palmer, Mitch Marcus, Aravind Joshi, and Fei Xia (eds), 2000.
Proceedings of the 2nd Chinese Language Processing Workshop (CLP-2000),
in conjunction with ACL 2000.
[workshop proceedings (front matter)]
- The 1st Chinese Language Processing Workshop (CLP-1998), Philadelphia, PA, June 30 - July 2, 1998.
- POS tagging:
- Alex Cheng, Fei Xia, and Jianfeng Gao, 2010.
A comparison of unsupervised methods for Part of Speech Tagging in Chinese,
Proceedings of the 23rd International Conference on Computational Linguistics
(COLING 2010), Poster Volume, pages 135-143, Beijing, China, August 23-27, 2010.
- Fei Xia and Lap Cheung, 2006.
Features, Bagging, and System Combination for the Chinese POS Tagging Task,
Proceedings of the 5th SIGHAN Workshop on Chinese Language Processing (SIGHAN 2006), pages 25-32, Sydney, Australia, July 22-23, 2006.
- The Chinese Penn Treebank Project (see the "Treebank Development" section)
- Others:
- Kam Tang Lau, Yan Song, and Fei Xia, 2013. The Construction of a Segmented and Part-of-speech Tagged Archaic Chinese Corpus: A Case Study on Huainanzi (in Chinese), in Proceedings of the 12th China National Conference on Computational Linguistics (CNCCL 2013), Oct 10-12, Suzhou, China.
6. Machine Translation
- Finding parallel text:
- Achim Ruopp and Fei Xia, 2008.
Finding parallel texts on the web using cross-language information retrieval,
Proceedings of the 2nd International Workshop on Cross Lingual Information Access, in conjunction with IJCNLP-2008, pages 18-25, Hyderabad, India, Jan 7-12, 2008.
- Statistical MT:
- Fei Xia and Michael McCord, 2004.
Improving a Statistical MT System with Automatically Learned Rewrite Patterns, the 20th International Conference on Computational Linguistics (COLING 2004), Geneva, Switzerland, Aug 22-29, 2004.
- Christoph Tillmann and Fei Xia, 2003.
A Phrase-Based Unigram Model for Statistical Machine Translation,
Proceedings of the 3rd Human Language Technology Conference (HLT/NAACL 2003), Edmonton, Canada, May 27 - June 2, 2003.
- Y. Al-Onaizan, R. Florian, M. Franz, H. Hassan, Y. S. Lee, S. McCarley, K. Papineni, S. Roukos, J. Sorensen, C. Tillmann, T. Ward, F. Xia, 2003.
TIPS: A Translingual Information Processing System,
Proceedings of the 3rd Human Language Technology Conference (HLT/NAACL-2003), Demonstration Session, pages 1-2, Edmonton, Canada, May 27 - June 2, 2003.
- Transfer-based MT:
- Hiyan Alshawi, Adam Buchsbaum, and Fei Xia, 1997.
A Comparison of Head Transducers and Transfer for a Limited Domain Translation,
Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL-1997), pages 360-365, Madrid, Spain, July 7-11, 1997.
- Hiyan Alshawi and Fei Xia, 1997.
English-to-Mandarin Speech Translation with Head Transducers,
Proceedings of the Workshop of Spoken Language Translation (SLT-1997), pages 54-60, Madrid, Spain, July 11, 1997.
7. Tree Adjoining Grammar
- Grammar Extraction (LexTract):
- Fei Xia and Martha Palmer, 2010.
From Treebank to Tree-Adjoining Grammar,
In Supertagging: Using Complex Lexical Descriptions in Natural Language Processing, edited by Srinivas Bangalore and Aravind K. Joshi, pages 35-72, MIT Press, 2010.
- Fei Xia, Chung-hye Han, Martha Palmer and Aravind Joshi, 2001.
Automatically Extracting and Comparing Lexicalized Grammars for Different Languages,
Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI-2001), pages 1321-1326, Seattle, Aug 4-10, 2001.
- Fei Xia, Martha Palmer, and Aravind Joshi, 2000.
A Uniform Method of Grammar Extraction and Its Applications,
Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP/VLC-2000), pages 53-62, Hong Kong, Oct 7-8, 2000.
- Fei Xia, Chung-hye Han, Martha Palmer, and Aravind Joshi, 2000.
Comparing Lexicalized Treebank Grammars Extracted from Chinese, Korean, and English Corpora,
Proceedings of the 2nd Chinese Language Processing Workshop (CLP-2000), pages 52-59, Hong Kong, Oct 8, 2000.
- Fei Xia and Martha Palmer, 2000.
Evaluating the Coverage of LTAGs on Annotated Corpora,
Proceedings of the Workshop on Using Evaluation within HLT Programs: Results and Trends, Athens, Greece, May 30, 2000.
- Fei Xia and Tonia Bleam, 2000.
A Corpus-based Evaluation of Syntactic Locality in TAGs,
Proceedings of the 5th International Workshop on Tree Adjoining Grammar and Related Formalisms (TAG+ 2000), pages 215-220, Paris, France, May 25-27, 2000.
- Fei Xia and Martha Palmer, 2000.
Comparing and Integrating Tree Adjoining Grammars,
Proceedings of the 5th International Workshop on Tree Adjoining Grammar and Related Formalisms (TAG+ 2000), pages 265-268, Paris, France, May 25-27, 2000.
- Fei Xia, 1999.
Extracting Tree Adjoining Grammars from Bracketed Corpora,
Proceedings of the 5th Natural Language Processing Pacific Rim Symposium (NLPRS-99), pages 398-403, Beijing, China, Nov. 1999.
- Grammar Generation (LexOrg):
- Fei Xia, Martha Palmer, and K. Vijay-Shanker, 2010.
Developing Tree-Adjoining Grammars with Lexical Descriptions,
in Supertagging: Using Complex Lexical Descriptions in Natural Language Processing, edited by Srinivas Bangalore and Aravind K. Joshi, pages 73-110, MIT Press, 2010.
- Fei Xia, Martha Palmer and K. Vijay-Shanker, 2005.
Automatically Generating Tree Adjoining Grammars from Abstract Specifications,
Journal of Computational Intelligence, 21(3), 246-287, 2005.
- Fei Xia, Martha Palmer, and K. Vijay-Shanker, 1999.
Towards Semi-automating Grammar Development,
Proceedings of the 5th Natural Language Processing Pacific Rim Symposium (NLPRS-99), pages 96-101, Beijing, China, Nov. 1999.
- Fei Xia, Martha Palmer, K. Vijay-Shanker and Joseph Rosenzweig, 1998.
Consistent Grammar Development Using Partial-Tree Descriptions for LTAGs,
Proceedings of the 4th International Workshop on Tree Adjoining Grammar and Related Formalisms (TAG+ 1998), pages 180-183, Philadelphia, Aug 1-3, 1998.
- Other Topics on LTAG:
- Anoop Sarkar, Fei Xia, and Aravind Joshi, 2000.
Some Experiments on Indicators of Parsing Complexity for Lexicalized Grammars,
In Proceedings of the Efficiency in Large-Scale Parsing Systems Workshop, Luxembourg, Aug 5, 2000.
- Christy Doran, Beth Ann Hockey, Anoop Sarkar, B. Srinivas and Fei Xia, 2000.
Evolution of the XTAG System,
in Tree Adjoining Grammars: Formalisms, Linguistic Analysis and Processing,
a CSLI volume edited by Anne Abeille and Owen Rambow, pages 371-404, 2000.
- Martha Palmer, Chung-hye Han, Fei Xia, Dania Egedi and Joseph Rosenzweig, 2000.
Constraining Lexical Selection across Languages Using Tree Adjoining Grammars,
in Tree Adjoining Grammars: Formalisms, Linguistic Analysis and Processing,
a CSLI volume edited by Anne Abeille and Owen Rambow, pages 445-466, 2000.
- C. Doran, B. Hockey, P. Hopely, J. Rosenzweig, A. Sarkar, B. Srinivas, F. Xia, A. Nasr and O. Rambow, 1997.
Maintaining the Forest and Burning out the Underbrush in XTAG,
Proceedings of the Workshop on Computational Environments for Grammar Development and Language Engineering (ENVGRAM-1997), pages 30-37, Madrid, Spain, July 12, 1997.
- Chung-hye Han, Fei Xia, Martha Palmer and Joseph Rosenzweig, 1996.
Capturing Language Specific Constraints on Lexical Selection with Feature-Based LTAGs,
Proceedings of International Conference on Chinese Computing (ICCC-1996), pages 106-113, Singapore, June 1996.
8. Other topics
- Morphological induction:
- Michael Tepper and Fei Xia, 2010.
Inducing Morphemes Using Light Knowledge,
Journal of ACM Transactions on Asian Language Information Processing (TALIP), 9(3): 1-38, 2010.
- Michael Tepper and Fei Xia, 2008.
A Hybrid Approach to the Induction of Underlying Morphology,
Proceedings of the Third International Joint Conference on Natural Language
Processing (IJCNLP-2008), pages 17-24, Hyderabad, India, Jan 7-12, 2008.
- Social media:
- Maria Antoniak, Eric Bell, and Fei Xia, 2015. "Extracting Topic-Specific Synonyms from Twitter", in Proceedings of the 10th Annual Women in Machine Learning Workshop, in conjunction with NIPS, Montreal, Canada, Dec 7.
- Maria Antoniak, Eric Bell, and Fei Xia, 2015. "Leveraging Paraphrase Labels to Extract Synonyms from Twitter", in Proceedings of the 28th International Florida Artificial Intelligence Research Society (FLAIRS) Conference, May 18-20, Hollywood, Florida, USA.
- Kelly Peterson, Matt Hohensee, and Fei Xia, 2011.
Email Formality in the Workplace: A Case Study on the Enron Corpus,
In Proceedings of the 2011 ACL Workshop on Language in Social Media (LSM 2011), Portland, Oregon, June 23, 2011.
- Teaching CL:
- Chris Brew, Martha Palmer, and Fei Xia (eds), 2008.
Proceedings of the 3rd Workshop on Issues in Teaching Computational Linguistics,
in conjunction with ACL 2008.
[workshop proceedings (front matter)]
- Fei Xia, 2008.
The evolution of a statistical NLP course,
In Proceedings of the Third Workshop on Issues in Teaching Computational
Linguistics (TeachCL-2008), pages 45-53, Columbus, Ohio, June 19-20, 2008.
- Emily Bender, Fei Xia, and Erik Bansleben, 2008.
Building a flexible, collaborative, intensive master's program in computational linguistics,
Proceedings of the Third Workshop on Issues in Teaching Computational Linguistics (TeachCL-2008), pages 10-18, Columbus, Ohio, June 19-20, 2008.
Last modified on Oct 9, 2016.