·
Workshops:
o
Fei
Xia, William Lewis, and Lori Levin (eds),
2010. Proceedings of the ACL Workshop on NLP and Linguistics: Find the Common Ground,
in conjunction with ACL/EACL 2010. (Sponsored by NSF) [workshop url] [Proceedings]
·
Developing
ODIN:
o
William
Lewis and Fei Xia, 2010. "Developing ODIN: A
Multilingual Repository of Annotated Language Data for Hundreds of the World's
Languages," Journal of Literary and Linguistic Computing (LLC),
25(3):303-319. [pdf]
o
Fei
Xia, Carrie Lewis and William D. Lewis, 2010. "The Problems of Language
Identification within Hugely Multilingual Data Sets," Proceedings of the
7th International Conference on Language Resources and Evaluation (LREC 2010),
pages 2790-2797, Valletta, Malta, May 19-21, 2010. [pdf]
o
Fei
Xia, William Lewis and Hoifung Poon, 2009.
"Language ID in the Context of Harvesting Language Data off the Web,"
Proceedings of the 12th Conference of the European Chapter of the ACL
(EACL-2009), pages 870-878, Athens, Greece, March 30 - April 3, 2009. [pdf]
o
William
Lewis and Fei Xia, 2009. "Parsing, Projecting
& Prototypes: Repurposing Linguistic Data on the Web," Proceedings of
the EACL 2009 Demonstrations Session, pages 41-44, Athens, Greece, March 30 -
April 3, 2009. [pdf]
o
Fei
Xia and William Lewis, 2008. "Repurposing Theoretical Linguistic Data for
Tool Development and Search," Proceedings of the Third International Joint
Conference on Natural Language Processing (IJCNLP-2008), pages 529-536,
Hyderabad, India, Jan 7-12, 2008. [pdf]
·
Building
language profiles and comparing languages:
o
Ryan
Georgi, Fei Xia and Will
Lewis, 2010. "Comparing Language Similarity across Genetic and Typologically-Based
Groupings," Proceedings of the 23rd International Conference on
Computational Linguistics (COLING 2010), pages 385-393, Beijing, China, August
23-27, 2010. [pdf]
o
Fei
Xia and William Lewis, 2009. "Applying NLP Technologies to the Collection
and Enrichment of Language Data on the Web to Aid Linguistic Research,"
Proceedings of the EACL 2009 Workshop on Language Technology and Resources for
Cultural Heritage, Social Sciences, Humanities, and Education (LaTeCH-SHELT&R 2009), pages 51-59, Athens, Greece, 30
March 2009. [pdf]
o
William
Lewis and Fei Xia, 2008. "Automatically
Identifying Computationally Relevant Typological Features," Proceedings of
the Third International Joint Conference on Natural Language Processing
(IJCNLP-2008), pages 685-690, Hyderabad, India, Jan 7-12, 2008. [pdf]
·
Structural
projection:
o
Fei
Xia and William Lewis, 2007. "Multilingual Structural Projection across Interlinearized Text, "
Proceedings of NAACL HLT 2007, pages 452-459, Rochester, NY, April 22-27, 2007.
[pdf]
o
William
Lewis, Fei Xia, and Dan Jinguji, 2006.
"Enriching Language Data through Projected Structures", Proceedings
of the Workshop on Computational Linguistics for Less-studied Languages, Texas
Linguistics Society 10 (TLSX), pages 85-98, Austin, Texas, Nov 3-5, 2006. [pdf]
·
Workshops:
o
The 6th Linguistic Annotation
Workshop (The LAW VI), in conjunction with ACL-2012, Jeju,
Republic of Korea, July 12-13, 2012.
o
Workshop
on South Asian Languages: Formal Approaches and Computational Resources, in
conjunction with the 2011 Linguistic Summer Institute, Boulder, Colorado. July
23, 2011.
o
Workshop
on Treebank Annotation, at NAACL 2007, Rochester, NY, April 26, 2007.
(Sponsored by NSF)
·
The
Hindi/Urdu Treebank Project
o
Rajesh Bhatt, Owen Rambow,
and Fei Xia, 2011. “Linguistic Phenomena,
Analyses, and Representations: Understanding Conversion between Treebanks”, In the Proc. of the IJCNLP, Chiang Mai,
Thailand, Nov 9-13, 2011. [pdf]
o
Archna Bhatia, Rajesh Bhatt, Bhuvana Narasimhan, Martha Palmer, Owen Rambow,
Dipti Misra Sharma, Michael
Tepper, Ashwini Vaidya, Fei Xia, 2010.
"Empty Categories in a Hindi Treebank", Proceedings of the 7th
International Conference on Language Resources and Evaluation (LREC 2010),
pages 1863-1870, Valletta, Malta, May 19-21, 2010. [pdf]
o
Martha
Palmer, Rajesh Bhatt, Bhuvana Narasimhan,
Owen Rambow, Dipti Misra Sharma, and Fei Xia, 2009.
"Hindi Syntax: Annotating Dependency, Lexical Predicate-Argument
Structure, and Phrase Structure", Proceedings of the 7th International
Conference on Natural Language Processing (ICON-2009), pages 259-268,
Hyderabad, India, Dec 14-17, 2009. [pdf]
o
Rajesh
Bhatt, Bhuvana Narasimhan,
Martha Palmer, Owen Rambow, Dipti
Misra Sharma, and Fei Xia,
2009. "A Multi-Representational and Multi-Layered Treebank for
Hindi/Urdu," Proceedings of the Third Linguistic Annotation Workshop (LAW
2009), ACL-IJCNLP 2009, pages 186-189, Suntec,
Singapore, 6-7 August 2009. [pdf]
·
The
Chinese Penn Treebank Project
o
Nianwen Xue, Fei
Xia, Fu-dong Chiou, and Martha Palmer, 2005.
"The Penn Chinese Treebank: Phrase Structure Annotation of a Large
Corpus", Journal of Natural Language Engineering, 11(2): 207-238, 2005.
Cambridge University Press. [pdf]
o
Fei
Xia, Martha Palmer, Nianwen Xue,
Mary Ellen Okurowski, John Kovarik,
Fu-Dong Chiou, Shizhe
Huang, Tony Kroch, and Mitch Marcus, 2000. "Developing Guidelines and
Ensuring Consistency for Chinese Text Annotation", the 2nd International
Conference on Language Resources and Evaluation (LREC-2000), Athens, Greece,
May 31 - June 2, 2000. [pdf]
o
Fei
Xia, 2000. "The Segmentation Guidelines for the Penn Chinese Treebank
(3.0)", IRCS Report 00-06, University of Pennsylvania, Oct 2000. [pdf]
o
Fei
Xia, 2000. "The Part-of-Speech Guidelines for the Penn Chinese Treebank
(3.0)", IRCS Report 00-07, University of Pennsylvania, Oct 2000. [pdf]
o
Nianwen Xue and Fei
Xia, 2000. “The Bracketing Guidelines for the Penn Chinese Treebank
(3.0)”, IRCS Report 00-08, University of Pennsylvania, Oct 2000. [pdf]
·
The
conversion from dependency structure to phrase structure:
o
Fei
Xia, Owen Rambow, Rajesh Bhatt, Martha Palmer, and Dipti Misra Sharma, 2009.
"Towards a Multi-Representational Treebank," the 7th International
Workshop on Treebanks and Linguistic Theories (TLT
2009), pages 159-170, Groningen, Netherlands, Jan 23-24, 2009. [pdf]
o
Fei
Xia and Martha Palmer, 2001. "Converting Dependency Structures to Phrase
Structures", the 1st Human Language Technology Conference (HLT-2001), San
Diego, Mar 18-21, 2001. [pdf]
·
The
deCIPHR Project:
o
Meliha Yetisgen-Yildiz, Bradford Glavan,
Fei Xia, Lucy Vanderwende,
and Mark Wurfel, 2011. “Identifying Patients
with Pneumonia from Free-Text Intensive Care Unit Reports”. In Proc. of
the ICML workshop on Learning from Unstructured Clinical Text, Bellevue, WA,
July 2, 2011. [pdf]
·
The
2009 i2b2 challenge:
o
Creating
gold standard for the challenge:
1.
Ozlem Uzuner, Imre
Solti, Fei Xia, and Eithon Cadag. "Community Annotation Experiment for Ground
Truth Generation for the i2b2 Medication Challenge", Journal of the
American Medical Informatics Association (JAMIA), 17:519-523. [pdf]
2.
Fei Xia, Imre Solti, and Ozlem Uzuner, 2009. "UW Internal
Annotation Guidelines for the 2009 i2b2 Challenge and UW Medication IE
System," Manuscript. [manuscript]
3.
Ozlem Uzuner, Imre
Solti, and Fei Xia, 2009. "i2b2
Medication Extraction Challenge Preliminary Annotation Guidelines,"
Manuscript. [manuscript]
4.
Ozlem Uzuner, Imre
Solti, and Fei Xia, 2009. "i2b2
Medication Extraction Challenge Evaluation Metrics," Manuscript. [manuscript]
o
Extracting
medication information:
1.
Scott
Halgrim, Fei Xia, Imre Solti, Eithon Cadag, Ozlem Uzuner,
2011. “A cascade of MaxEnt classifiers applied
to extracting medication information from discharge summaries”, Journal
of Biomedical Semantics 2011, 2 (Suppl 3):S2. [pdf]
·
Machine
translation for biomedical text:
o Cuijun Wu, Fei Xia, Louise Deleqer, and Imre Solti, 2011. “Statistical Machine Translation
for Biomedical Text: Are We There Yet?” In the Proc. of the AMIA 2011
Annual Symposium, Washington DC, Oct 22-26, 2011. [pdf]
·
Using
Mechnical Turk for medical named entity annotation:
o
Meliha Yetisgen-Yildiz, Imre Solti, Fei Xia, and Scott Halgrim, 2010. "Preliminary Experiments with Amazon's
Mechanical Turk for Annotating Medical Named Entities," Proceedings of the
NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's
Mechanical Turk, pages 180-183, Los Angeles, June 2010. [pdf]
·
Detecting
critical recommendations:
o
Meliha Yetisgen-Yildiz, Martin Gunn, Fei
Xia, and Tom Payne, 2011. “Automatic Identification of Critical Follow-Up
Recommendation Sentences in Radiology Reports”. In Proc. of the AMIA 2011
Annual Symposium, Washington DC, Oct 22-26, 2011. [pdf]
·
Detecting
Acute Lung Injury (ALI):
o
Imre
Solti, Colin R. Cooke, Fei Xia, and Mark M. Wurfel, 2010. "Peeling Away the Black Box Label:
Clinical Validation of a MaxEnt Machine Learning
Character N-gram Feature Set for Acute Lung Injury," 2010 AMIA Summit on
Translational Bioinformatics, San Francisco, CA, March 10-12, 2010. [pdf]
o
Imre
Solti, Colin Cooke, Fei Xia, and Mark Wurfel, 2009: "Automated classification of radiology
reports for acute lung injury: Comparison of keyword and machine learning based
natural language processing approaches," IEEE International Conference on
Bioinformatics and Biomedicine Workshop (BIBM-2009), pages 314-319, Washington
DC, November 1-4, 2009. [pdf]
·
POS
tagging:
o
Alex
Cheng, Fei Xia, and Jianfeng
Gao, 2010. "A comparison
of unsupervised methods for Part of Speech Tagging in Chinese,"
Proceedings of the 23rd International Conference on Computational Linguistics
(COLING 2010), Poster Volume, pages 135-143, Beijing, China, August 23-27,
2010. [pdf]
o
Fei
Xia and Lap Cheung, 2006. "Features, Bagging, and System Combination for
the Chinese POS Tagging Task," Proceedings of the 5th SIGHAN Workshop on
Chinese Language Processing (SIGHAN 2006), pages 25-32, Sydney, Australia, July
22-23, 2006. [pdf]
·
Workshops:
o
Qing
Ma and Fei Xia (eds),
2003. Proceedings of the 2nd SIGHAN Workshop on Chinese Language Processing
(SIGHAN-2003), in conjunction with ACL 2003. [workshop
proceedings (preface)]
o
Martha
Palmer, Mitch Marcus, Aravind Joshi, and Fei Xia (eds),
2000. Proceedings of the 2nd Chinese Language Processing Workshop (CLP-2000),
in conjunction with ACL 2000. [workshop
proceedings (front matter)]
o
The 1st
Chinese Language Processing Workshop (CLP-1998), Philadelphia, PA, June 30
– July 2, 1998.
·
The
Chinese Penn Treebank Project (see the “Treebank Development”
section)
·
Michael
Tepper and Fei Xia, 2010.
"Inducing Morphemes Using Light Knowledge," Journal of ACM
Transactions on Asian Language Information Processing (TALIP), 9(3): 1-38,
2010. [pdf]
·
Michael
Tepper and Fei Xia, 2008.
"A Hybrid Approach to the Induction of Underlying Morphology,"
Proceedings of the Third International Joint Conference on Natural Language
Processing (IJCNLP-2008), pages 17-24, Hyderabad, India, Jan 7-12, 2008. [pdf]
·
Finding
parallel text:
o
Achim Ruopp and Fei
Xia, 2008. "Finding parallel texts on the web using cross- language
information retrieval", Proceedings of the 2nd International Workshop on
"Cross Lingual Information Access" in conjunction with IJCNLP-2008,
pages 18-25, Hyderabad, India, Jan 7-12, 2008. [pdf]
·
Statistical
MT:
o
Fei
Xia and Michael McCord, 2004. "Improving a Statistical MT System with
Automatically Learned Rewrite Patterns", the 20th International Conference
on Computational Linguistics (COLING 2004), Geneva, Switzerland, Aug 22-29,
2004. [pdf]
o
Christoph Tillmann and Fei
Xia, 2003. "A Phrase-Based Unigram Model for Statistical Machine
Translation", the 3rd Human Language Technology Conference (HLT/NAACL
2003), Edmonton, Canada, May 27 -- June 2, 2003. [pdf]
o
Y.
Al-Onaizan, R. Florian, M. Franz, H. Hassan, Y. S.
Lee, S. McCarley, K. Papineni,
S. Roukos, J. Sorensen, C. Tillmann,
T. Ward, F. Xia, 2003. "TIPS: A Translingual
Information Processing System", Proceedings of the 3rd Human Language
Technology Conference (HLT/NAACL-2003), Demonstration Session, pages 1-2,
Edmonton, Canada, May 27 - June 2, 2003. [pdf]
·
Transfer-based
MT:
o
Hiyan Alshawi, Adam Buchsbaum
and Fei Xia, 1997. "A Comparison of Head
Transducers and Transfer for a Limited Domain Translation", Proceedings of
the 35th Annual Meeting of the Association for Computational Linguistics
(ACL-1997), pages 360-365, Madrid, Spain, July 7-11, 1997. [pdf]
o
Hiyan Alshawi and Fei
Xia, 1997. "English-to-Mandarin Speech Translation with Head
Transducers", Proceedings of the Workshop of Spoken Language Translation
(SLT-1997), pages 54-60, Madrid, Spain, July 11, 1997. [pdf]
·
Fei
Xia and Martha Palmer, 2010. "From Treebank to Tree-Adjoining
Grammar", in "Supertagging: Using Complex
Lexical Descriptions in Natural Language Processing", edited by Srinivas Bangalore and Aravind K.
Joshi, pages 35-72, MIT Press, 2010. [pdf]
·
Fei
Xia, Chung-hye Han, Martha Palmer and Aravind Joshi, 2001. "Automatically Extracting and
Comparing Lexicalized Grammars for Different Languages", the 17th
International Joint conference on Artificial Intelligence (IJCAI-2001), pages
1321-1326, Seattle, Aug 4-10, 2001. [pdf]
·
Fei
Xia, Martha Palmer, and Aravind Joshi, 2000. "A
Uniform Method of Grammar Extraction and Its Applications", the Joint
SIGDAT Conference on Empirical Methods in Natural Language Processing and Very
Large Corpora (EMNLP/VLC-2000), pages 53-62, Hong Kong, Oct 7-8, 2000. [pdf]
·
Fei
Xia, Chung-hye Han, Martha Palmer, and Aravind Joshi, 2000. "Comparing Lexicalized Treebank
Grammars Extracted from Chinese, Korean, and English Corpora", the 2nd
Chinese Language Processing Workshop (CLP-2000), pages 52-59, Hong Kong, Oct 8,
2000. [pdf]
·
Fei
Xia and Martha Palmer, 2000. "Evaluating the Coverage of LTAGs on
Annotated Corpora", the Workshop on Using Evaluation within HLT Programs:
Results and Trends, Athens, Greece, May 30, 2000. [pdf]
·
Fei
Xia and Tonia Bleam, 2000. "A Corpus-based
Evaluation of Syntactic Locality in TAGs", the 5th International Workshop
on Tree Adjoining Grammar and Related Formalisms (TAG+ 2000), pages 215-220,
Paris, France, May 25-27, 2000. [pdf]
·
Fei
Xia and Martha Palmer, 2000. "Comparing and Integrating Tree Adjoining
Grammars", the 5th International Workshop on Tree Adjoining Grammar and
Related Formalisms (TAG+ 2000), pages 265-268, Paris, France, May 25-27, 2000.
[pdf]
·
Fei
Xia, 1999. "Extracting Tree Adjoining Grammars from Bracketed
Corpora", the 5th Natural Language Processing Pacific Rim Symposium
(NLPRS-99), pages 398-403, Beijing, China, Nov. 1999. [pdf]
·
Fei
Xia, Martha Palmer, and Vijay Shanker, 2010.
"Developing Tree-Adjoining Grammars with Lexical Descriptions," in
"Supertagging: Using Complex Lexical
Descriptions in Natural Language Processing", edited by Srinivas Bangalore and Aravind K.
Joshi, pages 73-110, MIT Press, 2010. [pdf]
·
Fei
Xia, Martha Palmer and K. Vijay-Shanker, 2005.
"Automatically Generating Tree Adjoining Grammars from Abstract
Specifications", Journal of Computational Intelligence, 21(3), 246-287,
2005. [pdf]
·
Fei
Xia, Martha Palmer, K. Vijay-Shanker, 1999.
"Towards Semi-automating Grammar Development", the 5th Natural
Language Processing Pacific Rim Symposium (NLPRS-99), pages 96-101, Beijing,
China, Nov. 1999. [pdf]
·
Fei
Xia, Martha Palmer, K. Vijay-Shanker and Joseph Rosenzweig, 1998. "Consistent Grammar Development
Using Partial-Tree Descriptions for LTAGs", the 4th International Workshop
on Tree Adjoining Grammar and Related Formalisms (TAG+ 1998), page 180-183,
Philadelphia, Aug 1-3, 1998. [pdf]
·
Anoop Sarkar, Fei
Xia, and Aravind Joshi, 2000. "Some Experiments
on Indicators of Parsing Complexity for Lexicalized Grammars", Efficiency
in Large-Scale Parsing Systems Workshop, Luxembourg, Germany, Aug 5, 2000. [pdf]
·
Christy
Doran, Beth Ann Hockey, Anoop Sarkar,
B. Srinivas and Fei Xia,
2000. "Evolution of the XTAG System", in "Tree Adjoining
Grammars: Formalisms, Linguistic Analysis and Processing", a CSLI volume
edited by Anne Abeille and Owen Rambow,
pages 371-404, 2000. [pdf]
·
Martha
Palmer, Chung-hye Han, Fei
Xia, Dania Egedi and Joseph Rosenzweig,
2000. "Constraining Lexical Selection across Languages Using Tree
Adjoining Grammars", in "Tree Adjoining Grammars: Formalisms, Linguistic
Analysis and Processing", a CSLI volume edited by Anne Abeille
and Owen Rambow, pages 445-466, 2000. [pdf]
·
C.
Doran, B. Hockey, P. Hopely, J. Rosenzweig,
A. Sarkar, B. Srinivas, F.
Xia, A. Nasr and O. Rambow, 1997. "Maintaining
the Forest and Burning out the Underbrush in XTAG", the Workshop on
Computational Environments for Grammar Development and Language Engineering
(ENVGRAM-1997), pages 30-37, Madrid, Spain, July 12, 1997. [pdf]
·
Kelly Peterson, Matt Hohensee, and Fei Xia, 2011. “Email Formality in the Workplace: A Case Study on
the Enron Corpus,” In Proceedings of the 2011 ACL Workshop on Language in Social Media (LSM 2011), Portland, Oregon,
June 23, 2011. [pdf]
·
Chris
Brew, Martha Palmer, and Fei Xia (eds), 2008. Proceedings of the 3rd Workshop on Issues
in Teaching Computational Linguistics, in conjunction with ACL 2008. [workshop
proceedings (front matter)]
·
Fei
Xia, 2008. "The evolution of a statistical NLP course," In
Proceedings of the Third Workshop on Issues in Teaching Computational
Linguistics (TeachCL-2008), pages 45-53, Columbus, Ohio, June 19-20, 2008. [pdf]
·
Emily
Bender, Fei Xia, and Erik Bansleben,
2008. "Building a flexible, collaborative, intensive master's program in
computational linguistics," Proceedings of the Third Workshop on Issues in
Teaching Computational Linguistics (TeachCL-2008), pages 10-18, Columbus, Ohio,
June 19-20, 2008. [pdf]