LDC Papers
The following papers, presented or published by LDC staff, are listed by year and then alphabetically by the last name of the first author.
2012 | 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 | 1999 | 1998 | Undated
Eleftheria Ahtaridis, Christopher Cieri, Denise DiPersio
LDC Language Resource Papers: Building a Bibliographic Database
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
Available: Paper in PDF, Poster in PDF
Christopher Cieri, Marian Reed, Denise DiPersio, Mark Liberman
Twenty Years of Language Resource Development and Distribution: A Progress Report on LDC Activities
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
Available: Slides
Christopher Cieri, Malcah Yaeger-Dror
Toward the Harmonization of Metadata Practice for Spoken Languages Resources
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
SpeechCorpora 2012: Workshop on Best Practices for Speech Corpora in Linguistic Research
Available: Paper in PDF
Jennifer Garland, Stephanie Strassel, Safa Ismael, Zhiyi Song, Haejoong Lee
Linguistic Resources for Genre-Independent Language Technologies: User-Generated Content in BOLT
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
Available: Paper in PDF, Presentation Slides in PDF
David Graff, Mohamed Maamouri
Developing LMF-XML Bilingual Dictionaries for Colloquial Arabic Dialects
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
Available: Paper in PDF, Poster in PDF
Stephen Grimes, Katherine Peterson, Xuansong Li
Automatic Word Alignment Tools to Scale Production of Manually Aligned Parallel Texts
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
Available: Paper in PDF
Seth Kulick, Ann Bies, Justin Mott
Further Developments in Treebank Error Detection Using Derivation Trees
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
Available: Paper in PDF, Poster in PDF
Seth Kulick, Ann Bies, Justin Mott
Using Supertags and Encoded Annotation Principles for Improved Dependency to Phrase Structure Conversion
NAACL-HLT 2012: The 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Montreal, QC, Canada, June 3-8
Available: Paper in PDF, Poster in PDF
Xuansong Li, Stephanie M. Strassel, Heng Ji, Kira Griffitt, Joe Ellis
Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
Available: Paper in PDF
Xuansong Li, Stephanie Strassel, Stephen Grimes, Safa Ismael, Mohamed Maamouri, Ann Bies, Nianwen Xue
Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
Available: Paper in PDF
Mohamed Maamouri, Ann Bies, Seth Kulick
Expanding Arabic Treebank to Speech: Results from Broadcast News
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
Available: Paper in PDF, Poster in PDF
Mohamed Maamouri, Wajdi Zaghouani, Violetta Cavalli-Sforza, David Graff, Mike Ciul
Developing ARET: An NLP-based Educational Tool Set for Arabic Reading Enhancement
NAACL-HLT 2012: The 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Montreal, QC, Canada, June 3-8
Available: Paper in PDF
Zhiyi Song, Safa Ismael, Stephen Grimes, David Doermann, Stephanie Strassel
Linguistic Resources for Handwriting Recognition and Translation Evaluation
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
Available: Paper in PDF, Poster in PDF
Stephanie Strassel, Amanda Morris, Jonathan Fiscus, Christopher Caruso, Haejoong Lee, Paul Over, James Fiumara, Barbara Shaw, Brian Antonishek, Martial Michel
Creating HAVIC: Heterogeneous Audio Visual Internet Collection
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
Available: Paper in PDF, Poster in PDF
Stephanie Strassel, Kevin Walker, Karen Jones, Dave Graff, Christopher Cieri
New Resources for Recognition of Confusable Linguistic Varieties: The LRE11 Corpus
Odyssey 2012: The Speaker and Language Recognition Workshop, Singapore, Jun 25-28
Available: Paper in PDF, Presentation Slides in PDF
Kevin Walker, Stephanie Strassel
The RATS Radio Traffic Collection System
Odyssey 2012: The Speaker and Language Recognition Workshop, Singapore, Jun 25-28
Available: Paper in PDF
Jonathan Wright, Kira Griffitt, Joe Ellis, Stephanie Strassel, Brendan Callahan
Annotation Trees: LDC's customizable, extensible, scalable annotation infrastructure
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
Available: Paper in PDF
Seth Kulick, Ann Bies, and Justin Mott
Using Derivation Trees for Treebank Error Detection
ACL 2011, Portland, Oregon, USA, June 19-24, 2011
Available: Paper in PDF
Marianna Di Paolo, Malcah Yaeger-Dror, Christopher Cieri, Stephanie Strassel, Zsuzsanna Fagyal
Workshop: Towards Best Practices in Sociophonetics
NWAV39: New Ways of Analyzing Variation, San Antonio, Texas, November 4-6, 2010
Christopher Cieri, Stephanie Strassel
Robust, Digital, Empirical, Reproducible, Sociolinguistic, Methodology
Available: Presentation Slides in PPT format
Zsuzsanna Fagyal, Malcah Yaeger-Dror
Analyzing Rhythm I
Available: Presentation Slides in PDF format
Malcah Yaeger-Dror, Zsuzsanna Fagyal
Analyzing "Timing" 2
Available: Presentation Slides in PPTX format
Meghan Lammie Glenn, Stephanie M. Strassel, Haejoong Lee, Kazuaki Maeda, Ramez Zakhary and Xuansong Li
Transcription Methods for Consistency, Volume and Efficiency
LREC 2010, Workshop on Language Resources and Human Language Technologies for Semitic Languages. Valletta, Malta, May 2010
Available: Paper in PDF
Stephen Grimes, Xuansong Li, Ann Bies, Seth Kulick, Xiaoyi Ma, and Stephanie Strassel
Creating Arabic-English Parallel Word-Aligned Treebank Corpora at LDC
LREC 2010, Workshop on Language Resources and Human Language Technologies for Semitic Languages. Valletta, Malta, May 2010
Available: Paper in PDF
Seth Kulick and Ann Bies
A Treebank Query System Based on an Extracted Tree Grammar
Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics, Los Angeles, CA, June 2010
Available: Paper in PDF
Seth Kulick and Ann Bies
A TAG-derived Database for Treebank Search and Parser Analysis
TAG+10: 10th International Workshop on Tree Adjoining Grammars and Related Formalisms, New Haven, CT, June 10-12, 2010
Available: Paper in PDF
Seth Kulick, Ann Bies, and Mohamed Maamouri
Consistent and Flexible Integration of Morphological Annotation in the Arabic Treebank
LREC 2010, In Proceedings of the Seventh International Conference on Language Resources and Evaluation. Valletta, Malta, May 2010
Available: Paper in PDF, Poster in PDF
Xuansong Li, Niyu Ge, Stephen Grimes, Stephanie M. Strassel, Kazuaki Maeda
Enriching Word Alignment with Linguistic Tags
LREC 2010, In Proceedings of the Seventh International Conference on Language Resources and Evaluation. Valletta, Malta, May 2010
Available: Paper in PDF, Presentation in PDF
Xuansong Li, Stephanie Strassel, Stephen Grimes, Safa Ismael, Xiaoyi Ma, Niyu Ge, Ann Bies, Nianwen Xue, Mohamed Maamouri
Parallel Aligned Treebank Corpora at LDC: Methodology, Annotation and Integration
TLT9 - The Ninth International Workshop on Treebanks and Linguistic Theories, December 2, 2010, University of Tartu, Estonia
Workshop on Annotation and Exploitation of Parallel Corpora (AEPC).
Available: Paper in PDF
Mark Liberman
The Future of Computational Linguistics: or, What Would Antonio Zampolli Do?
Antonio Zampolli Prize speech, presented at LREC2010, Valletta, Malta, May 21, 2010
Available: Presentation Slides, Antonio Zampolli Prize Information: Prof. Antonio Zampolli Prize
Xiaoyi Ma
Toward a Name Entity Aligned Bilingual Corpus
LREC 2010, Workshop on Methods for the Automatic Acquisition of Language Resources and Their Evaluation Methods. Valletta, Malta, May 2010
Available: Paper in PDF
Mohamed Maamouri, Ann Bies, Seth Kulick, Wajdi Zaghouani, Dave Graff and Mike Ciul
From Speech to Trees: Applying Treebank Annotation to Arabic Broadcast News
LREC 2010, In Proceedings of the Seventh International Conference on Language Resources and Evaluation. Valletta, Malta, May 2010
Available: Paper in PDF, Poster in PDF
Mark Mandel
Paul McNamee, Hoa Trang Dang, Heather Simpson, Patrick Schone and Stephanie M. Strassel
Heather Simpson, Stephanie Strassel, Robert Parker, Paul McNamee
Zhiyi Song, Stephanie Strassel, Gary Krug and Kazuaki Maeda
Stephanie Strassel, Dan Adams, Henry Goldberg, Jonathan Herr, Ron Keesing, Daniel Oblinger, Heather Simpson, Robert Schrag and Jonathan Wright
Wajdi Zaghouani, Bruno Pouliquen, Mohamed Ebrahim and Ralf Steinberger
Wajdi Zaghouani, Ralf Steinberger and Bruno Pouliquen
Wajdi Zaghouani, Mona Diab, Aous Mansouri, Sameer Pradhan and Martha Palmer
Mohammed Maamouri
Steven Bird, Ewan Klein, and Edward Loper
Christopher Cieri
Christopher Cieri, Stephanie Strassel
Seth Kulick and Ann Bies
Catherine Lai and Steven Bird
Mohamed Maamouri, Ann Bies and Seth Kulick
Niklas Paulsson, Khalid Choukri, Djamel Mostefa, Denise DiPersio, Meghan Glenn and Stephanie Strassel
Stephanie Strassel
Chomicha Bendahman, Meghan Lammie Glenn, Djamel Mostefa, Niklas Paulsson, Stephanie Strassel
Linda Brandschain, Christopher Cieri, David Graff, Abby Neely, Kevin Walker
Christopher Cieri, Stephanie Strassel, Meghan Glenn, Reva Schwartz, Wade Shen, Joseph Campbell
Mona Diab, Aous Mansouri, Martha Palmer, Olga Babko-Malaya, Wajdi Zaghouani, Ann Bies, Mohammed Maamouri
Lauren Friedman, Stephanie Strassel, Meghan Lammie Glenn
Lauren Friedman, Stephanie Strassel
Lauren Friedman, Stephanie Strassel, Haejoong Lee
Ryan Gabbard and Seth Kulick
Meghan Lammie Glenn, Stephanie Strassel, Lauren Friedman, Haejoong Lee, Shawn Medero
Mohamed Maamouri, Seth Kulick, Ann Bies
Mohamed Maamouri, Ann Bies, Seth Kulick
Mohamed Maamouri, Ann Bies, Seth Kulick
Kazuaki Maeda, Haejoong Lee, Shawn Medero, Julie Medero, Robert Parker, Stephanie Strassel
Kazuaki Maeda, Xiaoyi Ma, Stephanie Strassel
Marian Reed, Denise DiPersio and Christopher Cieri
Zhiyi Song, Stephanie Strassel
Stephanie Strassel, Lauren Friedman, Safa Ismael, Linda Brandschain
Gary Simons and Steven Bird
Steven Bird and Haejoong Lee
Christopher Cieri
Christopher Cieri, Stephanie Strassel, Meghan Lammie Glenn, Lauren Friedman
Christopher Cieri, Linda Corson, David Graff, Kevin Walker
K. Ganchev, K. Crammer, F. Pereira, G. Mann, K. Bellare, A. McCallum, S. Carroll, Y. Jin, P. White
Kuzman Ganchev, Fernando Pereira, Mark Mandel, Steven Carroll, Peter White
Olga Babko-Malaya, Ann Bies, Ann Taylor, Szuting Yi, Martha Palmer, Mitch Marcus, Seth Kulick, Libin Shen
Ann Bies, Stephanie Strassel, Haejoong Lee, Kazuaki Maeda, Seth Kulick, Yang Liu, Mary Harper, Matthew Lease
Steven Bird, Yi Chen, Susan Davidson, Haejoong Lee, and Yifeng Zheng
Christopher Cieri
Christopher Cieri, Mark Liberman, Victoria Arranz and Khalid Choukri
Christopher Cieri
Christopher Cieri, Mark Liberman
Christopher Cieri, Walt Andrews, Joseph P. Campbell, George Doddington, Jack Godfrey, Shudong Huang, Mark Liberman, Alvin Martin, Hirotaka Nakasone, Mark Przybocki, Kevin Walker
Ryan Gabbard, Seth Kulick, Mitchell Marcus
David Graff, Tim Buckwalter, Hubert Jin, Mohamed Maamouri
Yang Jin, Ryan McDonald, Kevin Lerman, Mark Mandel, Steven Carroll, Mark Y Liberman, Fernando Pereira, Raymond Winters, Peter White
Xiaoyi Ma
Xiaoyi Ma, Christopher Cieri
Mohamed Maamouri, Ann Bies and Seth Kulick
Mohamed Maamouri, Ann Bies, Tim Buckwalter, Mona Diab, Nizar Habash, Owen Rambow, Dalila Tabessi
Kazuaki Maeda, Christopher Cieri, Kevin Walker
Kazuaki Maeda, Haejoong Lee, Julie Medero, Stephanie Strassel
Mark Mandel
Ryan McDonald, Kevin Lerman, and Fernando Pereira
Julie Medero, Kazuaki Maeda, Stephanie Strassel, Christopher Walker
Stephanie Strassel, Andrew W. Cole
Stephanie Strassel, Christopher Cieri, Andy Cole, Denise DiPersio, Mark Liberman, Xiaoyi Ma, Mohamed Maamouri, Kazuaki Maeda
Jiahong Yuan, Mark Liberman, Christopher Cieri
Ann Bies, Seth Kulick, Mark Mandel
Meghan Lammie Glenn, Stephanie Strassel
Jerry Goldman, Steve Renals, Steven Bird, Franciska de Jong, Marcello Federico, Carl Fleischhauer, Mark Kornbluh, Lori Lamel, Douglas Oard, Claire Stewart and Richard Wright
Jachym Kolar, Jan Svec, Stephanie Strassel, Christopher Walker, Dagmar Kozlakova, Josef Psutka
Violetta Cavalli-Sforza, Mohamed Maamouri
Christopher Cieri
Christopher Cieri
Yang Jin, Ryan T. McDonald, Kevin Lerman, Mark A. Mandel, Mark Y. Liberman, Fernando Pereira, R. Scott Winters, Peter S. White
Mohamed Maamouri
Ryan McDonald, Fernando Pereira, Seth Kulick, Scott Winters, Yang Jin, and Peter White
Tim Buckwalter (2004)
Christopher Cieri, Joseph P. Campbell, Hirotaka Nakasone, David Miller, Kevin Walker
Christopher Cieri, Mark Liberman
Christopher Cieri, David Miller, Kevin Walker
George Doddington, Alexis Mitchell, Mark Przybocki, Lance Ramshaw, Stephanie Strassel, Ralph Weischedel
Shudong Huang, Stephanie Strassel, Alexis Mitchell, Zhiyi Song
Seth Kulick, Ann Bies, Mark Liberman, Mark Mandel, Ryan McDonald, Martha Palmer, Andrew Schein, Lyle Ungar, Scott Winters, Pete White
Mohamed Maamouri and Ann Bies (2004)
Mohamed Maamouri, Tim Buckwalter, and Christopher Cieri (2004)
Mohamed Maamouri, Ann Bies, Tim Buckwalter, and Wigdan Mekki (2004)
Mohamed Maamouri, David Graff, Hubert Jin, Christopher Cieri, and Tim Buckwalter (2004)
Ryan McDonald, R. Scott Winters, Mark Mandel, Yang Jin, Peter S. White, Fernando Pereira
Kazuaki Maeda and Stephanie Strassel (2004)
Mike Maxwell
Douglas Oard, Dagobert Soergel, G. Craig Murray, David Doermann, Jianqiang Wang, Bhuvana Ramabhadran, Martin Franz, James Mayfield and Samuel Gustman, Stephanie Strassel
Stephanie Strassel
Colin Warner, Ann Bies, Christine Brisson, Justin Mott
Steven Bird and Gary Simons (2003)
Steven Bird and Gary Simons (2003)
Christopher Cieri, Stephanie Strassel
Christopher Cieri, Mike Maxwell, Stephanie Strassel
Baden Hughes and Steven Bird (2003)
Seth Kulick, Mark Liberman, Martha Palmer, and Andrew Schein
Mike Maxwell
Gary Simons and Steven Bird (2003)
Gary Simons and Steven Bird (2003)
Stephanie Strassel, David Miller, Kevin Walker, Christopher Cieri (2003)
Stephanie Strassel (2003)
Stephanie Strassel, Alexis Mitchell, Shudong Huang (2003)
Stephanie Strassel, Mike Maxwell, Christopher Cieri (2003)
Steven Bird, Kazuaki Maeda, Xiaoyi Ma, Haejoong Lee, Beth Randall, and Salim Zayat (2002)
Christopher Cieri and Stephanie Strassel
Christopher Cieri, Stephanie Strassel, David Graff, Nii Martey, Kara Rennert and Mark Liberman (2002)
Christopher Cieri, David Miller, Kevin Walker (2002)
Christopher Cieri, Stephanie Strassel, William Labov
Scott Cotton and Steven Bird (2002)
Xiaoyi Ma, Haejoong Lee, Steven Bird and Kazuaki Maeda (2002)
Mohamed Maamouri, Christopher Cieri
Kazuaki Maeda, Steven Bird, Xiaoyi Ma, and Haejoong Lee (2002)
Mike Maxwell, Gary Simons, and Larry Hayashi (2002)
Mike Maxwell (2002)
Horacio Saggion, Dragomir Radev, Simone Teufel, Wai Lam, Stephanie M. Strassel
Christopher Cieri, David Graff, David Miller, Kevin Walker (2001)
Christopher Cieri, Andy Cole, Dave Graff, Nii Martey, Stephanie Strassel, Cristina Tofan (2001)
Christopher Cieri and Steven Bird (2001)
Lea Christiansen, Christopher Cieri, Kathleen Egan, Anita Kulman, Milton Paul (2001)
David Miller, Christopher Cieri and Kevin Walker (2001)
Stephanie Strassel, Christopher Cieri and Steven Bird (2001)
Stephanie Strassel and Christopher Cieri
Steven Bird and Mark Liberman (2001)
Steven Bird, Gary Simons and Chu-Ren Huang (2001)
Steven Bird and Gary Simons (2001)
Kazuaki Maeda, Steven Bird, Xiaoyi Ma and Haejoong Lee (2001)
Kazuaki Maeda and Steven Bird (2001)
Steve Cassidy and Steven Bird (2000)
Christopher Cieri (2000)
Christopher Cieri (2000)
Christopher Cieri, David Graff, Nii Martey, Stephanie Strassel (2000)
Christopher Cieri, Dave Graff, Mark Liberman, Nii Martey and Stephanie Strassel (2000)
Christopher Cieri and Mark Liberman (2000)
David Graff and Steven Bird (2000)
Dave Graff, Stephanie Strassel and Christopher Cieri (2000)
Stephanie Strassel, Dave Graff, Nii Martey and Christopher Cieri (2000)
Steven Bird and Mark Liberman (1999)
Steven Bird and Mark Liberman (1999)
Steven Bird (1999)
Steven Bird and Stephanie Strassel (1999)
Alexandra Canavan, Kevin Walker, David Graff and Christopher Cieri (1999)
Christopher Cieri, David Graff, Mark Liberman, Nii Martey, Stephanie Strassel (1999)
Christopher Cieri (1999)
Xiaoyi Ma and Mark Liberman (1999)
Xiaoyi Ma (1999)
Stephanie Strassel (1999)
Stephanie Strassel and Christopher Cieri (1999)
Steven Bird and Mark Liberman (1998)
Christopher Cieri and David Graff (1998)
David Graff and Christopher Cieri (1998)
Mark Liberman and Christopher Cieri (1998)
Conomastics: The Naming of Science Fiction Conventions
American Name Society Annual Meeting, Baltimore, MD, Jan. 7-9, 2010
Available: Presentation, Presentation Slides in PDF, Presentation Slides with notes in PDF
An Evaluation of Technologies for Knowledge Base Population
LREC 2010, In Proceedings of the Seventh conference on International Language Resources and Evaluation. Valletta, Malta, May 2010
Available: Paper in PDF
Wikipedia and the Web of Confusable Entities: Experience from Entity Linking Query Creation for TAC 2009 Knowledge Base Population
LREC 2010, In Proceedings of the Seventh conference on International Language Resources and Evaluation. Valletta, Malta, May 2010
Available: Slides in PDF
Enhanced Infrastructure for Creation and Collection of Translation Resources
LREC 2010, In Proceedings of the Seventh conference on International Language Resources and Evaluation. Valletta, Malta, May 2010
Available: Paper in PDF
The DARPA Machine Reading Program - Encouraging Linguistic and Reasoning Research with a Series of Reading Tasks
LREC 2010, In Proceedings of the Seventh conference on International Language Resources and Evaluation. Valletta, Malta, May 2010
Available: Paper in PDF
Adapting a resource-light highly multilingual Named Entity Recognition system to Arabic
LREC 2010, In Proceedings of the Seventh conference on International Language Resources and Evaluation. Valletta, Malta, May 2010
Available: Paper in PDF, Presentation in PDF
A resource-light Arabic Named Entity Recognition system
2010 Georgetown University Round Table, Arabic Language and Linguistics. Georgetown, MD, March 12-14, 2010
Available: Presentation Slides
The Revised Arabic PropBank
ACL 2010, Proceedings of the Fourth Linguistic Annotation Workshop. Uppsala, Sweden, July 11-16, 2010
Available: Paper in PDF
LDC Arabic Reading Tools: "Read to Succeed"
presented at 2009 ACTFL Arabic SIG Meeting, San Diego, CA, November 21, 2009
Available: Presentation Slides, Additional Audiovisual Materials: Recorder, ilm, m_682, milad, p_682
Natural Language Processing with Python
O'Reilly Media Inc, 2009
Available: Book in HTML
Models of Phonological Variation for Multi-dialectal Communities: the case of L'Aquila
NWAV 38: New Ways of Analyzing Variation, University of Ottawa, Ottawa, Canada, October 22-25, 2009
Available: Presentation Slides
Closer Still to a Robust, All Digital, Empirical, Reproducible Sociolinguistic Methodology
NWAV 38: New Ways of Analyzing Variation, University of Ottawa, Ottawa, Canada, October 22-25, 2009
Available: Presentation Slides
Treebank Analysis and Search Using an Extracted Tree Grammar
TLT8: Eighth International Workshop on Treebanks and Linguistic Theories, Milan, Italy, Dec 3-5, 2009
Available: Paper in PDF
Querying Linguistic Trees
Journal of Logic, Language, and Information, Volume 18, 2009
Available: Paper in PDF
Creating a Methodology for Large-Scale Correction of Treebank Annotation: The Case of the Arabic Treebank
MEDAR Second International Conference on Arabic Language Resources and Tools, Cairo, Egypt, April 22-23, 2009
Available: Paper in PDF, Presentation Slides
A Large Arabic Broadcast News Speech Data Collection
MEDAR Second International Conference on Arabic Language Resources and Tools, Cairo, Egypt, April 22-23, 2009
Available: Paper in PDF, Poster
Linguistic Resources for Arabic Handwriting Recognition
MEDAR Second International Conference on Arabic Language Resources and Tools, Cairo, Egypt, April 22-23, 2009
Available: Paper in PDF
Quick Rich Transcriptions of Arabic Broadcast News Speech Data
LREC 2008, Marrakech, Morocco, May 28-30, 2008
Available: Paper in PDF
Speaker Recognition: Building the Mixer 4 and 5 Corpora
LREC 2008, Marrakech, Morocco, May 28-30, 2008
Available: Paper in PDF, Poster
Bridging the Gap between Linguists and Technology Developers: Large-Scale, Sociolinguistic Annotation for Dialect and Speaker Recognition
LREC 2008, Marrakech, Morocco, May 28-30, 2008
Available: Paper in PDF
A Pilot Arabic Propbank
LREC 2008, Marrakech, Morocco, May 28-30, 2008
Available: Paper in PDF
Explicit and Implicit Requirements of Technology Evaluations: Implications for Test Data Creation
LREC 2008, Marrakech, Morocco, May 28-30, 2008
Available: Paper in PDF
Identifying Common Challenges for Human and Machine Translation: A Case Study from the GALE Program
The Eighth Conference of the Association for Machine Translation in the Americas (AMTA), Waikiki, HI, Oct 21-25, 2008
Available: Paper in PDF
A Quality Control Framework for Gold Standard Reference Translations: The Process and Toolkit Developed for GALE
European Association for Machine Translation (EAMT): Translating & The Computer, London, England, Nov 19-20, 2008
Available: Paper in PDF
Construct State Modification in the Arabic Treebank
ACL 2008, Columbus, Ohio, June 16-18, 2008
Available: Paper in PDF
Management of Large Annotation Projects Involving Multiple Human Judges: a Case Study of GALE Machine Translation Post-editing
LREC 2008, Marrakech, Morocco, May 28-30, 2008
Available: Paper in PDF
Diacritic Annotation in the Arabic Treebank and Its Impact on Parser Evaluation
LREC 2008, Marrakech, Morocco, May 28-30, 2008
Available: Paper in PDF, Poster
Enhanced Annotation and Parsing of the Arabic Treebank
INFOS 2008, Cairo, Egypt, March 27-29, 2008
Available: Paper in PDF
Enhancing the Arabic Treebank: A Collaborative Effort toward New Annotation Guidelines
LREC 2008, Marrakech, Morocco, May 28-30, 2008
Available: Paper in PDF, Poster
Annotation Tool Development for Large-Scale Corpus Creation Projects at the Linguistic Data Consortium
LREC 2008, Marrakech, Morocco, May 28-30, 2008
Available: Paper in PDF
Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using
LREC 2008, Marrakech, Morocco, May 28-30, 2008
Available: Paper in PDF
The Linguistic Data Consortium Member Survey: Purpose, Execution and Results
LREC 2008, Marrakech, Morocco, May 28-30, 2008
Available: Paper in PDF, Presentation Slides
Entity Translation and Alignment in the ACE-07 ET Task
LREC 2008, Marrakech, Morocco, May 28-30, 2008
Available: Paper in PDF
New Resources for Document Classification Analysis and Translation Technologies
LREC 2008, Marrakech, Morocco, May 28-30, 2008
Available: Paper in PDF
Toward a Global Infrastructure for the Sustainability of Language Resources
22nd Pacific Asia Conference on Language, Information and Computation, Cebu City, Philippines, 2008
Available: Paper in PDF
Graphical Query for Linguistic Treebanks
Tenth Conference of the Pacific Association for Computational Linguistics, Melbourne 2007
Available: Paper in PDF
Phonological Variation in Multi-Dialectal Italy: distinguishing e from ?
NWAV 2007, Philadelphia, October 11-14, 2007
Available: Presentation Slides
Linguistic Resources in Support of Various Evaluation Metrics
MT Summit XI, Workshop on Automatic Procedures in MT Evaluation, Copenhagen, September 9-14, 2007
Available: Presentation Slides
Resources for New Research Directions in Speaker Recognition: The Mixer 3, 4 and 5 Corpora
Interspeech 2007, Antwerp, August 2007
Available: Paper in PDF, Presentation Slides
Penn/UMass/CHOP Biocreative II Systems
Biocreative 2. [In Press]
Available: Paper in PDF
Semi-automated Named Entity Annotation
Linguistic Annotation Workshop 2007 [In Press]
Available: Paper in PDF
Issues in Synchronizing the English Treebank and PropBank
Frontiers in Linguistically Annotated Corpora, A Merged Workshop with 7th International Workshop on Linguistically Interpreted Corpora (LINC-2006) and Frontiers in Corpus Annotation III, Coling/ACL 2006
Available: Paper in PDF
Linguistic Resources for Speech Parsing
LREC 2006: Fifth International Conference on Language Resources and Evaluation
Available: Paper in PDF
Designing and Evaluating an XPath Dialect for Linguistic Queries
22nd International Conference on Data Engineering (ICDE), Atlanta
Available: Paper in PDF
Linguistic Resources, Development and Evaluation
Chapter 8 in Laila Dybkjær, Holmer Hemsen and Wolfgang Minker, Evaluation of Text and Speech Systems, Kluwer Academic Publishers
Available: Forthcoming
Linguistic Data Resources
Chapter 3 in Tanja Schultz and Katrin Kirchhoff (eds.) Multilingual Speech Processing, Elsevier, Academic Press, ISBN 13: 978-0-12-088501-5. April 2006.
Available: Elsevier's Page
What is Quality?
Invited Talk at the Workshop on Quality Assurance and Quality Measurement for Language and Speech Resources
LREC 2006: Fifth International Conference on Language Resources and Evaluation
Available: Presentation Slides
More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities
LREC 2006: Fifth International Conference on Language Resources and Evaluation
Available: Paper in PDF, Presentation Slides
The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
LREC 2006: Fifth International Conference on Language Resources and Evaluation
Available: Paper in PDF, Presentation Slides
Fully Parsing the Penn Treebank
HLT-NAACL, 2006
Available: Paper in PDF
Lexicon Development for Varieties of Spoken Colloquial Arabic
LREC 2006: Fifth International Conference on Language Resources and Evaluation
Available: Paper in PDF
Automated recognition of malignancy mentions in biomedical literature
Open Access: BMC Bioinformatics 7:492
Available: Paper in PDF
Champollion: A Robust Parallel Text Sentence Aligner
LREC 2006: Fifth International Conference on Language Resources and Evaluation
Available: Paper in PDF
Corpus Support for Machine Translation at LDC
LREC 2006: Fifth International Conference on Language Resources and Evaluation
Available: Paper in PDF
Diacritization: A Challenge to Arabic Treebank Annotation and Parsing
Machine Translation SIG of the British Computer Society Conference
Available: Paper in PDF
Developing and Using a Pilot Dialectal Arabic Treebank
LREC 2006: Fifth International Conference on Language Resources and Evaluation
Available: Paper in PDF
Low-cost Customized Speech Corpus Creation for Speech Technology Applications
LREC 2006: Fifth International Conference on Language Resources and Evaluation
Available: Paper in PDF
A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit
LREC 2006: Fifth International Conference on Language Resources and Evaluation
Integrated Annotation of Biomedical Text: Creating the PennBioIE Corpus
Presented at Text Mining, Ontologies and Natural Language Processing in Biomedicine, Manchester, UK, March 20-21, 2006
Available: Abstract, Presentation Slides in PDF
Multilingual Dependency Parsing with a Two-Stage Discriminative Parser
Computational Natural Language Learning (CoNLL-X), 2006
Available: Paper as PDF
An Efficient Approach for Gold-Standard Annotation: Decision Points for Complex Tasks
LREC 2006: Fifth International Conference on Language Resources and Evaluation
Available: Paper in PDF
Corpus Development and Publication
LREC 2006: Fifth International Conference on Language Resources and Evaluation
Available: Paper in PDF and Poster in PPT
Integrated Linguistic Resources for Language Exploitation Technologies
LREC 2006: Fifth International Conference on Language Resources and Evaluation
Available: Paper in PDF, Presentation Slides
Towards an Integrated Understanding of Speaking Rate in Conversation
The Ninth International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP), Pittsburgh, Pennsylvania
Available: Paper in PDF, Presentation Slides
Parallel Entity and Treebank Annotation
Presented at Frontiers in Corpus Annotation II: Pie in the Sky, ACL 2005 workshop, Ann Arbor, June 29, 2005
Available: Paper in PDF
Linguistic Resources for Meeting Speech Recognition
MLMI 2005, Edinburgh, UK, July 11-13, 2005
Available: Paper in PDF
Transforming Access to the Spoken Word
International Journal on Digital Libraries 5, 287-298, 2005
Available: Paper in PDF
Czech spontaneous speech corpus with structural metadata
Interspeech 2005, Lisbon, Portugal, September 4-8, 2005
Available: Paper in PDF
Extensions to Histogram-Based Student Modeling Approach to Facilitate Reading in Morphologically Complex Languages
AIED: International Conference on Artificial Intelligence in Education
Available: Paper in PDF
HLT Evaluation: The Role of Data Centers
ELRA HLT Evaluation Workshop, Malta, December 2005
Available: Presentation Slides
Modeling Phonological Variation in Multidialectal Italy
University of Pennsylvania, Doctoral Dissertation, May 2005
Available: PDF from ProQuest
Identifying and Extracting Malignancy Types in Cancer Literature
Presented at BioLink 2005: ISMB/ACL, Detroit, June 24, 2005
Available: Paper in PDF
Arabic Literacy
Lemma, 11,16 in Encyclopedia of Arabic Language and Linguistics (EALL). Vol 2
Available: Paper in PDF
Simple Algorithms for Complex Relation Extraction with Applications to Biomedical IE
43rd Annual Meeting of the Association for Computational Linguistics, 2005
Available: Paper in PDF
Issues in Arabic Orthography and Morphology Analysis
Proceedings of the Workshop on Computational Approaches to Arabic Script-based Languages, COLING 2004, Geneva, August 28, 2004.
Available: Paper in PDF
The Mixer Corpus of Multilingual, Multichannel Speaker Recognition Data
LREC 2004: Fourth International Conference on Language Resources and Evaluation, Lisbon
Available: Paper in PDF, Poster in PowerPoint Format
Progress Report from the Linguistic Data Consortium: recent activities in resource creation and distribution and the development of tools and standards
LREC 2004: Fourth International Conference on Language Resources and Evaluation, Lisbon
Available: Paper in PDF, Presentation Slides
The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text
LREC 2004: Fourth International Conference on Language Resources and Evaluation, Lisbon
Available: Paper in PDF, Presentation Slides
Automatic Content Extraction (ACE) program - task definitions and performance measures
LREC 2004: Fourth International Conference on Language Resources and Evaluation
Available: Paper in PDF
Shared Resources for Multilingual Information Extraction and Challenges in Named Entity Annotation
IJCNLP-04 Workshop on Named Entity Recognition for NLP Applications, Hainan Island, China, March 2004
Available: Paper in PDF
Integrated Annotation for Biomedical Information Extraction
Presented at HLT/NAACL Workshop BioLink 2004, Boston, May 2-7, 2004
Available: Paper in PDF, Presentation Slides
Developing an Arabic Treebank: Methods, Guidelines, Procedures, and Tools
Proceedings of the Workshop on Computational Approaches to Arabic Script-based
Languages, COLING 2004, Geneva, August 28, 2004.
Available: Paper
in PDF
Dialectal Arabic Telephone Speech Corpus: Principles, Tool Design, and
Transcription Conventions
Paper presented at the NEMLAR International Conference on Arabic Language
Resources and Tools, Cairo, Sept. 22-23, 2004.
Available: Paper
in PDF, Presentation
Slides.
The Penn Arabic Treebank: Building a Large-Scale Annotated Arabic Corpus
Paper presented at the NEMLAR International Conference on Arabic Language
Resources and Tools, Cairo, Sept. 22-23, 2004.
Available: Paper in
PDF
Dialectal Arabic Orthography-based Transcription and CTS Levantine Arabic
Collection.
Paper presented at the Parallel STT-NA Tracks Session of the EARS RT-04
Workshop, Palisades IBM Executive Center, New York, Nov. 10, 2004.
Available: Paper
in Word format
An entity tagger for recognizing acquired genomic variations in cancer
literature
Bioinformatics 20:3249-3251
Available: Paper
in PDF
Annotation Tools for Large-Scale Corpus Development: Using AGTK at the
Linguistic Data Consortium.
LREC 2004: Fourth International Conference on Language Resources and
Evaluation
Available: Paper in PDF
From Legacy Lexicon to Archivable Resource
First Steps for Language Documentation of Minority Languages: Workshop on
Computational Linguistic Tools for Morphology, Lexicon and Corpus Compilation, LREC
2004
Available: Paper
in PDF
Building an Information Retrieval Test Collection for Spontaneous
Conversational Speech
27th Annual International ACM SIGIR Conference (SIGIR2004), Sheffield,
England, July 2004
Available: Paper in
PDF
Linguistic Resources for Effective, Affordable, Reusable Speech-to-Text
LREC 2004: Fourth International Conference on Language Resources and
Evaluation
Available: Paper in
PDF
Addendum to the Penn Treebank II Style Bracketing Guidelines: BioMedical
Treebank Annotation
November, 2004
Available: Paper in PDF, Paper as web page, Paper in plain text
Seven Dimensions of Portability for Language Documentation and Description
Language 79, 557-582.
Available: Paper in PDF
Extending Dublin Core Metadata to support the description and discovery of
language resources
Computers and the Humanities 37, 375-388.
Available: Paper in PDF
Robust Sociolinguistic Methodology: Tools, Data and Best Practices
NWAV 32, Philadelphia, 2003
Available: Presentation
Slides
Core Linguistic Resources for the World's Languages
ELSNET, ENABLER, ICWLR Joint Workshop, Paris, 2003
Available: Presentation
Slides
Grid-Enabling Natural Language Engineering By Stealth
Proceedings of the Workshop on The Software Engineering and Architecture of
Language Technology Systems (SEALTS)
Available: arXiv.org
Shallow Semantic Annotation of Biomedical Corpora for Information Extraction
ISMB Special Interest Group Meeting on Text Mining (BioLink). June 2003.
Brisbane, Australia
Available: Paper in PDF, Presentation Slides
Incremental Grammar Development using Finite State Tools
Proceedings of the Workshop on Finite-State Methods in Natural Language
Processing, EACL 10, Budapest, 13-14 April 2003.
Available: Paper in PDF
The Open Language Archives Community: An infrastructure for distributed
archiving of language resources
Literary and Linguistic Computing 18 (in press)
Available: arXiv.org
Building an Open Language Archives Community on the OAI Foundation
Library Hi Tech 21, 210-218, Special Issue on Open Archives Initiative
Metadata Harvesting.
Available: Paper in PDF
Shared Resources for Robust Speech-to-Text Technology
Eurospeech 2003
Available: Paper
in PDF
Corpus Creation for Disfluency Research
Disfluency in Spontaneous Speech Conference, Gothenburg, Sweden
Available: Abstract
in PDF, Presentation
Slides in PDF
Multilingual Resources for Entity Extraction
41st Annual Meeting of the Association for Computational Linguistics
Workshop on Multilingual and Mixed-language Named Entity Recognition:
Combining Statistical and Symbolic Models, Sapporo, Japan
Available: Paper
in PDF
Linguistic Resource Creation for Research and Technology Development: A
Recent Experiment
Association for Computing Machinery Transactions on Asian Language
Information Processing (TALIP). Volume 2, Issue 2, 101-117
Available: Paper
in PDF
TableTrans, MultiTrans, InterTrans and TreeTrans: Diverse Tools Built on the
Annotation Graph Toolkit
Proceedings of the Third International Conference on Language Resources and
Evaluation
Available: arXiv.org
The DASL Project: a Case Study in Data Re-Annotation and
Re-Use
LREC 2002, Canary Islands, May 2002
Available: Paper
Corpora for Topic Detection and Tracking
James Allan, ed. Topic Detection and Tracking: Event-based Information
Organization, Kluwer International Series on Information Retrieval, Bruce
Croft, series editor, Boston, Kluwer Academic Publishers.
Research Methodologies, Observations and Outcomes in (Conversational) Speech
Data Collection
HLT 2002 The Human Language Technologies Conference, San Diego, CA,
March 2002
Available: Notebook
Paper.
Sharable Resources for Sociolinguistic Research
NWAV31, Stanford, 2002
Available: Presentation
Slides
An Integrated Framework for Treebanks and Multilayer Annotations
Proceedings of the Third International Conference on Language Resources and
Evaluation
Available: arXiv.org
Models and Tools for Collaborative Annotation
Proceedings of the Third International Conference on Language Resources and
Evaluation
Available: arXiv.org
Resources for Arabic Natural Language Processing
International Symposium on Processing Arabic, Tunis, April 2002
Available: Presentation
Slides
Creating Annotation Tools with the Annotation Graph Toolkit
Proceedings of the Third International Conference on Language Resources and
Evaluation
Available: arXiv.org
A Morphological Glossing Assistant
Proceedings of the International LREC Workshop on Resources and Tools in
Field Linguistics
Available: Paper
in PDF
Resources for Morphology Learning and Evaluation
LREC 2002: Third International Conference on Language Resources and
Evaluation vol. III, 967-974
Available: Paper
in PDF
Developing Infrastructure for the Evaluation of Single and
Multi-document Summarization Systems in a Multi-lingual Environment.
LREC 2002, Canary Islands, May 2002
Available: Paper
Resources and Infrastructure to Support Robust, Omnipresent
Communicator, SPINE, ROAR Workshop, Orlando, November 2001
Available: Presentation
Slides.
SPINE 2001 Data Preparation and Annotation and the SPINE Corpora
Communicator, SPINE, ROAR Workshop, Orlando, November 2001
Available: Presentation
Slides.
Annotation Graphs, Annotation Servers and Multi-Modal Resources:
Infrastructure for Interdisciplinary Education, Research and Development
Proceedings of the Association for Computational Linguistics: Workshop on
Sharing Tools & Resources, Toulouse, July 2001
Available: Paper in
PDF, Presentation
Slides.
Getting SMART about Authoring
CALICO 2001, University of Central Florida, Orlando, March 2001
Available: Presentation
Slides.
Switchboard Cellular Resources for Speaker Recognition
Speaker Recognition Workshop, Maritime Institute of Technology and
Graduate Studies, Linthicum MD, March 2001
Available: Presentation
Slides.
Shared Resources and Community Building for Corpus Linguistics and Language
Teaching
Corpus Linguistics and Language Teaching Workshop, Boston, MA, March
2001
Available: Presentation
Slides.
Data and Annotations for SocioLinguistics: A Corpus-Based Approach to
Sociolinguistic Research
Penn Linguistic Colloquium, Philadelphia, PA. March 2001
Available: Presentation
Slides.
A formal framework for linguistic annotation
Speech Communication 33(1,2), pp 23-60.
Available: arXiv.org
The Open Language Archives Community and Asian Language Resources
Proceedings of the Workshop on Language Resources in Asia, 6th Natural
Language Processing Pacific Rim Symposium (NLPRS), Tokyo, November 2001.
Available: arXiv.org
The OLAC Metadata Set and Controlled Vocabularies
Proceedings of the ACL Workshop on Sharing Tools and Resources for Research
and Education, Toulouse, July 2001, pp 7-18.
Available: arXiv.org
The Annotation Graph Toolkit: Software Components for Building Linguistic
Annotation Tools
Proceedings of HLT 2001 The Human Language Technologies Conference, San
Diego, CA, March 2001
Available: Paper in PDF
A Framework for Annotating Animal Bioacoustic Data
The 142nd Meeting of the Acoustical Society of America, Chicago, June
2001
Available: Presentation
Slides (PowerPoint).
Querying databases of annotated speech
Proceedings of the Eleventh Australasian Database Conference
Available: Paper in PDF
Multiple Annotation of Reusable Data Resources: Corpora for Topic Detection
and Tracking
In Rajman, M. and J. C. Chappelier, eds. (2000) Actes des 5es Journées
internationales d'analyse statistique des données textuelles, volume 1,
École Polytechnique Fédérale de Lausanne
Available: Paper in
PDF
Issues and Tools for Annotating a Corpus of Sociolinguistic Field Data
Linguistic Exploration Workshop in conjunction with
Linguistic Society of America Annual Meeting, Chicago, January 2000
Available: Presentation Slides
The TDT-3 Text and Speech Corpus
Presented at the Topic Detection and Tracking Workshop, Vienna,
Virginia, February 28 - March 1, 2000.
Available: Paper in
PostScript
Large Multilingual Broadcast News Corpora for Cooperative Research in Topic
Detection and Tracking: The TDT2 and TDT3 Corpus Efforts
In Proceedings of the Second International Language Resources and Evaluation
Conference, Athens, Greece, May 2000.
Available: Paper
in PDF
Issues in Corpus Creation and Distribution: the Evolution of the Linguistic
Data Consortium
In Proceedings of the Second International Language Resources and Evaluation
Conference, Athens, Greece, May 2000.
Available: Paper
in PDF
Many uses, many annotations for large speech corpora: Switchboard and TDT as
case studies
2nd Language Resources and Evaluation Conference (LREC 2000) Athens,
Greece, May 2000
Available: Paper in
PDF -- Paper in PostScript
Resources, New and Forthcoming, from LDC
Presented at the 2000 Speech Transcription Workshop, University of
Maryland, May 16-19, 2000.
Available: Presentation Slides
Quality Control in Large Annotation Projects Involving Multiple Judges: The
Case of the TDT Corpora.
In Proceedings of the Second International Language Resources and Evaluation
Conference, Athens, Greece, May 2000.
Available: Paper
in PDF
A Formal Framework for Linguistic Annotation
Technical Report MS-CIS-99-01 - Department of Computer and Information
Science, University of Pennsylvania
(expanded from version presented at ICSLP-98, Sydney)
Available: Paper in
PDF
Annotation graphs as a framework for multidimensional linguistic data
analysis
Towards Standards and Tools for Discourse Tagging -- Proceedings of the
Workshop, Somerset, NJ: Association for Computational Linguistics
Available: Paper in PDF
Multidimensional exploration of online linguistic field data
Proceedings of the 29th Annual Meeting of the Northeast Linguistics Society,
University of Massachusetts at Amherst.
Available: Paper in
PDF
Annotated Corpora in Linguistic Research
North American Symposium on Corpora in Linguistics and Language Teaching,
University of Michigan, May 21, 1999.
Available: Presentation Slides
Telephone Speech Corpora: New Needs, Languages, Methods and Technology
Presented at the Hub-5 Conversational Speech Understanding (LVCSR) Workshop,
Maritime Institute of Technology and Graduate Studies, Linthicum Heights,
Maryland, June 1999.
Available: Presentation Slides
The TDT-2 Text and Speech Corpus
Presented at the DARPA Broadcast News Workshop, Washington, DC,
February 1999.
Available: Paper
in PDF
This Ain't Your Father's Digital Data: Another Perspective on Legal
Information
Presented at CALI 1999 - The Conference for Law School Computing,
Eugene, Oregon, June 1999.
Available: Presentation Slides,
Video in RealMedia
Machine Translation Summit VII, September 13th, 1999, Kent Ridge Digital
Labs, National University of Singapore
Available: Paper in
Postscript, Paper
in PDF
Parallel Text Collections at the Linguistic Data Consortium
Machine Translation Summit
Available: Paper
in Postscript
Corpus Creation and Quality Control at the LDC
Presented at the Corpus of Spoken Dutch Workshop; Tilburg, Netherlands;
November 12, 1999.
Available: Presentation Slides
Corpus Sociolinguistics: Issues, Data and Tools
Presented at NWAVE-28, York University, Toronto, Ontario, October 1999.
Available: Presentation Slides
Towards a Formal Framework for Linguistic Annotations
Proceedings of the 5th International Conference on Spoken Language
Processing.
Available: Paper in
PDF
Topic Detection and Tracking Corpora
Presented at TREC/SDR Conference, Gaithersburg, Maryland, November 1998.
Available:
Update on Lexical Resources and Projects at the Linguistic Data Consortium
Presented at the Ninth Hub-5 Conversational Speech Recognition (LVCSR)
Workshop, Maritime Institute of Technology and Graduate Studies, Linthicum
Heights, Maryland, September 1998.
Available:
The Creation, Distribution and Use of Linguistic Data
Proceedings of the First International Conference on Language Resources and
Evaluation, Granada, Spain, May 1998.
Available: Paper in
PDF