Natural Language Processing

248 Publications

  •    

    A Crossing-Sensitive Third-Order Factorization for Dependency Parsing

    Emily Pitler

    Transactions of the Association for Computational Linguistics, vol. 2 (2014), pp. 41-54

  •   

    A Database for Measuring Linguistic Information Content.

    Richard Sproat, Bruno Cartoni, HyunJeong Choe, David Huynh, Linne Ha, Ravindran Rajakumar, Evelyn Wenzel-Grondie

    Language Resources and Evaluation Conference, ELDA, 330 W 58th St (2014)

  •    

    A Discriminative Latent Variable Model for Online Clustering

    Rajhans Samdani, Kai-Wei Chang, Dan Roth

    International Conference on Machine Learning (2014) (to appear)

  •    

    A New Entity Salience Task with Millions of Training Examples

    Dan Gillick, Jesse Dunietz

    Proceedings of the European Association for Computational Linguistics, Association for Computational Linguistics (2014) (to appear)

  •    

    A Scalable Gibbs Sampler for Probabilistic Entity Linking

    Neil Houlsby, Massimiliano Ciaramita

    Advances in Information Retrieval (ECIR 2014), Springer International Publishing, pp. 335-346

  •   

    Adapting taggers to Twitter with not-so-distant supervision

    Barbara Plank, Dirk Hovy, Anders Sogaard, Ryan McDonald

    International Conference on Computational Linguistics (2014)

  •   

    An Extension of BLANC to System Mentions

    Xiaoqiang Luo, Sameer Pradhan, Marta Recasens, Eduard Hovy

    Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Short Papers) (2014), pp. 24-29

  •   

    Applications of Maximum Entropy Rankers to Problems in Spoken Language Processing

    Richard Sproat, Keith Hall

    Interspeech 2014, International Speech Communications Association (to appear)

  •    

    Backoff Inspired Features for Maximum Entropy Language Models

    Fadi Biadsy, Keith Hall, Pedro Moreno, Brian Roark

    Proceedings of Interspeech, ISCA (2014)

  •    

    Bridging Text and Knowledge with Frames

    Srini Narayanan

    ACL Workshop on Frame Semantics (in honor of Charles FIllmore) (2014)

  •    

    Computer-aided quality assurance of an Icelandic pronunciation dictionary

    Martin Jansche

    LREC 2014, Reykjavik

  •   

    Constrained Arc-Eager Dependency Parsing

    Joakim Nivre, Yoav Goldberg, Ryan McDonald

    Computational Linguistics (2014)

  •    

    Discriminative pronunciation modeling for dialectal speech recognition

    Maider Lehr, Kyle Gorman, Izhak Shafran

    Proc. Interspeech (2014) (to appear)

  •   

    Enforcing Structural Diversity in Cube-pruned Dependency Parsing

    Hao Zhang, Ryan McDonald

    ACL (2014)

  •   

    Enhanced Search with Wildcards and Morphological Inflections in the Google Books Ngram Viewer

    Jason Mann, David Zhang, Lu Yang, Dipanjan Das, Slav Petrov

    Proceedings of the 52th Annual Meeting of the Association for Computational Linguistics (Demonstrations), Association for Computational Linguistics (2014)

  •    

    Frame-Semantic Parsing

    Dipanjan Das, Desai Chen, André F. T. Martins, Nathan Schneider, Noah A. Smith

    Computational Linguistics, vol. 40:1 (2014), pp. 9-56

  •   

    Great Question! Question Quality in Community Q&A

    Sujith Ravi, Bo Pang, Vibhor Rastogi, Ravi Kumar

    International AAAI Conference on Weblogs and Social Media (ICWSM) (2014)

  •    

    Hippocratic Abbreviation Expansion

    Brian Roark, Richard Sproat

    ACL, ACL (2014) (to appear)

  •   

    Learning Compact Lexicons for CCG Semantic Parsing

    Yoav Artzi, Dipanjan Das, Slav Petrov

    Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP '14)

  •   

    Modelling Events through Memory-based, Open-IE Patterns for Abstractive Summarization

    Daniele Pighin, Marco Cornolti, Enrique Alfonseca, Katja Filippova

    Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'14) (2014) (to appear)

  •  

    Opinion Mining on YouTube

    Aliaksei Severyn, Olga Uryupina, Barbara Plank, Alessandro Moschitti, Katja Filippova

    Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'14) (2014) (to appear)

  •    

    ParTes. Test Suite for Parsing Evaluation

    Marina Lloberes, Irene Castellón, Lluís Padró, Edgar Gonzàlez

    Procesamiento del Lenguaje Natural, vol. 53 (2014), pp. 87-94

  •   

    Parallel Algorithms for Unsupervised Tagging

    Sujith Ravi, Sergei Vassilivitskii, Vibhor Rastogi

    Transactions of the ACL (2014)

  •   

    Projecting the Knowledge Graph to Syntactic Parsing

    Andrea Gesmundo, Keith Hall

    EACL 2014: 15th Conference of the European Chapter of the Association for Computational Linguistics

  •   

    ReNoun: Fact Extraction for Nominal Attributes

    Mohamed Yahya, Steven Whang, Rahul Gupta, Alon Halevy

    EMNLP (2014) (to appear)

  •  

    SUIT: A Supervised User-Item based Topic model for Sentiment Analysis

    Fangtao Li, Sheng Wang, Shenghua Liu, Ming Zhang

    Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence (AAAI-14) (2014) (to appear)

  •   

    Scoring Coreference Partitions of Predicted Mentions: A Reference Implementation

    Sameer Pradhan, Xiaoqiang Luo, Marta Recasens, Eduard Hovy, Vincent Ng, Michael Strube

    Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Short Papers) (2014), pp. 30-35

  •    

    Semantic Frame Identification with Distributed Word Representations

    Karl Moritz Hermann, Dipanjan Das, Jason Weston, Kuzman Ganchev

    Proceedings of the 52th Annual Meeting of the Association for Computational Linguistics (2014)

  •    

    The SMAPH System for Query Entity Recognition and Disambiguation

    Marco Cornolti, Paolo Ferragina, Massimiliano Ciaramita, Stefan Rued, Hinrich Schuetze

    ERD 2014: Entity Recognition and Disambiguation Challenge. SIGIR Forum., ACM

  •   

    A Dataset of Syntactic-Ngrams over Time from a Very Large Corpus of English Books

    Yoav Goldberg, Jon Orwant

    Second Joint Conference on Lexical and Computational Semantics, Association for Computational Linguistics, Atlanta, Georgia, USA (2013), pp. 241-247

  •   

    Breaking Out of Local Optima with Count Transforms and Model Recombination: A Study in Grammar Induction

    Valentin I. Spitkovsky, Daniel Jurafsky, Hiyan Alshawi

    2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013; Best Paper Award)

  •  

    Connecting Language and Knowledge Bases with Embedding Models for Relation Extraction

    Jason Weston, Antoine Bordes, Oksana Yakhnenko, Nicolas Usunier

    Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing.

  •    

    Cross-Lingual Discriminative Learning of Sequence Models with Posterior Regularization

    Kuzman Ganchev, Dipanjan Das

    Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing

  •   

    Deceptive Answer Prediction with User Preference Graph

    Fangtao Li, Yang Gao, Shuchang Zhou, Xiance Si, Decheng Dai

    The 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013) (to appear)

  •    

    Efficient Estimation of Word Representations in Vector Space

    Tomas Mikolov, Kai Chen, Greg S. Corrado, Jeffrey Dean

    International Conference on Learning Representations (2013)

  •    

    Empirical Exploration of Language Modeling for the google.com Query Stream as Applied to Mobile Voice Search

    Ciprian Chelba, Johan Schalkwyk

    Mobile Speech and Advanced Natural Language Solutions, Springer Science+Business Media, New York (2013), pp. 197-229

  •    

    Enlisting the Ghost: Modeling Empty Categories for Machine Translation

    Bing Xiang, Xiaoqiang Luo, Bowen Zhou

    Proceedings of ACL, ACL (2013), pp. 822-831

  •   

    Filling Knowledge Base Gaps for Distant Supervision of Relation Extraction

    Wei Xu, Raphael Hoffmann, Le Zhao, Ralph Grishman

    ACL 2013

  •    

    HEADY: News headline abstraction through event pattern clustering

    Enrique Alfonseca, Daniele Pighin, Guillermo Garrido

    Proceedings of ACL-2013

  •   

    Hierarchical Geographical Modeling of User locations from Social Media Posts

    Amr Ahmed, Liangjie Hong, Alexander J Smola

    Proceedings of the 22nd International World Wide Web Conference (WWW 2013) (to appear)

  •    

    Identifying Phrasal Verbs Using Many Bilingual Corpora

    Karl Pichotta, John DeNero

    Proceedings of Empirical Methods in Natural Language Processing (2013)

  •   

    Language-Independent Discriminative Parsing of Temporal Expressions

    Gabor Angeli, Jakob Uszkoreit

    The 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013) (to appear)

  •    

    One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling

    Ciprian Chelba, Tomas Mikolov, Mike Schuster, Qi Ge, Thorsten Brants, Phillipp Koehn, Tony Robinson

    ArXiv, Google (2013)

  •   

    Online Learning for Inexact Hypergraph Search

    Hao Zhang, Liang Huang, Kai Zhao, Ryan McDonald

    Proc. of EMNLP (2013)

  •  

    Open-Domain Fine-Grained Class Extraction from Web Search Queries

    Marius Pasca

    Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-2013)

  •    

    Overcoming the Lack of Parallel Data in Sentence Compression

    Katja Filippova, Yasemin Altun

    Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP '13), pp. 1481-1491

  •    

    ReFr: An Open-Source Reranker Framework

    Daniel M. Bikel, Keith B. Hall

    Interspeech 2013, pp. 756-758

  •   

    Russian Stress Prediction using Maximum Entropy Ranking

    Keith Hall, Richard Sproat

    EMNLP, ACL (2013)

  •   

    Scalable Decipherment for Machine Translation via Hash Sampling

    Sujith Ravi

    Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL) (2013)

  •    

    Smoothed marginal distribution constraints for language modeling

    Brian Roark, Cyril Allauzen, Michael Riley

    Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL) (2013), pp. 43-52

  •    

    Speech and Natural Language: Where Are We Now And Where Are We Headed?

    Ciprian Chelba

    Mobile Voice Conference, San Francisco (2013)

  •   

    Summarization Through Submodularity and Dispersion

    Anirban Dasgupta, Ravi Kumar, Sujith Ravi

    Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL) (2013)

  •    

    Supervised Learning of Complete Morphological Paradigms

    Greg Durrett, John DeNero

    Proceedings of the North American Chapter of the Association for Computational Linguistics (2013)

  •    

    System and method for determining active topics

    Michael Jeffrey Procopio

    Patent (2013)

  •   

    Target Language Adaptation of Discriminative Transfer Parsers

    Oscar Tackstrom, Ryan McDonald, Joakim Nivre

    Proceedings of the North American Chapter of the Association for Computational Linguistics (2013)

  •   

    Token and Type Constraints for Cross-Lingual Part-of-Speech Tagging

    Oscar Tackstrom, Dipanjan Das, Slav Petrov, Ryan McDonald, Joakim Nivre

    Transactions of the Association for Computational Linguistics (TACL '13) (2013)

  •   

    Universal Dependency Annotation for Multilingual Parsing

    Ryan McDonald, Joakim Nivre, Yoav Goldberg, Yvonne Quirmbach-Brundage, Dipanjan Das, Kuzman Ganchev, Keith Hall, Slav Petrov, Hao Zhang, Oscar Tackstrom, Claudia Bedini, Nuria Bertomeu Castello, Jungmee Lee

    Association for Computational Linguistics (2013)

  •    

    WHAD: Wikipedia historical attributes data

    Enrique Alfonseca, Guillermo Garrido, Jean-Yves Delort, Anselmo Peñas

    Language Resources and Evaluation (2013), pp. 28

  •  

    Written-Domain Language Modeling for Automatic Speech Recognition

    Hasim Sak, Yun-hsuan Sung, Françoise Beaufays, Cyril Allauzen

    Interspeech (2013)

  •    

    A Class-Based Agreement Model For Generating Accurately Inflected Translations

    Spence Green, John DeNero

    50th Annual Meeting of the Association for Computational Linguistics (ACL 2012)

  •   

    A Comparison of Chinese Parsers for Stanford Dependencies

    Wanxiang Che, Valentin I. Spitkovsky, Ting Liu

    50th Annual Meeting of the Association for Computational Linguistics (ACL 2012)

  •   

    A Data-Driven Approach to Question Subjectivity Identification in Community Question Answering

    Tom Chao Zhou, Xiance Si, Edward Y., Irwin King, Michael R. Lyu

    Proceedings of the 26th AAAI Conference on Artificial Intelligence (AAAI-12) (2012)

  •    

    A Feature-Rich Constituent Context Model for Grammar Induction

    Dave Golland, John DeNero, Jakob Uszkoreit

    Proceedings of the Association for Computational Linguistics (2012)

  •   

    A Pushdown Transducer Extension for the OpenFst Library

    Cyril Allauzen, Michael Riley

    CIAA, Springer (2012), pp. 66-77

  •   

    A Universal Part-of-Speech Tagset

    Slav Petrov, Dipanjan Das, Ryan McDonald

    Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC '12) (2012)

  •  

    Attribute Extraction from Conjectural Queries

    Marius Pasca

    Proceedings of the 24th International Conference on Computational Linguistics (COLING-2012)

  •   

    Bootstrapping Dependency Grammar Inducers from Incomplete Sentence Fragments via Austere Models

    Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky

    11th International Conference on Grammatical Inference (ICGI 2012)

  •   

    Capitalization Cues Improve Dependency Grammar Induction

    Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky

    NAACL HLT 2012 Workshop on Inducing Linguistic Structure (WILS 2012)

  •   

    Cross-lingual Word Clusters for Direct Transfer of Linguistic Structure

    Oscar Tackstrom, Ryan McDonald, Jakob Uszkoreit

    North American Association for Computational Linguistics (2012)

  •   

    DualSum: A Topic-Model for Update Summarization

    Enrique Alfonseca, Jean-Yves Delort

    Proceedings of EACL-2012, Brandschenkestrasse 110

  •   

    Entity Disambiguation with Freebase

    Zhicheng Zheng, Xiance Si, Fangtao Li, Edward Y. Chang, Xiaoyan Zhu

    The 2012 IEEE/WIC/ACM International Conference on Web Intelligence (WI'2012) (to appear)

  •   

    Generalized Higher-Order Dependency Parsing with Cube Pruning

    Hao Zhang, Ryan McDonald

    EMNLP (2012)

  •   

    Hallucinated N-Best Lists for Discriminative Language Modeling

    Kenji Sagae, Maider Lehr, Emily Tucker Prud’hommeaux, Puyang Xu, Nathan Glenn, Damianos Karakos, Sanjeev Khudanpur, Brian Roark, Murat Saraçlar, Izhak Shafran, Daniel M. Bikel, Chris Callison-Burch, Yuan Cao, Keith Hall, Eva Hassler, Philipp Koehn, Adam Lopez, Matt Post, Darcey Riley

    Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2012)

  •   

    Haptic Voice Recognition Grand Challenge

    K. Sim, S. Zhao, K. Yu, H. Liao

    14th ACM International Conference on Multimodal Interaction. (2012)

  •    

    Improved Domain Adaptation for Statistical Machine Translation

    Wei Wang, Klaus Macherey, Wolfgang Macherey, Franz Och, Peng Xu

    AMTA-2012, The Association for Machine Translation in the Americas

  •  

    Instance-Driven Attachment of Semantic Annotations over Conceptual Hierarchies

    Janara Christensen, Marius Pasca

    Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL-2012), pp. 503-513

  •  

    Joint Learning of Words and Meaning Representations for Open-Text Semantic Parsing.

    Antoine Bordes, Xavier Glorot, Jason Weston, Yoshua Bengio

    AISTATS (2012)

  •    

    Language Modeling for Automatic Speech Recognition Meets the Web: Google Search by Voice

    Ciprian Chelba, Johan Schalkwyk, Boulos Harb, Carolina Parada, Cyril Allauzen, Leif Johnson, Michael Riley, Peng Xu, Preethi Jyothi, Thorsten Brants, Vida Ha, Will Neveitt

    University of Toronto (2012)

  •    

    Large Scale Language Modeling in Automatic Speech Recognition

    Ciprian Chelba, Dan Bikel, Maria Shugrina, Patrick Nguyen, Shankar Kumar

    Google (2012)

  •    

    Large-scale Discriminative Language Model Reranking for Voice Search

    Preethi Jyothi, Leif Johnson, Ciprian Chelba, Brian Strope

    Proceedings of the NAACL-HLT 2012 Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, Association for Computational Linguistics, pp. 41-49

  •  

    Multilingual Natural Language Processing Applications: From Theory to Practice

    Daniel M. Bikel, Imed Zitouni

    IBM Press (2012)

  •    

    Optimal Size, Freshness and Time-frame for Voice Search Vocabulary

    Maryam Kamvar, Ciprian Chelba

    Google (2012)

  •   

    Overview of the 2012 Shared Task on Parsing the Web

    Slav Petrov, Ryan McDonald

    Notes of the First Workshop on Syntactic Analysis of Non-Canonical Language (SANCL) (2012)

  •   

    Pattern Learning for Relation Extraction with Hierarchical Topic Models

    Enrique Alfonseca, Katja Filippova, Jean-Yves Delort, Guillermo Garrido

    Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL'12) (2012)

  •   

    Syntactic Annotations for the Google Books Ngram Corpus

    Yuri Lin, Jean-Baptiste Michel, Erez Lieberman Aiden, Jon Orwant, William Brockman, Slav Petrov

    Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Volume 2: Demo Papers (ACL '12) (2012)

  •   

    The OpenGrm Open-Source Finite-State Grammar Software Libraries

    Brian Roark, Richard Sproat, Cyril Allauzen, Michael Riley, Jeffrey Sorensen, Terry Tai

    ACL (System Demonstrations) (2012), pp. 61-66

  •   

    Three Dependency-and-Boundary Models for Grammar Induction

    Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky

    2012 Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL 2012)

  •    

    Unsupervised Translation Sense Clustering

    Mohit Bansal, John DeNero, Dekang Lin

    the North American Association of Computational Linguistics (2012)

  •   

    User Demographics and Language in an Implicit Social Network

    Katja Filippova

    Proceedings of the 2012 Conference on Empirical Methods in Natural Language Processing (EMNLP'12), Jeju, Korea

  •   

    Using Search-Logs to Improve Query Tagging

    Kuzman Ganchev, Keith B. Hall, Ryan McDonald, Slav Petrov

    Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Volume 2: Short Papers (ACL '12) (2012)

  •    

    Vine Pruning for Efficient Multi-Pass Dependency Parsing

    Alexander Rush, Slav Petrov

    The 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL '12), Best Paper Award

  •    

    A Tweet Consumers' Look At Twitter Trends

    Thomas Steiner, Arnaud Brousseau, Raphael Troncy

    Workshop Making Sense of Microposts (MSM 2011) at the Extended Semantic Web Conference (ESWC 2011), Heraklion, Crete

  •    

    Adding Meaning to Facebook Microposts via a Mash-up API and Tracking Its Data Provenance

    Thomas Steiner, Ruben Verborgh, Joaquim Gabarro, Rik Van de Walle

    The 7th International Conference on Next Generation Web Services Practices (NWeSP 2011)

  •   

    Analyzing and Integrating Dependency Parsers

    Ryan McDonald, Joakim Nivre

    Computational Linguistics, vol. 37 (2011)

  •  

    Asking What No One Has Asked Before: Using Phrase Similarities to Generate Synthetic Web Search Queries

    Marius Pasca

    Proceedings of the 20th ACM Conference on Information and Knowledge Management (CIKM-2011), ACM, Glasgow, Scotland, pp. 1347-1352

  •   

    Beam-Width Prediction for Efficient Context-Free Parsing

    Nathan Bodenstab, Aaron Dunlop, Keith Hall, Brian Roark

    Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics (2011)

  •   

    Binarized Forest to String Translation

    Hao Zhang, Licheng Fang, Peng Xu, Xiaoyun Wu

    ACL (2011), pp. 835-845

  •    

    Blognoon: Exploring a Topic in the Blogosphere

    Maria Grineva, Maxim Grinev, Dmitry Lizorkin, Alexander Boldakov, Denis Turdakov, Andrey Sysoe, Alexander Kiyko

    WWW 2011, ACM, New York, NY, USA, pp. 213-216

  •    

    Controlling Complexity in Part-of-Speech Induction

    Joao Graca, Kuzman Ganchev, Luisa Coheur, Fernando Pereira, Ben Taskar

    Journal of Artificial Intelligence Research (JAIR), vol. 41 (2011), pp. 527-551

  •   

    Corrective Dependency Parsing

    Keith B. Hall, Vaclav Novak

    Trends in Parsing Technologies, Springer (2011)

  •   

    Deterministic Statistical Mapping of Sentences to Underspecified Semantics

    Hiyan Alshawi, Pi-Chuan Chang, Michael Ringgaard

    Proceedings of the Ninth International Conference on Computational Semantics (IWCS 2011)

  •   

    Discovering fine-grained sentiment with latent variable structured prediction models

    Oscar Tackstrom, Ryan McDonald

    European Conference on Information Retrieval (2011)

  •   

    Efficient Parallel CKY Parsing on GPUs

    Youngmin Yi, Chao-Yue Lai, Slav Petrov, Kurt Keutzer

    Proceedings of the International Conference on Parsing Technologies (IWPT '11) (2011)

  •  

    Fine-Grained Class Label Markup of Search Queries

    Joseph Reisinger, Marius Pasca

    Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL-2011), pp. 1200-1209

  •   

    Gappy Phrasal Alignment by Agreement

    Mohit Bansal, Chris Quirk, Robert C. Moore

    Proc. 49th Annual Meeting of the Association for Computational Linguistics, ACL, Portland, Oregon (2011), pp. 1308-1317

  •   

    Improved Video Categorization from Text Metadata and User Comments

    Katja Filippova, Keith B. Hall

    Proceedings of the 34th international ACM SIGIR conference on Research and development in Information (SIGIR-2011), Beijing, China, pp. 835-842

  •    

    K2Q: Generating Natural Language Questions from Keywords with User Refinements

    Zhicheng Zheng, Xiance Si, Edward Y. Chang, Xiaoyan Zhu

    Proceedings of the 5th International Joint Conference on Natural Language Processing, ACL (2011), 947–955

  •    

    Large-Scale Cross-Document Coreference Using Distributed Inference and Hierarchical Models

    Sameer Singh, Amarnag Subramanya, Fernando Pereira, Andrew McCallum

    Association for Computational Linguistics (ACL) (2011)

  •    

    Learning to Rank Answers to Non-Factoid Questions from Web Collections

    Mihai Surdeanu, Massimiliano Ciaramita, Hugo Zaragoza

    Computational Linguistics, vol. 37 (2011), pp. 351-383

  •   

    Multi-Source Transfer of Delexicalized Dependency Parsers

    Ryan McDonald, Slav Petrov, Keith B. Hall

    Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP '11)

  •    

    Piggyback: Using Search Engines for Robust Cross-Domain Named Entity Recognition

    Stefan Rued, Massimiliano Ciaramita, Jens Mueller, Hinrich Schuetze

    49th Annual Meeting of the Association for Computational Linguistics (ACL-HLT), Association for Computational Linguistics (2011), pp. 965-975

  •    

    Posterior Sparsity in Dependency Grammar Induction

    Jennifer Gillenwater, Kuzman Ganchev, Joao Graca, Fernando Pereira, Ben Taskar

    Journal of Machine Learning Research, vol. 12 (2011), pp. 455-490

  •   

    Punctuation: Making a Point in Unsupervised Dependency Parsing

    Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky

    Fifteenth Conference on Computational Natural Language Learning (CoNLL-2011)

  •    

    Question Identification on Twitter, Accepted by CIKM 2011

    Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, Edward Y. Chang

    Proceedings of the 20th ACM international conference on Information and knowledge management, ACM, New York, NY, USA (2011)

  •  

    Ranking Class Labels Using Query Sessions

    Marius Pasca

    Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL-2011), pp. 1607-1615

  •   

    Semi-supervised Latent Variable Models for Fine-grained Sentiment Analysis

    Oscar Tackstrom, Ryan McDonald

    Association for Computational Linguistics (2011)

  •   

    Training Structured Prediction Models with Extrinsic Loss Functions

    Keith Hall, Ryan McDonald, Slav Petrov

    Domain Adaptation Workshop at NIPS 2011

  •    

    Training a Parser for Machine Translation Reordering

    Jason Katz-Brown, Slav Petrov, Ryan McDonald, Franz Och, David Talbot, Hiroshi Ichikawa, Masakazu Seno

    Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP '11)

  •   

    Training dependency parsers by jointly optimizing multiple objectives

    Keith B. Hall, Ryan McDonald, Jason Katz-Brown, Michael Ringgaard

    Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing

  •   

    Unsupervised Dependency Parsing without Gold Part-of-Speech Tags

    Valentin I. Spitkovsky, Hiyan Alshawi, Angel X. Chang, Daniel Jurafsky

    2011 Conference on Empirical Methods in Natural Language Processing (EMNLP 2011)

  •    

    Unsupervised Part-of-Speech Tagging with Bilingual Graph-Based Projections

    Dipanjan Das, Slav Petrov

    Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL '11) (2011), Best Paper Award

  •    

    A Comparison of Features for Automatic Readability Assessment

    Lijun Feng, Martin Jansche, Matt Huenerfauth, Noémie Elhadad

    23rd International Conference on Computational Linguistics (COLING 2010), Poster Volume, pp. 276-284

  •  

    A novel approach for proper name transliteration verification

    Ea-Ee Jan, Niyu Ge, Shih-Hsiang Lin, S. Roukos, J. Sorensen

    Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on, pp. 89 -94

  •  

    Acquisition of Instance Attributes via Labeled and Related Instances

    Enrique Alfonseca, Marius Pasca, Enrique Robledo-Arnuncio

    Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR-10) (2010)

  •    

    Building Transcribed Speech Corpora Quickly and Cheaply for Many Languages

    Thad Hughes, Kaisuke Nakajima, Linne Ha, Atul Vasu, Pedro Moreno, Mike LeBeau

    Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010), International Speech Communication Association, pp. 1914-1917

  •    

    Direct Construction of Compact Context-Dependency Transducers From Data

    David Rybach, Michael Riley

    Interspeech 2010, ISCA

  •   

    Distributed MAP Inference for Undirected Graphical Models

    Sameer Singh, Amarnag Subramanya, Fernando Pereira, Andrew McCallum

    Workshop on Learning on Cores, Clusters and Clouds (LCCC), Neural Information Processing Society (NIPS) (2010)

  •   

    Efficient Graph-Based Semi-Supervised Learning of Structured Tagging Models

    Amarnag Subramanya, Slav Petrov, Fernando Pereira

    Proceedings of the 2010 Conference on Empirical Methods on Natural Language Processing (EMNLP '10)

  •   

    Evaluation of Dependency Parsers on Unbounded Dependencies

    Joakim Nivre, Laura Rimell, Ryan McDonald, Carlos Gómez Rodríguez

    International Conference on Computational Linguistics (2010)

  •   

    Expected Sequence Similarity Maximization

    Cyril Allauzen, Shankar Kumar, Wolfgang Macherey, Mehryar Mohri, Michael Riley

    NAACL HLT (2010)

  •   

    Experiments in Graph-based Semi-Supervised Learning Methods for Class-Instance Acquisition

    Partha Pratim Talukdar, Fernando Pereira

    48th Annual Meeting of the Association for Computational Linguistics (ACL 2010)

  •   

    From Baby Steps to Leapfrog: How “Less is More” in Unsupervised Dependency Parsing

    Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky

    Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2010)

  •   

    Learning Better Monolingual Models with Unannotated Bilingual Text

    David Burkett, Slav Petrov, John Blitzer, Dan Klein

    Fourteenth Conference on Computational Natural Language Learning (CoNLL '10) (2010)

  •   

    Learning Dense Models of Query Similarity from User Click Logs

    Fabio De Bona, Stefan Riezler, Keith Hall, Massimiliano Ciaramita, Amac Herdagdelen, Maria Holmqvist

    Proceedings of NAACL-HLT 2010

  •  

    Lightly Supervised Learning of Text Normalization: Russian Number Names

    Richard Sproat

    IEEE Workshop on Spoken Language Technology, Berkeley, CA (2010) (to appear)

  •   

    Logical Leaps and Quantum Connectives: Forging Paths through Predication Space

    Trevor Cohen, Dominic Widdows, Roger W. Schvaneveldt, Thomas C. Rindflesch

    AAAI-Fall 2010 Symposium on Quantum Informatics for Cognitive, Social, and Semantic Processes. (to appear)

  •   

    Multi-Sentence Compression: Finding Shortest Paths in Word Graphs

    Katja Filippova

    Proceedings of the 23rd International Conference on Computational Linguistics (Coling'10) (2010)

  •   

    Products of Random Latent Variable Grammars

    Slav Petrov

    Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL/HLT '10) (2010)

  •   

    Profiting from Mark-Up: Hyper-Text Annotations for Guided Parsing

    Valentin I. Spitkovsky, Daniel Jurafsky, Hiyan Alshawi

    48th Annual Meeting of the Association for Computational Linguistics (ACL 2010)

  •    

    Proper Name Transcription/Transliteration with ICU Transforms

    Sascha Brawer, Martin Jansche, Hiroshi Takenaka, Yui Terashima

    34th Internationalization & Unicode Conference (2010)

  •    

    Query Language Modeling for Voice Search

    Ciprian Chelba, Johan Schalkwyk, Thorsten Brants, Vida Ha, Boulos Harb, Will Neveitt, Carolina Parada, Peng Xu

    Proceedings of the 2010 IEEE Workshop on Spoken Language Technology, IEEE, pp. 127-132

  •   

    Query Rewriting using Monolingual Statistical Machine Translation

    Stefan Riezler, Yi Liu

    Computational Linguistics, vol. 36 (2010)

  •   

    Self-training with Products of Latent Variable Grammars

    Zhongqiang Huang, Mary Harper, Slav Petrov

    Proceedings of the 2010 Conference on Empirical Methods on Natural Language Processing (EMNLP '10)

  •   

    Sparsity in Dependency Grammar Induction

    Jennifer Gillenwater, Kuzman Ganchev, João Graça, Fernando Pereira, Ben Taskar

    48th Annual Meeting of the Association for Computational Linguistics (ACL 2010)

  •   

    Speech Recognition for Mobiles Devices at Google

    Mike Schuster

    PRICAI 2010, Lecture Notes in Artificial Intelligence volume 6230, Springer, Heidelberg, pp. 8-10

  •    

    Study on Interaction between Entropy Pruning and Kneser-Ney Smoothing

    Ciprian Chelba, Thorsten Brants, Will Neveitt, Peng Xu

    Proceedings of Interspeech (2010), pp. 2242-2245

  •  

    The Role of Queries in Ranking Labeled Instances Extracted from Text

    Marius Pasca

    Proceedings of the 23rd International Conference on Computational Linguistics (COLING-2010), pp. 955-962

  •  

    The Role of Query Sessions in Extracting Instance Attributes from Web Search Queries

    Marius Pasca, Enrique Alfonseca, Enrique Robledo-Arnuncio, Ricardo Martin-Brualla, Keith Hall

    Proceedings of the 32nd European Conference on Information Retrieval (ECIR-2010), pp. 62-74

  •   

    The Semantic Vectors Package: New Algorithms and Public Tools for Distributional Semantics

    Dominic Widdows, Trevor Cohen

    Fourth IEEE International Conference on Semantic Computing (IEEE ICSC2010), IEEE

  •   

    The Viability of Web-derived Polarity Lexicons

    Leonid Velikovich, Sasha Blair-Goldensohn, Kerry Hannan, Ryan McDonald

    North American Chapter of the Association for Computational Linguistics (2010)

  •   

    Uptraining for Accurate Deterministic Question Parsing

    Slav Petrov, Pi-Chuan Chang, Michael Ringgaard, Hiyan Alshawi

    Proceedings of the 2010 Conference on Empirical Methods on Natural Language Processing (EMNLP '10)

  •   

    Using Web-scale N-grams to Improve Base NP Parsing Performance

    Emily Pitler, Shane Bergsma, Dekang Lin, Ken Church

    Proceedings of COLING (2010), 886–894

  •   

    Viterbi Training Improves Unsupervised Dependency Parsing

    Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky, Christopher D. Manning

    Fourteenth Conference on Computational Natural Language Learning (CoNLL-2010)

  •    

    Voice Search for Development

    Etienne Barnard, Johan Schalkwyk, Charl van Heerden, Pedro J. Moreno

    Interspeech 2010

  •   

    What’s great and what’s not: learning to classify the scope of negation for improved sentiment analysis

    Isaac Councill, Ryan McDonald, Leonid Velikovich

    Workshop on Negation and Speculation in Natural Language Processing (2010)

  •    

    A Generalized Composition Algorithm for Weighted Finite-State Transducers

    Cyril Allauzen, Michael Riley, Johan Schalkwyk

    Interspeech 2009

  •   

    A Panlingual Anomalous Text Detector

    Ashok C. Popat

    DocEng '09: Proceedings of the 9th ACM symposium on Document Engineering, ACM, New York (2009), pp. 201-204

  •   

    A Study on Similarity and Relatedness Using Distributional and WordNet-based Approaches

    Eneko Agirre, Enrique Alfonseca, Keith Hall, Jana Kravalova, Marius Pasca, Aitor Soroa

    Proceedings of NAACL-HLT 2009

  •   

    An Approach to Web-Scale Named-Entity Disambiguation

    Luís Sarmento, Alexander Kehlenbeck, Eugénio C. Oliveira, Lyle Ungar

    MLDM '09: Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition, Springer-Verlag, Berlin, Heidelberg (2009), pp. 689-703

  •   

    Automatic Adaptation of Annotation Standards: Chinese Word Segmentation and POS Tagging - A Case Study

    Wenbin Jiang, Liang Huang, Qun Liu

    Proceedings of ACL-IJCNLP (2009)

  •   

    Baby Steps: How “Less is More” in Unsupervised Dependency Parsing

    Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky

    NIPS 2009 Workshop on Grammar Induction, Representation of Language and Language Learning

  •   

    Back-off Language Model Compression

    Boulos Harb, Ciprian Chelba, Jeffrey Dean, Sanjay Ghemawat

    Proceedings of Interspeech 2009, International Speech Communication Association (ISCA), pp. 325-355

  •   

    Bilingually-Constrained (Monolingual) Shift-Reduce Parsing

    Liang Huang, Wenbin Jiang, Qun Liu

    Proceedings of EMNLP (2009), pp. 1222-1231

  •   

    Combining Language Modeling and Discriminative Classification for Word Segmentation

    Dekang Lin

    CICLing '09: Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing, Springer-Verlag, Berlin, Heidelberg (2009), pp. 170-182

  •   

    Contrastive summarization: An experiment with consumer reviews

    Kevin Lerman, Ryan McDonald

    North American Association for Computational Linguistics (2009)

  •   

    Dependency Parsing

    Sandra Kubler, Ryan McDonald, Joakim Nivre

    Morgan & Claypool (2009)

  •  

    Distributed language models

    Thorsten Brants, Peng Xu

    NAACL '09: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Tutorial Abstracts, Association for Computational Linguistics, Morristown, NJ, USA, pp. 3-4

  •  

    Finite-State Machines for Mining Patterns in Very Large Text Repositories

    Wojciech Skut

    Proceeding of the 2009 conference on Finite-State Methods and Natural Language Processing, IOS Press, Amsterdam, The Netherlands, The Netherlands, pp. 23-23

  •  

    Gazpacho and summer rash: lexical relationships from temporal patterns of web search queries

    Enrique Alfonseca, Massimiliano Ciaramita, Keith Hall

    Proceedings of the conference on Empirical Methods in Natural Language Processing (EMNLP) (2009)

  •   

    Generative and Discriminative Latent Variable Grammars

    Slav Petrov

    The Generative and Discriminative Learning Interface Workshop at NIPS 2009

  •   

    Glen, Glenda or Glendale: Unsupervised and Semi-supervised Learning of English Noun Gender

    Shane Bergsma, Dekang Lin, Randy Goebel

    CoNLL, Boulder, CO (2009)

  •  

    Integrating sentence- and word-level error identification for disfluency correction

    Erin Fitzgerald, Frederick Jelinek, Keith B. Hall

    EMNLP '09: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Morristown, NJ, USA, pp. 765-774

  •   

    Language modelling for what-with-where on GOOG-411

    Charl Van Heerden, Johan Schalkwyk, Brian Strope

    Proc. International Speech Communication Association (Interspeech 2009), pp. 991-994

  •   

    Large-scale Computation of Distributional Similarities for Queries

    Enrique Alfonseca, Keith Hall, Silvana Hartmann

    Proceedings of NAACL-HLT-2009

  •   

    Large-scale Semantic Networks: Annotation and Evaluation

    Vaclav Novak, Sven Hartrumpf, Keith B. Hall

    Proceedings of the Semantic Evaluations Workshop at NAACL-HLT (2009)

  •   

    Latent Variable Models of Concept-Attribute Attachment

    Joseph Reisinger, Marius Pasca

    Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics (ACL-IJCNLP-2009), pp. 620-628

  •  

    Low-Cost Supervision for Multiple-Source Attribute Extraction

    Joseph Reisinger, Marius Pasca

    Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing-2009), Mexico City, Mexico, pp. 382-393

  •    

    Named Entity Transcription with Pair n-Gram Models

    Martin Jansche, Richard Sproat

    2009 Named Entities Workshop: Shared Task on Transliteration (NEWS 2009), ACL-IJCNLP 2009, pp. 32-35

  •    

    OpenFst: An Open-Source, Weighted Finite-State Transducer Library and its Applications to Speech and Language

    Michael Riley, Cyril Allauzen, Martin Jansche

    Proceedings of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009 conference, Tutorials

  •  

    Outclassing Wikipedia in Open-Domain Information Extraction: Weakly-Supervised Acquisition of Attributes over Conceptual Hierarchies

    Marius Pasca

    Proceedings of the 12th Conference of the European Chapter of the Association of Computational Linguistics (EACL-2009), Athens, Greece, pp. 639-647

  •   

    Phrase Clustering for Discriminative Learning

    Dekang Lin, Xiaoyun Wu

    Proceedings of ACL/IJCNLP, Singapore (2009), pp. 1030-1038

  •    

    Posterior vs. Parameter Sparsity in Latent Variable Models

    Joao Graca, Kuzman Ganchev, Ben Taskar, Fernando Pereira

    Advances in Neural Information Processing Systems 22 (2009), pp. 664-672

  •   

    Randomized Pruning: Efficiently Calculating Expectations in Large Dynamic Programs

    Alexandre Bouchard-Côté, Slav Petrov, Dan Klein

    Advances in Neural Information Processing Systems 22 (NIPS '09) (2009)

  •   

    Reconstructing false start errors in spontaneous speech text

    Erin Fitzgerald, Keith B. Hall, Frederick Jelinek

    Proceedings of the European Chapter of the Association for Computational Linguistics (2009)

  •  

    Semantic Vector Combinations and the Synoptic Gospels

    Dominic Widdows, Trevor Cohen

    Third International Symposium on Quantum Interaction (2009)

  •  

    Semi-Supervised Polarity Lexicon Induction

    Delip Rao, Deepak Ravichandran

    EACL - 2009

  •   

    Sentiment Summarization: Evaluating and Learning User Preferences

    Kevin Lerman, Sasha Blair-Goldensohn, Ryan McDonald

    European Association for Computational Linguistics (2009)

  •   

    The CoNLL-2009 Shared Task: Syntactic and Semantic Dependencies in Multiple Languages

    Jan Hajič, Massimiliano Ciaramita, Richard Johansson, Daisuke Kawahara, Maria Antònia Martí, Lluís Màrquez, Adam Meyers, Joakim Nivre, Sebastian Padó, Jan Štepánek, Pavel Straňák, Mihai Surdeanu, Nianwen Xue, Yi Zhang

    Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL 2009): Shared Task, Association for Computational Linguistics, 209 N. Eight Street, Stroudsburg, PA 18360, pp. 1-18

  •  

    Using a dependency parser to improve SMT for subject-object-verb languages

    Peng Xu, Jaeho Kang, Michael Ringgaard, Franz Och

    NAACL '09: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Association for Computational Linguistics, Morristown, NJ, USA, pp. 245-253

  •   

    Using the web for language independent spellchecking and autocorrection

    Casey Whitelaw, Ben Hutchinson, Grace Y. Chung, Gerard Ellis

    EMNLP '09: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Morristown, NJ, USA, pp. 890-899

  •  

    Web-Derived Resources for Web Information Retrieval: From Conceptual Hierarchies to Attribute Hierarchies

    Marius Pasca, Enrique Alfonseca

    Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR-09), Boston, Massachusetts (2009), pp. 596-603

  •   

    Web-Scale N-gram Models for Lexical Disambiguation

    Shane Bergsma, Dekang Lin, Randy Goebel

    Proceedings of IJCAI, Los Angeles, CA (2009), pp. 1507-1512

  •   

    A Joint Model of Text and Aspect Ratings for Sentiment Summarization

    Ivan Titov, Ryan McDonald

    Association for Computational Linguistics (2008)

  •  

    Answering Definition Questions via Temporally-Anchored Text Snippets

    Marius Pasca

    Proceedings of the 3rd International Joint Conference on Natural Language Processing (IJCNLP-2008), Hyderabad, India, pp. 411-417

  •   

    Building a Sentiment Summarizer for Local Service Reviews

    Sasha Blair-Goldensohn, Kerry Hannan, Ryan McDonald, Tyler Neylon, George Reis, Jeff Reynar

    WWW Workshop on NLP Challenges in the Information Explosion Era (NLPIX) (2008)

  •  

    Decompounding query keywords from compounding languages

    Enrique Alfonseca, Slaven Bilac, Stefan Pharies

    Proceedings of ACL-2008

  •  

    Discriminative learning of selectional preference from unlabeled text

    Shane Bergsma, Dekang Lin, Randy Goebel

    EMNLP '08: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Morristown, NJ, USA (2008), pp. 59-68

  •   

    Distributional Identification of Non-Referential Pronouns

    Shane Bergsma, Dekang Lin, Randy Goebel

    Proceedings of ACL-08: HLT, Association for Computational Linguistics, Columbus, Ohio (2008), pp. 10-18

  •  

    Finding Cars, Goddesses and Enzymes: Parametrizable Acquisition of Labeled Instances for Open-Domain Information Extraction

    Benjamin Van Durme, Marius Pasca

    Proceedings of the 23rd Annual Conference on Artificial Intelligence (AAAI-2008), pp. 1243-1248

  •  

    German decompounding in a difficult corpus

    Enrique Alfonseca, Slaven Bilac, Stefan Pharies

    Proceedings of CICLING-2008, Lecture Notes in Computer Science, Springer, pp. 128-139

  •   

    Integrating Graph-based and Transition-based Dependency Parsers

    Joakim Nivre, Ryan McDonald

    Association for Computational Linguistics (2008)

  •  

    Large Scale Acquisition of Paraphrases for Learning Surface Patterns

    Rahul Bhagat, Deepak Ravichandran

    ACL-2008

  •   

    Randomized Language Models via Perfect Hash Functions

    David Talbot, Thorsten Brants

    Proceedings of ACL-08: HLT, Association for Computational Linguistics, Columbus, Ohio (2008), pp. 505-513

  •    

    Reading the Markets: Forecasting Public Opinion of Political Candidates by News Analysis

    Kevin Lerman, Ari Gilder, Mark Dredze, Fernando Pereira

    Conference on Computational Linguistics (Coling) (2008)

  •    

    Semantic Vector Products: Some Initial Investigations

    Dominic Widdows

    Proceedings of the Second AAAI Symposium on Quantum Interaction, AAAI (2008)

  •  

    Towards Temporal Web Search

    Marius Pasca

    Proceedings of the 23rd ACM Symposium on Applied Computing (SAC-2008), Fortaleza, Brazil, pp. 1117-1121

  •    

    Translating Queries into Snippets for Improved Query Expansion

    Stefan Riezler, Yi Liu, Alexander Vasserman

    Proceedings of the 22nd International Conference on Computational Linguistics (COLING'08), Manchester, England (2008)

  •  

    Turning Web Text and Search Queries into Factual Knowledge: Hierarchical Class Attribute Extraction

    Marius Pasca

    Proceedings of the 23rd Annual Conference on Artificial Intelligence (AAAI-2008), pp. 1225-1230

  •  

    Using Structured Text for Large-Scale Attribute Extraction

    Sujith Ravi, Marius Pasca

    Proceedings of the 17th ACM Conference on Information and Knowledge Management (CIKM-2008), pp. 1183-1192

  •   

    Weakly-Supervised Acquisition of Labeled Class Instances using Graph Random Walks

    Partha Pratim Talukdar, Joseph Reisinger, Marius Pasca, Deepak Ravichandran, Rahul Bhagat, Fernando Pereira

    Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-08), Association for Computational Linguistics, Honolulu, Hawaii (2008), pp. 582-590

  •  

    Weakly-Supervised Acquisition of Open-Domain Classes and Class Attributes from Web Documents and Query Logs

    Marius Pasca, Benjamin Van Durme

    Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-2008), pp. 19-27

  •   

    Web-scale named entity recognition

    Casey Whitelaw, Alex Kehlenbeck, Nemanja Petrovic, Lyle Ungar

    CIKM '08: Proceeding of the 17th ACM conference on Information and knowledge management, ACM, New York, NY, USA (2008), pp. 123-132

  •   

    Wide-Coverage Deep Statistical Parsing using Automatic Dependency Structure Annotation

    Aoife Cahill, Ruth O'Donovan, Josef van Genabith, Michael Burke, Stefan Riezler, Andy Way

    Computational Linguistics, vol. 34 (1) (2008), pp. 81-124

  •   

    A Study of Global Inference Algorithms in Multi-Document Summarization

    Ryan McDonald

    European Conference on Information Retrieval (ECIR) (2007)

  •   

    Characterizing the Errors of Data-Driven Dependency Parsers

    Ryan McDonald, Joakim Nivre

    Empirical Methods in Natural Language Processing (2007)

  •   

    Frustratingly Hard Domain Adaptation for Dependency Parsing

    Mark Dredze, John Blitzer, Partha Pratim Talukdar, Kuzman Ganchev, Jo~{a}o V. Graça, Fernando Pereira

    Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL 2007, pp. 1051-1055

  •    

    How difficult is it to develop a perfect spell-checker? A cross-linguistic analysis through complex network approach

    Monojit Choudhury, Markose Thomas, Animesh Mukherjee, Niloy Ganguly, Anupam Basu

    Textgraphs 2 Workshop, at HLT/NAACL, ACL (2007), pp. 8

  •   

    Inference in Text Understanding

    Peter Norvig

    AAAI Spring Symposium on Machine Reading (2007)

  •  

    Lightweight Web-Based Fact Repositories for Textual Question Answering

    Marius Pasca

    Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM-2007), Lisboa, Portugal, pp. 87-96

  •    

    N-Gram Statistical Similarities and Differences between Chinese and English

    Pei Cao, Stewart Yang, Hongjun Zhu

    First IEEE International Conference on Semantic Computing, IEEE (2007)

  •   

    On the Complexity of Non-Projective Data-Driven Dependency Parsing

    Ryan McDonald, Giorgio Satta

    International Conference on Parsing Technologies (2007), pp. 121-132

  •   

    Organizing and Searching the World Wide Web of Facts - Step Two: Harnessing the Wisdom of the Crowds

    Marius Pasca

    Proceedings of the 16th International World Wide Web Conference (WWW-07) (2007), pp. 101-110

  •  

    Reconocimiento de Entidades, Resolución de Correferencia y Extracción de Relaciones

    Enrique Alfonseca

    F. Verdejo (ed.), Acceso y viabilidad de la información multilingüe en la red: el rol de la semántica, Fundación Duques de Soria (2007)

  •  

    Simple training of dependency parsers via structured boosting

    Qin Iris Wang, Dekang Lin, Dale Schuurmans

    IJCAI'07: Proceedings of the 20th international joint conference on Artifical intelligence, Morgan Kaufmann Publishers Inc., San Francisco, CA, USA (2007), pp. 1756-1762

  •    

    Statistical Machine Translation for Query Expansion in Answer Retrieval

    Stefan Riezler, Alexander Vasserman, Ioannis Tsochantaridis, Vibhu Mittal, Yi Liu

    Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL'07), Prague, Czech Republic (2007)

  •   

    Structured Models for Fine-to-Coarse Sentiment Analysis

    Ryan McDonald, Kerry Hannan, Tyler Neylon, Mike Wells, Jeff Reynar

    45th Annual Meeting of the Association for Computational Linguistics (ACL 2007)

  •   

    The Role of Documents vs. Queries in Extracting Class Attributes from Text

    Marius Pasca, Benjamin Van Durme, Nikesh Garera

    Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM-2007), Lisboa, Portugal, pp. 485-494

  •   

    Weakly-Supervised Discovery of Named Entities Using Web Search Queries

    Marius Pasca

    Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM-2007), Lisboa, Portugal, pp. 683-690

  •  

    What You Seek is What You Get: Extraction of Class Attributes from Query Logs

    Marius Pasca, Benjamin Van Durme

    Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI-07) (2007), pp. 2832-2837

  •   

    A Context Pattern Induction Method for Named Entity Extraction

    Partha Pratim Talukdar, Thorsten Brants, Mark Liberman, Fernando Pereira

    Proceedings of CoNLL-X (2006), pp. 141-148

  •   

    Bootstrapping Path-Based Pronoun Resolution

    Shane Bergsma, Dekang Lin

    Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Sydney, Australia (2006), pp. 33-40

  •   

    Comparative Experiments on Sentiment Classification for Online Product Reviews

    Hang Cui, Vibhu Mittal, Mayur Datar

    Proceedings of the 21st National Conference on Artificial Intelligence, AAAI, Boston, MA (2006)

  •   

    Integrating probabilistic extraction models and data mining to discover relations and patterns in text

    Aron Culotta, Andrew McCallum, Jonathan Betz

    HLT-NAACL, New York, NY (2006), pp. 296-303

  •   

    Names and Similarities on the Web: Fact Extraction in the Fast Lane

    Marius Pasca, Dekang Lin, Jeffrey Bigham, Andrei Lifchits, Alpa Jain

    Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (COLING-ACL-06), Sydney, Australia (2006), pp. 809-816

  •  

    Organizing and Searching the World Wide Web of Facts - Step One: the One-Million Fact Extraction Challenge

    Marius Pasca, Dekang Lin, Jeffrey Bigham, Andrei Lifchits, Alpa Jain

    Proceedings of the 21st National Conference on Artificial Intelligence (AAAI-06), Boston, Massachusetts (2006), pp. 1400-1405

  •   

    Probabilistic Context-Free Grammar Induction Based on Structural Zeros

    Mehryar Mohri, Brian Roark

    Proceedings of the Seventh Meeting of the Human Language Technology conference - North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2006), New York, NY

  •   

    Soft Syntactic Constraints for Word Alignment through Discriminative Training

    Colin Cherry, Dekang Lin

    Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, Association for Computational Linguistics, Sydney, Australia, pp. 105-112

  •   

    Using Encyclopedic Knowledge for Named Entity Disambiguation

    Razvan Bunescu, Marius Pasca

    Proceedings of the 11th Conference of the European Chapter of the Association of Computational Linguistics (EACL-2006), Trento, Italy, pp. 9-16

  •   

    On a Common Fallacy in Computational Linguistics

    Mehryar Mohri, Richard Sproat

    A Man of Measure: Festschrift in Honour of Fred Karlsson on this 60th Birthday, SKY Journal of Linguistics, Volume 19 (2006), pp. 432-439

  •  

    Aligning Needles in a Haystack: Paraphrase Acquisition Across the Web

    Marius Pasca, Peter Dienes

    Proceedings of the 2nd International Joint Conference on Natural Language Processing (IJCNLP-2005), Jeju Island, Republic of Korea, pp. 119-130

  •  

    Finding Instance Names and Alternative Glosses on the Web: WordNet Reloaded

    Marius Pasca

    Proceedings of the 6th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing-2005), Mexico City, Mexico, pp. 280-292

  •   

    Local Grammar Algorithms

    Mehryar Mohri

    Inquiries into Words, Constraints, and Contexts. Festschrift in Honour of Kimmo Koskenniemi on his 60th Birthday, CSLI Publications, Stanford University (2005), pp. 84-93

  •  

    Mining Paraphrases from Self-Anchored Web Sentence Fragments

    Marius Pasca

    Proceedings of the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD-2005), Porto, Portugal, pp. 193-204

  •  

    Strictly lexical dependency parsing

    Qin Iris Wang, Dale Schuurmans, Dekang Lin

    Parsing '05: Proceedings of the Ninth International Workshop on Parsing Technology, Association for Computational Linguistics, Morristown, NJ, USA (2005), pp. 152-159

  •   

    Statistical Natural Language Processing

    Mehryar Mohri

    Applied Combinatorics on Words, Cambridge University Press (2005)

  •   

    The Design Principles and Algorithms of a Weighted Grammar Library

    Cyril Allauzen, Mehryar Mohri, Brian Roark

    International Journal of Foundations of Computer Science, vol. 16 (2005)

  •   

    Acquisition of Categorized Named Entities for Web Search

    Marius Pasca

    Proceedings of the 13th ACM Conference on Information and Knowledge Management (CIKM-04), Washington, D.C. (2004), pp. 137-145

  •   

    Searching the Web by Voice

    Alexander Franz, Brian Milch

    Proceedings of the 19th International Conference on Computational Linguistics (COLING) (2002), pp. 1213-1217