Natural Language Processing
Natural Language Processing (NLP) research at Google focuses on algorithms that apply at scale, across languages, and across domains. Our systems are used in numerous ways across Google, impacting user experience in search, mobile, apps, ads, translate and more.
Our work spans the range of traditional NLP tasks, with general-purpose syntax and semantic algorithms underpinning more specialized systems. We are particularly interested in algorithms that scale well and can be run efficiently in a highly distributed environment.
Our syntactic systems predict part-of-speech tags for each word in a given sentence, as well as morphological features such as gender and number. They also label relationships between words, such as subject, object, modification, and others. We focus on efficient algorithms that leverage large amounts of unlabeled data, and recently have incorporated neural net technology.
On the semantic side, we identify entities in free text, label them with types (such as person, location, or organization), cluster mentions of those entities within and across documents (coreference resolution), and resolve the entities to the Knowledge Graph.
Recent work has focused on incorporating multiple sources of knowledge and information to aid with analysis of text, as well as applying frame semantics at the noun phrase, sentence, and document level.
395 Publications
-
An efficient framework for learning sentence representations
Lajanugen Logeswaran, Honglak Lee
ICLR (2018) (to appear)
-
Ask the Right Questions: Active Question Reformulation with Reinforcement Learning
Christian Buck, Jannis Bulian, Massimiliano Ciaramita, Wojciech Paweł Gajewski, Andrea Gesmundo, Neil Houlsby, Wei Wang
Sixth International Conference on Learning Representations (2018)
-
Beyond word importance: using contextual decompositions to extract interactions from LSTMs.
Jamie Murdoch, Peter J. Liu, Bin Yu
ICLR (2018)
-
Crowdsourcing Ground Truth for Medical Relation Extraction
Anca Dumitrache, Chris Welty, Lora Aroyo
ACM Transactions on Interactive Intelligent Systems, vol. 8:1 (2018) (to appear)
-
Generating Wikipedia by Summarizing Long Sequences
Peter J. Liu, Mohammad Ahmad Saleh, Etienne Pot, Ben Goodrich, Ryan Sepassi, Lukasz Kaiser, Noam Shazeer
ICLR (2018)
-
Improving homograph disambiguation with supervised machine learning
Gleb Mazovetskiy, Kyle Gorman, Vitaly Nikolaev
LREC (2018) (to appear)
-
MaskGAN: Better Text Generation via Filling in the ____
Andrew Dai, Ian Goodfellow, Liam Fedus
ICLR (2018) (to appear)
-
Measuring and Mitigating Unintended Bias in Text Classification
Lucas Dixon, John Li, Jeffrey Sorensen, Nithum Thain, Lucy Vasserman
AAAI/ACM Conference on AI, Ethics, and Society (2018)
-
Natural TTS Synthesis By Conditioning WaveNet On Mel Spectrogram Predictions
Jonathan Shen, Ruoming Pang, Ron J. Weiss, Mike Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, RJ Skerry-Ryan, Rif A. Saurous, Yannis Agiomyrgiannakis, Yonghui Wu
ICASSP (2018)
-
SHAPED: Shared-Private Encoder-Decoder for Text Style Adaptation
Ye Zhang, Nan Ding, Radu Soricut
Proceedings of NAACL-HLT, ACL (2018) (to appear)
-
Daniel Cer, Yinfei Yang, Sheng-yi Kong, Nan Hua, Nicole Lyn Untalan Limtiaco, Rhomni St. John, Noah Constant, Mario Guajardo-Céspedes, Steve Yuan, Chris Tar, Yun-hsuan Sung, Brian Strope, Ray Kurzweil
In submission to: ACL demonstration, Association for Computational Linguistics, Melbourne, Australia (2018)
-
A Neural Architecture for Dialectal Arabic Segmentation
Younes Samih, Mohammed Attia, Mohamed Eldesouki, Hamdy Mubarak, Ahmed Abdelali, Laura Kallmeyer, Kareem Darwish
The Third Arabic Natural Language Processing Workshop (WANLP), Valencia, Spain (2017), pp. 46-54
-
Yin-Wen Chang, Michael Collins
Transactions of the Association for Computational Linguistics (TACL), vol. 5 (2017), pp. 59-71
-
An RNN Model of Text Normalization
Navdeep Jaitly, Richard Sproat
Interspeech 2017 (2017)
-
Analyza: Exploring Data with Conversation
Kedar Dhamdhere, Kevin McCurley, Mukund Sundararajan, Qiqi Yan, Ralfi Nahmias
Intelligent User Interfaces 2017, ACM, Limassol, Cyprus (to appear)
-
Analyzing Language Learned by an Active Question Answering Agent
Christian Buck, Jannis Bulian, Massimiliano Ciaramita, Wojciech Gajewski, Andrea Gesmundo, Neil Houlsby, Wei Wang
Emergent Communication Workshop @ NIPS (2017)
-
Approaches for Neural-Network Language Model Adaptation
Fadi Biadsy, Michael Alexander Nirschl, Min Ma, Shankar Kumar
Interspeech 2017, Stockholm, Sweden (2017)
-
Arabic Multi-Dialect Segmentation: bi-LSTM-CRF vs. SVM
Mohamed Eldesouki, Younes Samih, Ahmed Abdelali, Mohammed Attia, Hamdy Mubarak, Kareem Darwish, Laura Kallmeyer
arxiv.org 2017 (2017)
-
Areal and Phylogenetic Features for Multilingual Speech Synthesis
Alexander Gutkin, Richard Sproat
Proc. of Interspeech 2017, ISCA, August 20–24, 2017, Stockholm, Sweden, pp. 2078-2082
-
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin
NIPS (2017)
-
Avoiding Your Teacher's Mistakes: Training Neural Networks with Controlled Weak Supervision
Mostafa Dehghani, Aliaksei Severyn, Sascha Rothe, Jaap Kamps
arXiv (2017)
-
Better Text Understanding Through Image-To-Text Transfer
Karol Kurach, Sylvain Gelly, Michal Jastrzebski, Philip Haeusser, Olivier Teytaud, Damien Vincent, Olivier Bousquet
arXiv (2017)
-
CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies
Daniel Zeman, Martin Popel, Milan Straka, Jan Hajic, Joakim Nivre, Filip Ginter, Juhani Luotolahti, Sampo Pyysalo, Slav Petrov, Martin Potthast, Francis Tyers, et al.
Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies
-
Crafting a lexicon of referential expressions for NLG applications
Ariel Gutman, Alexandros Chaaraoui, Pascal Fleury
The 2017 Israeli Seminar of Computational Linguistics, Rachel and Selim Benin School of Computer Science and Engineering, Edmond J. Safra Campus, Jerusalem (2017)
-
Depthwise Separable Convolutions for Neural Machine Translation
Lukasz Kaiser, Aidan N. Gomez, Francois Chollet
arXiv (2017)
-
Efficient Natural Language Response Suggestion for Smart Reply
Matthew Henderson, Rami Al-Rfou, Brian Strope, Yun-hsuan Sung, László Lukács, Ruiqi Guo, Sanjiv Kumar, Balint Miklos, Ray Kurzweil
ArXiv e-prints (2017)
-
False Positive and Cross-relation Signals in Distant Supervision Data
Anca Dumitrache, Lora Aroyo, Chris Welty
NIPS-2017 Workshop - Automatic Knowledge-Base Construction, http://www.akbc.ws/2017/
-
Generating High-Quality and Informative Conversation Responses with Sequence-to-Sequence Models
Louis Shao, Stephan Gouws, Denny Britz, Anna Goldie, Brian Strope, Ray Kurzweil
EMNLP (2017)
-
German Typographers vs. German Grammar: Decomposition of Wikipedia Category Labels into Attribute-Value Pairs
Proceedings of the 10th International Conference on Web Search and Data Mining (WSDM-2017), pp. 315-324
-
Get To The Point: Summarization with Pointer-Generator Networks
Abigail See, Peter Liu, Christopher Manning
Association for Computational Linguistics (2017)
-
Identifying 1950s American Jazz Musicians: Fine-Grained IsA Extraction via Modifier Composition
Ellie Pavlick, Marius Pasca
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL-2017), pp. 2099-2109
-
Language Modeling in the Era of Abundant Data
AI With the Best online conference. (2017)
-
Learning Recurrent Span Representations for Extractive Question Answering
Kenton Lee, Shimi Salant, Tom Kwiatkowski, Ankur Parikh, Dipanjan Das, Jonathan Berant
arXiv 1611.01436 (2017)
-
Learning from Relatives: Unified Dialectal Arabic Segmentation
Younes Samih, Mohamed Eldesouki, Mohammed Attia, Ahmed Abdelali, Hamdy Mubarak, Kareem Darwish, Laura Kallmeyer
CONLL, Vancouver, Canada (2017)
-
Learning to Attend, Copy, and Generate for Session-Based Query Suggestion
Mostafa Dehghani, Sascha Rothe, Enrique Alfonseca, Pascal Fleury
CIKM 2017 (2017)
-
Adams Wei Yu, Hongrae Lee, Quoc V. Le
ACL (2017)
-
Multilingual Metaphor Processing: Experiments with Semi-Supervised and Unsupervised Learning
Ekaterina Shutova, Lin Sun, Dario Gutierrez, Patricia Lichtenstein, Srini Narayanan
Computational Linguistics (2017) (to appear)
-
N-gram Language Modeling using Recurrent Neural Network Estimation
Ciprian Chelba, Mohammad Norouzi, Samy Bengio
ArXiv, Google (2017)
-
Natural Language Processing with Small Feed-Forward Networks
Jan A. Botha, Emily Pitler, Ji Ma, Anton Bakalov, Alex Salcianu, David Weiss, Ryan Mcdonald, Slav Petrov
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Copenhagen, Denmark, 2879–2885
-
Neural Paraphrase Identification of Questions with Noisy Pretraining
Gaurav Singh Tomar, Thyago Duque, Oscar Täckström, Jakob Uszkoreit, Dipanjan Das
Proceedings of the First Workshop on Subword and Character Level Models in NLP (2017)
-
Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision
Chen Liang, Jonathan Berant, Quoc V. Le, Ken Forbus, Ni Lao
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Association for Computational Linguistics, Vancouver, Canada (2017), pp. 23-33
-
PoS, Morphology and Dependencies Annotation Guidelines for Arabic
Mohammed Attia, Ryan Mcdonald, Slav Petrov, Tolga Kayadelen
(2017)
-
SLING: A framework for frame semantic parsing
Michael Ringgaard, Rahul Gupta, Fernando C. N. Pereira
arXiv (2017), pp. 9
-
SemEval-2017 Task 1: Semantic Textual Similarity Multilingual and Crosslingual Focused Evaluation
Daniel Cer, Mona Diab, Eneko Agirre, Iñigo Lopez-Gazpio, Lucia Specia
Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), Association for Computational Linguistics, Vancouver, Canada, pp. 1-14
-
Shared Task Proposal: Multilingual Surface Realization Using Universal Dependency Trees
Anya Belz, Bernd Bohnet, Emily Pitler, Leo Wanner, Simone Mille
Proceedings of the 10th International Conference on Natural Language Generation, Association for Computational Linguistics (ACL), Santiago de Compostela, Spain (2017), pp. 120-123 (to appear)
-
Yin-Wen Chang, Michael Collins
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 1496–1500
-
To Plan or Not to Plan? Sequence to sequence generation for language generation in dialogue systems
Neha Nayak, Dilek Hakkani-Tur, Marilyn Walker, Larry Heck
INTERSPEECH 2017 (2017)
-
Towards better decoding and language model integration in sequence to sequence models
Jan Chorowski, Navdeep Jaitly
Interspeech (2017)
-
Transliterated mobile keyboard input via weighted finite-state transducers
Lars Hellsten, Brian Roark, Prasoon Goyal, Cyril Allauzen, Francoise Beaufays, Tom Ouyang, Michael Riley, David Rybach
Proceedings of the 13th International Conference on Finite State Methods and Natural Language Processing (FSMNLP) (2017)
-
Proc. of Interspeech 2017, ISCA, August 20–24, Stockholm, Sweden, pp. 2183-2187
-
Universal Semantic Parsing
Siva Reddy, Oscar Tackstrom, Slav Petrov, Mark Steedman, Mirella Lapata
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP)
-
A Decomposable Attention Model for Natural Language Inference
Ankur P. Parikh, Oscar Tackstrom, Dipanjan Das, Jakob Uszkoreit
Proceedings of EMNLP (2016)
-
A Piggyback System for Joint Entity Mention Detection and Linking in Web Queries
Hinrich Schuetze, Marco Cornolti, Massimiliano Ciaramita, Paolo Ferragina, Stefan Rued
WWW 2016
-
Annotating Topic Development in Information Seeking Queries
Marta Andersson, Adnan Öztürel, Silvia Pareti
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), European Language Resources Association (ELRA), Portorož, Slovenia
-
Building Large Machine Reading-Comprehension Datasets using Paragraph Vectors
Arxiv, https://arxiv.org/abs/1612.04342 (2016)
-
Building Statistical Parametric Multi-speaker Synthesis for Bangladeshi Bangla
Alexander Gutkin, Linne Ha, Martin Jansche, Oddur Kjartansson, Knot Pipatsrisawat, Richard Sproat
SLTU-2016 5th Workshop on Spoken Language Technologies for Under-resourced languages, 09-12 May 2016, Yogyakarta, Indonesia; Procedia Computer Science, Elsevier B.V., pp. 194-200
-
CogALex-V Shared Task: GHHH - Detecting Semantic Relations via Word Embeddings
Mohammed Attia, Suraj Maharjan, Younes Samih, Laura Kallmeyer, Thamar Solorio
CogALex-2016 Shared Task on the Corpus-Based Identification of Semantic Relations, Osaka, Japan (2016), pp. 86-91
-
Collective Entity Resolution with Multi-Focal Attention
Amir Globerson, Nevena Lazic, Soumen Chakrabarti, Amarnag Subramanya, Michael Ringaard, Fernando Pereira
ACL (2016)
-
Contextual LSTM: A Step towards Hierarchical Language Modeling
Shalini Ghosh, Oriol Vinyals, Brian Strope, Scott Roy, Tom Dean, Larry Heck
Workshop on Large-scale Deep Learning for Data Mining - KDD (2016) (to appear)
-
Conversational Contextual Cues: The Case of Personalization and History for Response Ranking
Rami Al-Rfou, Marc Pickett, Javier Snaider, Yun-hsuan Sung, Brian Strope
preprint (2016)
-
Cross-Lingual Morphological Tagging for Low-Resource Languages
Jan Buys, Jan A. Botha
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Association for Computational Linguistics, Berlin, Germany (2016), pp. 1954-1964
-
Cross-lingual projection for class-based language models
Beat Gfeller, Vlad Schogol, Keith Hall
ACL2016
-
Crowdsourcing a Gold Standard for Medical Relation Extraction with CrowdTruth
Anca Dumitrache, Chris Welty, Lora Aroyo
Proceedings of the 2016 Collective Intelligence Conference (to appear)
-
Distributed representation and estimation of WFST-based n-gram models
Cyril Allauzen, Michael Riley, Brian Roark
Proceedings of the ACL Workshop on Statistical NLP and Weighted Automata (StatFSM) (2016), pp. 32-41
-
Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic
Abdelati Hawwari, Mohammed Attia, Mahmoud Ghoneim, Mona Diab
LREC 2016, Arabic
-
Exploring the Steps of Verb Phrase Ellipsis
Zhengzhong Liu, Edgar Gonzàlez, Dan Gillick
Workshop on Coreference Resolution Beyond OntoNotes at NAACL 2016
-
Exploring the limits of language modeling
Rafal Jozefowicz, Oriol Vinyals, Mike Schuster, Noam Shazeer, Yonghui Wu
Google Inc. (2016)
-
Generalized Transition-based Dependency Parsing
Bernd Bohnet, Emily Pitler, Ji Ma, Ryan Mcdonald
Association for Computational Linguistics (ACL) (2016) (to appear)
-
Generating Sentences from a Continuous Space
Samuel R. Bowman, Luke Vilnis, Oriol Vinyals, Andrew M. Dai, Rafal Jozefowicz, Samy Bengio
CoNLL (2016)
-
Globally Normalized Transition-Based Neural Networks
Daniel Andor, Chris Alberti, David Weiss, Aliaksei Severyn, Alessandro Presta, Kuzman Ganchev, Slav Petrov, Michael Collins
Association for Computational Linguistics (2016)
-
ICON: Inferring Temporal Constraints from Natural Language API Descriptions
Rahul Pandita, Kunal Taneja, Teresa Tung, Laurie Williams
The International Conference on Software Maintenance and Evolution (2016)
-
Improving Chinese syntactic analysis through more consistent treebank annotation
Daisuke Kawahara, Mo Shen
ACL (2016), pp. 298-308
-
Latent Attention For If-Then Program Synthesis
Chang Liu, Dawn Song, Eui Chul Richard Shin, Mingcheng Chen, Xinyun Chen
Neural Information Processing Systems (2016)
-
Learning N-gram Language Models from Uncertain Data
Vitaly Kuznetsov, Hank Liao, Mehryar Mohri, Michael Riley, Brian Roark
Interspeech (2016)
-
Length bias in Encoder Decoder Models and a Case for Global Conditioning
Pavel Sountsov, Sunita Sarawagi
EMNLP (2016)
-
Linguistic Wisdom from the Crowd
Nancy Chang, Russell Lee-Goldman, Michael Tseng
Crowdsourcing Breakthroughs for Language Technology Applications, AAAI Technical Report WS-15-24 (2016)
-
Morpho-syntactic Lexicon Generation Using Graph-based Semi-supervised Learning
Manaal Faruqui, Ryan McDonald, Radu Soricut
TACL (2016)
-
Multilingual Code-switching Identification via LSTM Recurrent Neural Networks
Younes Samih, Suraj Maharjan, Mohammed Attia, Laura Kallmeyer, Thamar Solorio
Proceedings of the Second Workshop on Computational Approaches to Code Switching,, Austin, TX (2016), pp. 50-59
-
Multilingual Language Processing From Bytes
Dan Gillick, Cliff Brunk, Oriol Vinyals, Amarnag Subramanya
NAACL (2016)
-
Pynini: A Python library for weighted finite-state grammar compilation
Proceedings of the ACL Workshop on Statistical NLP and Weighted Automata (2016), pp. 75-80
-
Recent Advances in Google Real-time HMM-driven Unit Selection Synthesizer
Xavi Gonzalvo, Siamak Tazari, Chun-an Chan, Markus Becker, Alexander Gutkin, Hanna Silen
INTERSPEECH 2016, Sep 8-12, San Francisco, USA, pp. 2238-2242
-
Revisiting Taxonomy Induction over Wikipedia
Amit Gupta, Francesco Piccinno, Mikhail Kozhevnikov, Marius Pasca, Daniele Pighin
Proceedings of the 26th International Conference on Computational Linguistics (COLING-2016), pp. 2300-2309
-
Reward Augmented Maximum Likelihood for Neural Structured Prediction
Mohammad Norouzi, Samy Bengio, Zhifeng Chen, Navdeep Jaitly, Mike Schuster, Yonghui Wu, Dale Schuurmans
NIPS (2016)
-
SemEval-2016 Task 1: Semantic Textual Similarity, Monolingual and Cross-Lingual Evaluation
Eneko Agirre, Carmen Banea, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, Rada Mihalcea, German Rigau, Janyce Wiebe
Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016), Association for Computational Linguistics, San Diego, California, pp. 497-511
-
Semantic Model for Fast Tagging of Word Lattices
IEEE Spoken Language Technology (SLT) Workshop (2016) (to appear)
-
Semi-supervised Word Sense Disambiguation with Neural Models
Dayu Yuan, Julian Richardson, Ryan Doherty, Colin Evans, Eric Altendorf
COLING 2016
-
Sense Anaphoric Pronouns: Am I One?
Marta Recasens, Zhichao Hu, Olivia Rhinehart
Proceedings of the Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2016), pp. 1-6
-
Smart Reply: Automated Response Suggestion for Email
Anjuli Kannan, Karol Kurach, Sujith Ravi, Tobias Kaufman, Balint Miklos, Greg Corrado, Andrew Tomkins, Laszlo Lukacs, Marina Ganea, Peter Young, Vivek Ramavajjala
Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) (2016).
-
Sparse Non-negative Matrix Language Modeling
Joris Pelemans, Noam Shazeer, Ciprian Chelba
Transactions of the Association for Computational Linguistics, vol. 4 (2016), pp. 329-342
-
Sparse Non-negative Matrix Language Modeling (EMNLP presentation)
Joris Pelemans, Noam Shazeer, Ciprian Chelba
Association for Computational Linguistics
-
Stack-propagation: Improved Representation Learning for Syntax
David Weiss, Yuan Zhang
ACL2016
-
TTS for Low Resource Languages: A Bangla Synthesizer
Alexander Gutkin, Linne Ha, Martin Jansche, Knot Pipatsrisawat, Richard Sproat
10th edition of the Language Resources and Evaluation Conference, 23-28 May 2016, European Language Resources Association (ELRA), Portorož, Slovenia, pp. 2005-2010
-
The Power of Language Music: Arabic Lemmatization through Patterns
Mohammed Attia, Ayah Zirizkly, Mona Diab
Proceedings of the Workshop on Cognitive Aspects of the Lexicon, Osaka, Japan (2016), pp. 40-50
-
Transforming Dependency Structures to Logical Forms for Semantic Parsing
Siva Reddy, Oscar Täckström, Michael Collins, Tom Kwiatkowski, Dipanjan Das, Mark Steedman, Mirella Lapata
Transactions of the Association for Computational Linguistics, vol. 4 (2016)
-
Understanding Image and Text Simultaneously: a Dual Vision-Language Machine Comprehension Task
Nan Ding, Sebastian Goodman, Fei Sha, Radu Soricut
Arxiv, https://arxiv.org/abs/1612.07833 (2016)
-
Universal Dependencies v1: A Multilingual Treebank Collection
Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajic, Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, Daniel Zeman
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)
-
Virtual Adversarial Training for Semi-Supervised Text Classification
Takeru Miayto, Andrew M. Dai, Ian Goodfellow
arXiv preprint (2016)
-
A Computationally Efficient Algorithm for Learning Topical Collocation Models
Zhendong Zhao, Lan Du, Benjamin Borschinger, John K Pate, Massimiliano Ciaramita, Mark Steedman, Mark Johnson
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Association for Computational Linguistics, Beijing, China (2015), pp. 1460-1469
-
A Linear-Time Transition System for Crossing Interval Trees
NAACL (2015), 662–-671
-
Rohit Prabhavalkar, Raziel Alvarez, Carolina Parada, Preetum Nakkiran, Tara Sainath
Proceedings of International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE (2015), pp. 4704-4708
-
BilBOWA: Fast Bilingual Distributed Representations without Word Alignments
Stephan Gouws, Yoshua Bengio, Greg Corrado
Proceedings of the 32nd International Conference on Machine Learning (2015)
-
Composition-based on-the-fly rescoring for salient n-gram biasing
Keith Hall, Eunjoon Cho, Cyril Allauzen, Francoise Beaufays, Noah Coccaro, Kaisuke Nakajima, Michael Riley, Brian Roark, David Rybach, Linda Zhang
Interspeech 2015, International Speech Communications Association
-
Dissecting German Grammar and Swiss Passports: Open-Domain Decomposition of Compositional Entries in Large-Scale Knowledge Repositories
Marius Pasca, Hylke Buisman
Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI-2015), pp. 896-902
-
Document embedding with paragraph vectors
Andrew M. Dai, Christopher Olah, Quoc V. Le
NIPS Deep Learning Workshop (2015)
-
Efficient Inference and Structured Learning for Semantic Role Labeling
Oscar Täckström, Kuzman Ganchev, Dipanjan Das
Transactions of the Association for Computational Linguistics, vol. 3 (2015), pp. 29-41
-
Embedding methods for fine-grained entity type classification
Dani Yogatama, Dan Gillick, Nevena Lazic
(2015)
-
Fast k-best Sentence Compression
Katja Filippova, Enrique Alfonseca
arXiv (2015)
-
Geo-location for Voice Search Language Modeling
Ciprian Chelba, Xuedong Zhang, Keith Hall
Interspeech 2015, International Speech Communications Association, pp. 1438-1442
-
Oriol Vinyals, Lukasz Kaiser, Terry Koo, Slav Petrov, Ilya Sutskever, Geoffrey Hinton
NIPS (2015)
-
HEADS: Headline Generation as Sequence Prediction Using an Abstract Feature-Rich Space
Carlos A. Colmenares, Marina Litvak, Amin Mantrach, Fabrizio Silvestri
Human Language Technologies: The 2015 Annual Conference of the North American Chapter of the ACL (NAACL'15), pp. 133-142
-
Idest: Learning a Distributed Representation for Event Patterns
Sebastian Krause, Enrique Alfonseca, Katja Filippova, Daniele Pighin
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL'15), pp. 1140-1149
-
Improved Transition-Based Parsing and Tagging with Neural Networks
Chris Alberti, David Weiss, Greg Coppola, Slav Petrov
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP '15)
-
Improved recognition of contact names in voice commands
Petar Aleksic, Cyril Allauzen, David Elson, Aleks Kracun, Diego Melendo Casado, Pedro J. Moreno
ICASSP 2015
-
Interpreting Compound Noun Phrases Using Web Search Queries
Proceedings of the 2015 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-2015), pp. 335-344
-
Language Modeling in the Era of Abundant Data
Stanford Information Theory Forum (2015)
-
Long Short-Term Memory Language Models with Additive Morphological Features for Automatic Speech Recognition
Daniel Renshaw, Keith B. Hall
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2015)
-
Machine Learning for Dialog State Tracking: A Review
Proceedings of The First International Workshop on Machine Learning in Spoken Language Processing (2015)
-
Mining Subjective Properties on the Web
Immanuel Trummer, Alon Halevy, Hongrae Lee, Sunita Sarawagi, Rahul Gupta
SIGMOD (2015) (to appear)
-
Modeling the Lifespan of Discourse Entities with Application to Coreference Resolution
Marie-Catherine de Marneffe, Marta Recasens, Christopher Potts
Journal of Artificial Intelligence Research, vol. 52 (2015), pp. 445-475
-
Multilingual Open Relation Extraction Using Cross-lingual Projection
Manaal Faruqui, Shankar Kumar
Proceedings of NAACL (2015)
-
Multinomial Loss on Held-out Data for the Sparse Non-negative Matrix Language Model
Ciprian Chelba, Fernando Pereira
ArXiv, Google (2015)
-
Plato: A Selective Context Model for Entity Resolution
Nevena Lazic, Amarnag Subramanya, Michael Ringgaard, Fernando Pereira
Transactions of the Association for Computational Linguistics, vol. 3 (2015), pp. 503-515
-
Pruning Sparse Non-negative Matrix N-gram Language Models
Joris Pelemans, Noam M. Shazeer, Ciprian Chelba
Proceedings of Interspeech 2015, ISCA, pp. 1433-1437
-
Rapid Vocabulary Addition to Context-Dependent Decoder Graphs
Interspeech 2015
-
Refer-to-as Relations as Semantic Knowledge
Song Feng, Sujith Ravi, Ravi Kumar, Polina Kuznetsova, Wei Liu, Alex Berg, Tamara Berg, Yejin Choi
AAAI Conference on Artificial Intelligence (2015)
-
Resolving Discourse-Deictic Pronouns: A Two-Stage Approach to Do It
Sujay Kumar Jauhar, Raul D. Guerra, Edgar Gonzàlez Pellicer, Marta Recasens
Proceedings of the 4th Joint Conference on Lexical and Computational Semantics (*SEM 2015), pp. 299-308
-
Semantic Role Labeling with Neural Network Factors
Nicholas FitzGerald, Oscar Täckström, Kuzman Ganchev, Dipanjan Das
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP '15), Association for Computational Linguistics
-
Semi-supervised sequence learning
Advances in Neural Information Processing Systems, NIPS (2015)
-
Sentence Compression by Deletion with LSTMs
Katja Filippova, Enrique Alfonseca, Carlos Colmenares, Lukasz Kaiser, Oriol Vinyals
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP'15)
-
Sequence-based Class Tagging for Robust Transcription in ASR
Lucy Vasserman, Vlad Schogol, Keith Hall
Interspeech 2015, International Speech Communications Association (to appear)
-
Sparse Non-negative Matrix Language Modeling For Skip-grams
Noam M. Shazeer, Joris Pelemans, Ciprian Chelba
Proceedings of Interspeech 2015, ISCA, pp. 1428-1432
-
Structured Training for Neural Network Transition-Based Parsing
David Weiss, Chris Alberti, Michael Collins, Slav Petrov
Proceedings of the 53th Annual Meeting of the Association for Computational Linguistics (ACL '15) (2015)
-
Unsupervised Morphology Induction Using Word Embeddings
Radu Soricut, Franz Och
NAACL (2015)
-
Using Entity Information from a Knowledge Base to Improve Relation Extraction
Lan Du, Anish Kumar, M. Johnson, Massimiliano Ciaramita
Proceedings of the 13th annual workshop of The Australasian Language Technology Association, Association for Computational Linguistics (2015)
-
Using Recurrent Neural Networks for Slot Filling in Spoken Language Understanding
Grégoire Mesnil, Yann Dauphin, Kaisheng Yao, Yoshua Bengio, Li Deng, Dilek Hakkani-Tür, Xiaodong He, Larry Heck, Gokhan Tur, Dong Yu, Geoffrey Zweig
IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23 (2015), pp. 530-539
-
What’s Cookin’? Interpreting Cooking Videos using Text, Speech and Vision
Jonathan Malmaud, Jonathan Huang, Vivek Rathod, Nicholas Johnston, Andrew Rabinovich, Kevin Murphy
North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL HLT 2015) (to appear)
-
A Crossing-Sensitive Third-Order Factorization for Dependency Parsing
Transactions of the Association for Computational Linguistics, vol. 2 (2014), pp. 41-54
-
A Database for Measuring Linguistic Information Content.
Richard Sproat, Bruno Cartoni, HyunJeong Choe, David Huynh, Linne Ha, Ravindran Rajakumar, Evelyn Wenzel-Grondie
Language Resources and Evaluation Conference, ELDA, 330 W 58th St (2014)
-
A Discriminative Latent Variable Model for Online Clustering
Rajhans Samdani, Kai-Wei Chang, Dan Roth
International Conference on Machine Learning (2014) (to appear)
-
A New Entity Salience Task with Millions of Training Examples
Dan Gillick, Jesse Dunietz
Proceedings of the European Association for Computational Linguistics, Association for Computational Linguistics (2014)
-
A Scalable Gibbs Sampler for Probabilistic Entity Linking
Neil Houlsby, Massimiliano Ciaramita
Advances in Information Retrieval (ECIR 2014), Springer International Publishing, pp. 335-346
-
Acquisition of Noncontiguous Class Attributes from Web Search Queries
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL-2014), pp. 386-394
-
Acquisition of Open-Domain Classes via Intersective Semantics
Proceedings of the 23rd International World Wide Web Conference (WWW-2014), pp. 551-562
-
Adapting taggers to Twitter with not-so-distant supervision
Barbara Plank, Dirk Hovy, Anders Sogaard, Ryan McDonald
International Conference on Computational Linguistics (2014)
-
An Extension of BLANC to System Mentions
Xiaoqiang Luo, Sameer Pradhan, Marta Recasens, Eduard Hovy
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Short Papers) (2014), pp. 24-29
-
Applications of Maximum Entropy Rankers to Problems in Spoken Language Processing
Interspeech 2014, International Speech Communications Association
-
Backoff Inspired Features for Maximum Entropy Language Models
Fadi Biadsy, Keith Hall, Pedro Moreno, Brian Roark
Proceedings of Interspeech, ISCA (2014)
-
Bridging Text and Knowledge with Frames
ACL Workshop on Frame Semantics (in honor of Charles FIllmore) (2014)
-
Computer-aided quality assurance of an Icelandic pronunciation dictionary
LREC 2014, Reykjavik
-
Constrained Arc-Eager Dependency Parsing
Joakim Nivre, Yoav Goldberg, Ryan McDonald
Computational Linguistics (2014)
-
Context-Dependent Fine-Grained Entity Type Tagging
Dan Gillick, Nevena Lazic, Kuzman Ganchev, Jesse Kirchner, David Huynh
arXiv.org (2014)
-
Discriminative pronunciation modeling for dialectal speech recognition
Maider Lehr, Kyle Gorman, Izhak Shafran
Proc. Interspeech (2014) (to appear)
-
Enforcing Structural Diversity in Cube-pruned Dependency Parsing
ACL (2014)
-
Enhanced Search with Wildcards and Morphological Inflections in the Google Books Ngram Viewer
Jason Mann, David Zhang, Lu Yang, Dipanjan Das, Slav Petrov
Proceedings of the 52th Annual Meeting of the Association for Computational Linguistics (Demonstrations), Association for Computational Linguistics (2014)
-
Dipanjan Das, Desai Chen, André F. T. Martins, Nathan Schneider, Noah A. Smith
Computational Linguistics, vol. 40:1 (2014), pp. 9-56
-
Great Question! Question Quality in Community Q&A
Sujith Ravi, Bo Pang, Vibhor Rastogi, Ravi Kumar
International AAAI Conference on Weblogs and Social Media (ICWSM) (2014)
-
Hippocratic Abbreviation Expansion
ACL, ACL (2014)
-
Learning Compact Lexicons for CCG Semantic Parsing
Yoav Artzi, Dipanjan Das, Slav Petrov
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP '14)
-
Modelling Events through Memory-based, Open-IE Patterns for Abstractive Summarization
Daniele Pighin, Marco Cornolti, Enrique Alfonseca, Katja Filippova
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'14) (2014), pp. 892-901
-
Opinion Mining on YouTube
Aliaksei Severyn, Olga Uryupina, Barbara Plank, Alessandro Moschitti, Katja Filippova
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'14) (2014), pp. 1252-1261
-
ParTes. Test Suite for Parsing Evaluation
Marina Lloberes, Irene Castellón, Lluís Padró, Edgar Gonzàlez
Procesamiento del Lenguaje Natural, vol. 53 (2014), pp. 87-94
-
Parallel Algorithms for Unsupervised Tagging
Sujith Ravi, Sergei Vassilivitskii, Vibhor Rastogi
Transactions of the ACL (2014)
-
Projecting the Knowledge Graph to Syntactic Parsing
EACL 2014: 15th Conference of the European Chapter of the Association for Computational Linguistics
-
Pushdown automata in statistical machine translation
Cyril Allauzen, Bill Byrne, Adrià de Gispert, Gonzalo Iglesias, Michael Riley
Computational Linguistics, vol. 40 (2014), pp. 687-723
-
Queries as a Source of Lexicalized Commonsense Knowledge
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-2014), pp. 1081-1091
-
ReNoun: Fact Extraction for Nominal Attributes
Mohamed Yahya, Steven Whang, Rahul Gupta, Alon Halevy
Proc. 2014 Conf. on Empirical Methods in Natural Language Processing (EMNLP)
-
SUIT: A Supervised User-Item based Topic model for Sentiment Analysis
Fangtao Li, Sheng Wang, Shenghua Liu, Ming Zhang
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence (AAAI-14) (2014) (to appear)
-
Scoring Coreference Partitions of Predicted Mentions: A Reference Implementation
Sameer Pradhan, Xiaoqiang Luo, Marta Recasens, Eduard Hovy, Vincent Ng, Michael Strube
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Short Papers) (2014), pp. 30-35
-
Semantic Frame Identification with Distributed Word Representations
Karl Moritz Hermann, Dipanjan Das, Jason Weston, Kuzman Ganchev
Proceedings of the 52th Annual Meeting of the Association for Computational Linguistics (2014)
-
Skip-gram Language Modeling Using Sparse Non-negative Matrix Probability Estimation
Noam M. Shazeer, Joris Pelemans, Ciprian Chelba
Google (2014)
-
The SMAPH System for Query Entity Recognition and Disambiguation
Marco Cornolti, Paolo Ferragina, Massimiliano Ciaramita, Stefan Rued, Hinrich Schuetze
ERD 2014: Entity Recognition and Disambiguation Challenge. SIGIR Forum., ACM
-
A Dataset of Syntactic-Ngrams over Time from a Very Large Corpus of English Books
Yoav Goldberg, Jon Orwant
Second Joint Conference on Lexical and Computational Semantics, Association for Computational Linguistics, Atlanta, Georgia, USA (2013), pp. 241-247
-
Breaking Out of Local Optima with Count Transforms and Model Recombination: A Study in Grammar Induction
Valentin I. Spitkovsky, Daniel Jurafsky, Hiyan Alshawi
2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013; Best Paper Award)
-
Connecting Language and Knowledge Bases with Embedding Models for Relation Extraction
Jason Weston, Antoine Bordes, Oksana Yakhnenko, Nicolas Usunier
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing.
-
Cross-Lingual Discriminative Learning of Sequence Models with Posterior Regularization
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
-
Deceptive Answer Prediction with User Preference Graph
Fangtao Li, Yang Gao, Shuchang Zhou, Xiance Si, Decheng Dai
The 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013) (to appear)
-
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, Jeffrey Dean
Neural and Information Processing System (NIPS) (2013)
-
Efficient Estimation of Word Representations in Vector Space
Tomas Mikolov, Kai Chen, Greg S. Corrado, Jeffrey Dean
International Conference on Learning Representations (2013)
-
Ciprian Chelba, Johan Schalkwyk
Mobile Speech and Advanced Natural Language Solutions, Springer Science+Business Media, New York (2013), pp. 197-229
-
Enlisting the Ghost: Modeling Empty Categories for Machine Translation
Bing Xiang, Xiaoqiang Luo, Bowen Zhou
Proceedings of ACL, ACL (2013), pp. 822-831
-
Filling Knowledge Base Gaps for Distant Supervision of Relation Extraction
Wei Xu, Raphael Hoffmann, Le Zhao, Ralph Grishman
ACL 2013
-
Grounded compositional semantics for finding and describing images with sentences
Richard Socher, Andrej Karpathy, Quoc V. Le, Chris D. Manning, Andrew Y. Ng
Transactions of the Association for Computational Linguistics (2013) (to appear)
-
HEADY: News headline abstraction through event pattern clustering
Enrique Alfonseca, Daniele Pighin, Guillermo Garrido
Proceedings of ACL-2013
-
Hierarchical Geographical Modeling of User locations from Social Media Posts
Amr Ahmed, Liangjie Hong, Alexander J Smola
Proceedings of the 22nd International World Wide Web Conference (WWW 2013) (to appear)
-
Identifying Phrasal Verbs Using Many Bilingual Corpora
Karl Pichotta, John DeNero
Proceedings of Empirical Methods in Natural Language Processing (2013)
-
Language Model Verbalization for Automatic Speech Recognition
Hasim Sak, Françoise Beaufays, Kaisuke Nakajima, Cyril Allauzen
Proc ICASSP, IEEE (2013)
-
Language-Independent Discriminative Parsing of Temporal Expressions
Gabor Angeli, Jakob Uszkoreit
The 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013) (to appear)
-
Mixture of mixture n-gram language models
Hasim Sak, Cyril Allauzen, Kaisuke Nakajima, Françoise Beaufays
ASRU (2013), pp. 31-36
-
One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling
Ciprian Chelba, Tomas Mikolov, Mike Schuster, Qi Ge, Thorsten Brants, Phillipp Koehn, Tony Robinson
ArXiv, Google (2013)
-
Online Learning for Inexact Hypergraph Search
Hao Zhang, Liang Huang, Kai Zhao, Ryan McDonald
Proc. of EMNLP (2013)
-
Open-Domain Fine-Grained Class Extraction from Web Search Queries
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-2013), pp. 403-414
-
Overcoming the Lack of Parallel Data in Sentence Compression
Katja Filippova, Yasemin Altun
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP '13), pp. 1481-1491
-
ReFr: An Open-Source Reranker Framework
Daniel M. Bikel, Keith B. Hall
Interspeech 2013, pp. 756-758
-
Russian Stress Prediction using Maximum Entropy Ranking
EMNLP, ACL (2013)
-
Scalable Decipherment for Machine Translation via Hash Sampling
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL) (2013)
-
Smoothed marginal distribution constraints for language modeling
Brian Roark, Cyril Allauzen, Michael Riley
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL) (2013), pp. 43-52
-
Speech and Natural Language: Where Are We Now And Where Are We Headed?
Mobile Voice Conference, San Francisco (2013)
-
Summarization Through Submodularity and Dispersion
Anirban Dasgupta, Ravi Kumar, Sujith Ravi
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL) (2013)
-
Supervised Learning of Complete Morphological Paradigms
Greg Durrett, John DeNero
Proceedings of the North American Chapter of the Association for Computational Linguistics (2013)
-
System and method for determining active topics
Patent (2013)
-
Target Language Adaptation of Discriminative Transfer Parsers
Oscar Tackstrom, Ryan McDonald, Joakim Nivre
Proceedings of the North American Chapter of the Association for Computational Linguistics, Association for Computational Linguistics (2013)
-
Token and Type Constraints for Cross-Lingual Part-of-Speech Tagging
Oscar Tackstrom, Dipanjan Das, Slav Petrov, Ryan McDonald, Joakim Nivre
Transactions of the Association for Computational Linguistics (2013), 1–-12
-
Universal Dependency Annotation for Multilingual Parsing
Ryan McDonald, Joakim Nivre, Yoav Goldberg, Yvonne Quirmbach-Brundage, Dipanjan Das, Kuzman Ganchev, Keith Hall, Slav Petrov, Hao Zhang, Oscar Tackstrom, Claudia Bedini, Nuria Bertomeu Castello, Jungmee Lee
Association for Computational Linguistics, Association for Computational Linguistics (2013)
-
WHAD: Wikipedia historical attributes data
Enrique Alfonseca, Guillermo Garrido, Jean-Yves Delort, Anselmo Peñas
Language Resources and Evaluation (2013), pp. 28
-
Written-Domain Language Modeling for Automatic Speech Recognition
Hasim Sak, Yun-hsuan Sung, Françoise Beaufays, Cyril Allauzen
Interspeech (2013)
-
A Class-Based Agreement Model For Generating Accurately Inflected Translations
Spence Green, John DeNero
50th Annual Meeting of the Association for Computational Linguistics (ACL 2012)
-
A Comparison of Chinese Parsers for Stanford Dependencies
Wanxiang Che, Valentin I. Spitkovsky, Ting Liu
50th Annual Meeting of the Association for Computational Linguistics (ACL 2012)
-
A Data-Driven Approach to Question Subjectivity Identification in Community Question Answering
Tom Chao Zhou, Xiance Si, Edward Y., Irwin King, Michael R. Lyu
Proceedings of the 26th AAAI Conference on Artificial Intelligence (AAAI-12) (2012)
-
A Feature-Rich Constituent Context Model for Grammar Induction
Dave Golland, John DeNero, Jakob Uszkoreit
Proceedings of the Association for Computational Linguistics (2012)
-
A Pushdown Transducer Extension for the OpenFst Library
CIAA, Springer (2012), pp. 66-77
-
A Universal Part-of-Speech Tagset
Slav Petrov, Dipanjan Das, Ryan McDonald
Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC '12) (2012)
-
Attribute Extraction from Conjectural Queries
Proceedings of the 24th International Conference on Computational Linguistics (COLING-2012), pp. 2177-2190
-
Bootstrapping Dependency Grammar Inducers from Incomplete Sentence Fragments via Austere Models
Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky
11th International Conference on Grammatical Inference (ICGI 2012)
-
Capitalization Cues Improve Dependency Grammar Induction
Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky
NAACL HLT 2012 Workshop on Inducing Linguistic Structure (WILS 2012)
-
Cross-lingual Word Clusters for Direct Transfer of Linguistic Structure
Oscar Tackstrom, Ryan McDonald, Jakob Uszkoreit
North American Association for Computational Linguistics, Association for Computational Linguistics (2012)
-
DualSum: A Topic-Model for Update Summarization
Enrique Alfonseca, Jean-Yves Delort
Proceedings of EACL-2012, Brandschenkestrasse 110
-
Entity Disambiguation with Freebase
Zhicheng Zheng, Xiance Si, Fangtao Li, Edward Y. Chang, Xiaoyan Zhu
The 2012 IEEE/WIC/ACM International Conference on Web Intelligence (WI'2012) (to appear)
-
Generalized Higher-Order Dependency Parsing with Cube Pruning
EMNLP (2012)
-
Hallucinated N-Best Lists for Discriminative Language Modeling
Kenji Sagae, Maider Lehr, Emily Tucker Prud’hommeaux, Puyang Xu, Nathan Glenn, Damianos Karakos, Sanjeev Khudanpur, Brian Roark, Murat Saraçlar, Izhak Shafran, Daniel M. Bikel, Chris Callison-Burch, Yuan Cao, Keith Hall, Eva Hassler, Philipp Koehn, Adam Lopez, Matt Post, Darcey Riley
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2012)
-
Haptic Voice Recognition Grand Challenge
K. Sim, S. Zhao, K. Yu, H. Liao
14th ACM International Conference on Multimodal Interaction. (2012)
-
Instance-Driven Attachment of Semantic Annotations over Conceptual Hierarchies
Janara Christensen, Marius Pasca
Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL-2012), pp. 503-513
-
Joint Learning of Words and Meaning Representations for Open-Text Semantic Parsing.
Antoine Bordes, Xavier Glorot, Jason Weston, Yoshua Bengio
AISTATS (2012)
-
Language Modeling for Automatic Speech Recognition Meets the Web: Google Search by Voice
Ciprian Chelba, Johan Schalkwyk, Boulos Harb, Carolina Parada, Cyril Allauzen, Leif Johnson, Michael Riley, Peng Xu, Preethi Jyothi, Thorsten Brants, Vida Ha, Will Neveitt
University of Toronto (2012)
-
Large Scale Language Modeling in Automatic Speech Recognition
Ciprian Chelba, Dan Bikel, Maria Shugrina, Patrick Nguyen, Shankar Kumar
Google (2012)
-
Large-scale Discriminative Language Model Reranking for Voice Search
Preethi Jyothi, Leif Johnson, Ciprian Chelba, Brian Strope
Proceedings of the NAACL-HLT 2012 Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, Association for Computational Linguistics, pp. 41-49
-
Multilingual Natural Language Processing Applications: From Theory to Practice
Daniel M. Bikel, Imed Zitouni
IBM Press (2012)
-
Optimal Size, Freshness and Time-frame for Voice Search Vocabulary
Google (2012)
-
Overview of the 2012 Shared Task on Parsing the Web
Notes of the First Workshop on Syntactic Analysis of Non-Canonical Language (SANCL) (2012)
-
Pattern Learning for Relation Extraction with Hierarchical Topic Models
Enrique Alfonseca, Katja Filippova, Jean-Yves Delort, Guillermo Garrido
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL'12) (2012)
-
Syntactic Annotations for the Google Books Ngram Corpus
Yuri Lin, Jean-Baptiste Michel, Erez Lieberman Aiden, Jon Orwant, William Brockman, Slav Petrov
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Volume 2: Demo Papers (ACL '12) (2012)
-
The OpenGrm Open-Source Finite-State Grammar Software Libraries
Brian Roark, Richard Sproat, Cyril Allauzen, Michael Riley, Jeffrey Sorensen, Terry Tai
ACL (System Demonstrations) (2012), pp. 61-66
-
Three Dependency-and-Boundary Models for Grammar Induction
Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky
2012 Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL 2012)
-
Unsupervised Translation Sense Clustering
Mohit Bansal, John DeNero, Dekang Lin
the North American Association of Computational Linguistics (2012)
-
User Demographics and Language in an Implicit Social Network
Proceedings of the 2012 Conference on Empirical Methods in Natural Language Processing (EMNLP'12), Jeju, Korea
-
Using Search-Logs to Improve Query Tagging
Kuzman Ganchev, Keith B. Hall, Ryan McDonald, Slav Petrov
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Volume 2: Short Papers (ACL '12) (2012)
-
Vine Pruning for Efficient Multi-Pass Dependency Parsing
Alexander Rush, Slav Petrov
The 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL '12), Best Paper Award
-
A Tweet Consumers' Look At Twitter Trends
Thomas Steiner, Arnaud Brousseau, Raphael Troncy
Workshop Making Sense of Microposts (MSM 2011) at the Extended Semantic Web Conference (ESWC 2011), Heraklion, Crete
-
Adding Meaning to Facebook Microposts via a Mash-up API and Tracking Its Data Provenance
Thomas Steiner, Ruben Verborgh, Joaquim Gabarro, Rik Van de Walle
The 7th International Conference on Next Generation Web Services Practices (NWeSP 2011)
-
Analyzing and Integrating Dependency Parsers
Ryan McDonald, Joakim Nivre
Computational Linguistics, vol. 37 (2011)
-
Asking What No One Has Asked Before: Using Phrase Similarities to Generate Synthetic Web Search Queries
Proceedings of the 20th ACM Conference on Information and Knowledge Management (CIKM-2011), ACM, Glasgow, Scotland, pp. 1347-1352
-
Beam-Width Prediction for Efficient Context-Free Parsing
Nathan Bodenstab, Aaron Dunlop, Keith Hall, Brian Roark
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics (2011)
-
Binarized Forest to String Translation
Hao Zhang, Licheng Fang, Peng Xu, Xiaoyun Wu
ACL (2011), pp. 835-845
-
Blognoon: Exploring a Topic in the Blogosphere
Maria Grineva, Maxim Grinev, Dmitry Lizorkin, Alexander Boldakov, Denis Turdakov, Andrey Sysoe, Alexander Kiyko
WWW 2011, ACM, New York, NY, USA, pp. 213-216
-
Controlling Complexity in Part-of-Speech Induction
Joao Graca, Kuzman Ganchev, Luisa Coheur, Fernando Pereira, Ben Taskar
Journal of Artificial Intelligence Research (JAIR), vol. 41 (2011), pp. 527-551
-
Corrective Dependency Parsing
Keith B. Hall, Vaclav Novak
Trends in Parsing Technologies, Springer (2011)
-
Deterministic Statistical Mapping of Sentences to Underspecified Semantics
Hiyan Alshawi, Pi-Chuan Chang, Michael Ringgaard
Proceedings of the Ninth International Conference on Computational Semantics (IWCS 2011)
-
Discovering fine-grained sentiment with latent variable structured prediction models
Oscar Tackstrom, Ryan McDonald
European Conference on Information Retrieval (2011)
-
DiversiWeb 2011
Elena Paslaru Bontas Simperl, Devika P. Madalli, Denny Vrandecic, Enrique Alfonseca
SIGIR Forum, vol. 45 (2011), pp. 49-53
-
DiversiWeb 2011: first international workshop on knowledge diversity on the web
Elena Paslaru Bontas Simperl, Devika P. Madalli, Denny Vrandecic, Enrique Alfonseca
WWW (Companion Volume) (2011), pp. 319-320
-
Efficient Parallel CKY Parsing on GPUs
Youngmin Yi, Chao-Yue Lai, Slav Petrov, Kurt Keutzer
Proceedings of the International Conference on Parsing Technologies (IWPT '11) (2011)
-
Fine-Grained Class Label Markup of Search Queries
Joseph Reisinger, Marius Pasca
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL-2011), pp. 1200-1209
-
Gappy Phrasal Alignment by Agreement
Mohit Bansal, Chris Quirk, Robert C. Moore
Proc. 49th Annual Meeting of the Association for Computational Linguistics, ACL, Portland, Oregon (2011), pp. 1308-1317
-
Improved Video Categorization from Text Metadata and User Comments
Katja Filippova, Keith B. Hall
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information (SIGIR-2011), Beijing, China, pp. 835-842
-
K2Q: Generating Natural Language Questions from Keywords with User Refinements
Zhicheng Zheng, Xiance Si, Edward Y. Chang, Xiaoyan Zhu
Proceedings of the 5th International Joint Conference on Natural Language Processing, ACL (2011), 947–955
-
Large-Scale Cross-Document Coreference Using Distributed Inference and Hierarchical Models
Sameer Singh, Amarnag Subramanya, Fernando Pereira, Andrew McCallum
Association for Computational Linguistics (ACL) (2011)
-
Learning to Rank Answers to Non-Factoid Questions from Web Collections
Mihai Surdeanu, Massimiliano Ciaramita, Hugo Zaragoza
Computational Linguistics, vol. 37 (2011), pp. 351-383
-
Multi-Source Transfer of Delexicalized Dependency Parsers
Ryan McDonald, Slav Petrov, Keith B. Hall
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP '11)
-
Piggyback: Using Search Engines for Robust Cross-Domain Named Entity Recognition
Stefan Rued, Massimiliano Ciaramita, Jens Mueller, Hinrich Schuetze
49th Annual Meeting of the Association for Computational Linguistics (ACL-HLT), Association for Computational Linguistics (2011), pp. 965-975
-
Posterior Sparsity in Dependency Grammar Induction
Jennifer Gillenwater, Kuzman Ganchev, Joao Graca, Fernando Pereira, Ben Taskar
Journal of Machine Learning Research, vol. 12 (2011), pp. 455-490
-
Punctuation: Making a Point in Unsupervised Dependency Parsing
Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky
Fifteenth Conference on Computational Natural Language Learning (CoNLL-2011)
-
Question Identification on Twitter, Accepted by CIKM 2011
Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, Edward Y. Chang
Proceedings of the 20th ACM international conference on Information and knowledge management, ACM, New York, NY, USA (2011)
-
Ranking Class Labels Using Query Sessions
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL-2011), pp. 1607-1615
-
Semi-supervised Latent Variable Models for Fine-grained Sentiment Analysis
Oscar Tackstrom, Ryan McDonald
Association for Computational Linguistics (2011)
-
Training Structured Prediction Models with Extrinsic Loss Functions
Keith Hall, Ryan McDonald, Slav Petrov
Domain Adaptation Workshop at NIPS 2011
-
Training a Parser for Machine Translation Reordering
Jason Katz-Brown, Slav Petrov, Ryan McDonald, Franz Och, David Talbot, Hiroshi Ichikawa, Masakazu Seno
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP '11)
-
Training dependency parsers by jointly optimizing multiple objectives
Keith B. Hall, Ryan McDonald, Jason Katz-Brown, Michael Ringgaard
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing
-
Unsupervised Dependency Parsing without Gold Part-of-Speech Tags
Valentin I. Spitkovsky, Hiyan Alshawi, Angel X. Chang, Daniel Jurafsky
2011 Conference on Empirical Methods in Natural Language Processing (EMNLP 2011)
-
Unsupervised Part-of-Speech Tagging with Bilingual Graph-Based Projections
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL '11) (2011), Best Paper Award
-
A Comparison of Features for Automatic Readability Assessment
Lijun Feng, Martin Jansche, Matt Huenerfauth, Noémie Elhadad
23rd International Conference on Computational Linguistics (COLING 2010), Poster Volume, pp. 276-284
-
A novel approach for proper name transliteration verification
Ea-Ee Jan, Niyu Ge, Shih-Hsiang Lin, S. Roukos, J. Sorensen
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on, pp. 89 -94
-
Acquisition of Instance Attributes via Labeled and Related Instances
Enrique Alfonseca, Marius Pasca, Enrique Robledo-Arnuncio
Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR-10) (2010), pp. 58-65
-
Building Transcribed Speech Corpora Quickly and Cheaply for Many Languages
Thad Hughes, Kaisuke Nakajima, Linne Ha, Atul Vasu, Pedro Moreno, Mike LeBeau
Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010), International Speech Communication Association, pp. 1914-1917
-
Direct Construction of Compact Context-Dependency Transducers From Data
Interspeech 2010, ISCA
-
Distributed MAP Inference for Undirected Graphical Models
Sameer Singh, Amarnag Subramanya, Fernando Pereira, Andrew McCallum
Workshop on Learning on Cores, Clusters and Clouds (LCCC), Neural Information Processing Society (NIPS) (2010)
-
Efficient Graph-Based Semi-Supervised Learning of Structured Tagging Models
Amarnag Subramanya, Slav Petrov, Fernando Pereira
Proceedings of the 2010 Conference on Empirical Methods on Natural Language Processing (EMNLP '10)
-
Evaluation of Dependency Parsers on Unbounded Dependencies
Joakim Nivre, Laura Rimell, Ryan McDonald, Carlos Gómez Rodríguez
International Conference on Computational Linguistics (2010)
-
Expected Sequence Similarity Maximization
Cyril Allauzen, Shankar Kumar, Wolfgang Macherey, Mehryar Mohri, Michael Riley
NAACL HLT (2010)
-
Experiments in Graph-based Semi-Supervised Learning Methods for Class-Instance Acquisition
Partha Pratim Talukdar, Fernando Pereira
48th Annual Meeting of the Association for Computational Linguistics (ACL 2010)
-
From Baby Steps to Leapfrog: How “Less is More” in Unsupervised Dependency Parsing
Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky
Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2010)
-
Learning Better Monolingual Models with Unannotated Bilingual Text
David Burkett, Slav Petrov, John Blitzer, Dan Klein
Fourteenth Conference on Computational Natural Language Learning (CoNLL '10) (2010)
-
Learning Dense Models of Query Similarity from User Click Logs
Fabio De Bona, Stefan Riezler, Keith Hall, Massimiliano Ciaramita, Amac Herdagdelen, Maria Holmqvist
Proceedings of NAACL-HLT 2010
-
Lightly Supervised Learning of Text Normalization: Russian Number Names
IEEE Workshop on Spoken Language Technology, Berkeley, CA (2010) (to appear)
-
Logical Leaps and Quantum Connectives: Forging Paths through Predication Space
Trevor Cohen, Dominic Widdows, Roger W. Schvaneveldt, Thomas C. Rindflesch
AAAI-Fall 2010 Symposium on Quantum Informatics for Cognitive, Social, and Semantic Processes. (to appear)
-
Multi-Sentence Compression: Finding Shortest Paths in Word Graphs
Proceedings of the 23rd International Conference on Computational Linguistics (Coling'10) (2010)
-
Products of Random Latent Variable Grammars
Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL/HLT '10) (2010)
-
Profiting from Mark-Up: Hyper-Text Annotations for Guided Parsing
Valentin I. Spitkovsky, Daniel Jurafsky, Hiyan Alshawi
48th Annual Meeting of the Association for Computational Linguistics (ACL 2010)
-
Proper Name Transcription/Transliteration with ICU Transforms
Sascha Brawer, Martin Jansche, Hiroshi Takenaka, Yui Terashima
34th Internationalization & Unicode Conference (2010)
-
Query Language Modeling for Voice Search
Ciprian Chelba, Johan Schalkwyk, Thorsten Brants, Vida Ha, Boulos Harb, Will Neveitt, Carolina Parada, Peng Xu
Proceedings of the 2010 IEEE Workshop on Spoken Language Technology, IEEE, pp. 127-132
-
Query Rewriting using Monolingual Statistical Machine Translation
Stefan Riezler, Yi Liu
Computational Linguistics, vol. 36 (2010)
-
Self-training with Products of Latent Variable Grammars
Zhongqiang Huang, Mary Harper, Slav Petrov
Proceedings of the 2010 Conference on Empirical Methods on Natural Language Processing (EMNLP '10)
-
Sparsity in Dependency Grammar Induction
Jennifer Gillenwater, Kuzman Ganchev, João Graça, Fernando Pereira, Ben Taskar
48th Annual Meeting of the Association for Computational Linguistics (ACL 2010)
-
Speech Recognition for Mobiles Devices at Google
PRICAI 2010, Lecture Notes in Artificial Intelligence volume 6230, Springer, Heidelberg, pp. 8-10
-
Study on Interaction between Entropy Pruning and Kneser-Ney Smoothing
Ciprian Chelba, Thorsten Brants, Will Neveitt, Peng Xu
Proceedings of Interspeech (2010), pp. 2242-2245
-
The Role of Queries in Ranking Labeled Instances Extracted from Text
Proceedings of the 23rd International Conference on Computational Linguistics (COLING-2010), pp. 955-962
-
The Role of Query Sessions in Extracting Instance Attributes from Web Search Queries
Marius Pasca, Enrique Alfonseca, Enrique Robledo-Arnuncio, Ricardo Martin-Brualla, Keith Hall
Proceedings of the 32nd European Conference on Information Retrieval (ECIR-2010), pp. 62-74
-
The Semantic Vectors Package: New Algorithms and Public Tools for Distributional Semantics
Dominic Widdows, Trevor Cohen
Fourth IEEE International Conference on Semantic Computing (IEEE ICSC2010), IEEE
-
The Viability of Web-derived Polarity Lexicons
Leonid Velikovich, Sasha Blair-Goldensohn, Kerry Hannan, Ryan McDonald
North American Chapter of the Association for Computational Linguistics (2010)
-
Uptraining for Accurate Deterministic Question Parsing
Slav Petrov, Pi-Chuan Chang, Michael Ringgaard, Hiyan Alshawi
Proceedings of the 2010 Conference on Empirical Methods on Natural Language Processing (EMNLP '10)
-
Using Web-scale N-grams to Improve Base NP Parsing Performance
Emily Pitler, Shane Bergsma, Dekang Lin, Ken Church
Proceedings of COLING (2010), 886–894
-
Viterbi Training Improves Unsupervised Dependency Parsing
Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky, Christopher D. Manning
Fourteenth Conference on Computational Natural Language Learning (CoNLL-2010)
-
Etienne Barnard, Johan Schalkwyk, Charl van Heerden, Pedro J. Moreno
Interspeech 2010
-
What’s great and what’s not: learning to classify the scope of negation for improved sentiment analysis
Isaac Councill, Ryan McDonald, Leonid Velikovich
Workshop on Negation and Speculation in Natural Language Processing (2010)
-
A Generalized Composition Algorithm for Weighted Finite-State Transducers
Cyril Allauzen, Michael Riley, Johan Schalkwyk
Interspeech 2009
-
A Panlingual Anomalous Text Detector
DocEng '09: Proceedings of the 9th ACM symposium on Document Engineering, ACM, New York (2009), pp. 201-204
-
A Study on Similarity and Relatedness Using Distributional and WordNet-based Approaches
Eneko Agirre, Enrique Alfonseca, Keith Hall, Jana Kravalova, Marius Pasca, Aitor Soroa
Proceedings of NAACL-HLT 2009, pp. 19-27
-
An Approach to Web-Scale Named-Entity Disambiguation
Luís Sarmento, Alexander Kehlenbeck, Eugénio C. Oliveira, Lyle Ungar
MLDM '09: Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition, Springer-Verlag, Berlin, Heidelberg (2009), pp. 689-703
-
Automatic Adaptation of Annotation Standards: Chinese Word Segmentation and POS Tagging -- A Case Study
Wenbin Jiang, Liang Huang, Qun Liu
Proceedings of ACL-IJCNLP (2009)
-
Baby Steps: How “Less is More” in Unsupervised Dependency Parsing
Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky
NIPS 2009 Workshop on Grammar Induction, Representation of Language and Language Learning
-
Back-off Language Model Compression
Boulos Harb, Ciprian Chelba, Jeffrey Dean, Sanjay Ghemawat
Proceedings of Interspeech 2009, International Speech Communication Association (ISCA), pp. 325-355
-
Bilingually-Constrained (Monolingual) Shift-Reduce Parsing
Liang Huang, Wenbin Jiang, Qun Liu
Proceedings of EMNLP (2009), pp. 1222-1231
-
Combining Language Modeling and Discriminative Classification for Word Segmentation
Dekang Lin
CICLing '09: Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing, Springer-Verlag, Berlin, Heidelberg (2009), pp. 170-182
-
Contrastive summarization: An experiment with consumer reviews
Kevin Lerman, Ryan McDonald
North American Association for Computational Linguistics (2009)
-
Dependency Parsing
Sandra Kubler, Ryan McDonald, Joakim Nivre
Morgan & Claypool (2009)
-
Distributed language models
Thorsten Brants, Peng Xu
NAACL '09: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Tutorial Abstracts, Association for Computational Linguistics, Morristown, NJ, USA, pp. 3-4
-
Finite-State Machines for Mining Patterns in Very Large Text Repositories
Wojciech Skut
Proceeding of the 2009 conference on Finite-State Methods and Natural Language Processing, IOS Press, Amsterdam, The Netherlands, The Netherlands, pp. 23-23
-
Gazpacho and summer rash: lexical relationships from temporal patterns of web search queries
Enrique Alfonseca, Massimiliano Ciaramita, Keith Hall
Proceedings of the conference on Empirical Methods in Natural Language Processing (EMNLP) (2009)
-
Generative and Discriminative Latent Variable Grammars
The Generative and Discriminative Learning Interface Workshop at NIPS 2009
-
Glen, Glenda or Glendale: Unsupervised and Semi-supervised Learning of English Noun Gender
Shane Bergsma, Dekang Lin, Randy Goebel
CoNLL, Boulder, CO (2009)
-
Integrating sentence- and word-level error identification for disfluency correction
Erin Fitzgerald, Frederick Jelinek, Keith B. Hall
EMNLP '09: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Morristown, NJ, USA, pp. 765-774
-
Language modelling for what-with-where on GOOG-411
Charl Van Heerden, Johan Schalkwyk, Brian Strope
Proc. International Speech Communication Association (Interspeech 2009), pp. 991-994
-
Large-scale Computation of Distributional Similarities for Queries
Enrique Alfonseca, Keith Hall, Silvana Hartmann
Proceedings of NAACL-HLT-2009
-
Large-scale Semantic Networks: Annotation and Evaluation
Vaclav Novak, Sven Hartrumpf, Keith B. Hall
Proceedings of the Semantic Evaluations Workshop at NAACL-HLT (2009)
-
Latent Variable Models of Concept-Attribute Attachment
Joseph Reisinger, Marius Pasca
Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics (ACL-IJCNLP-2009), pp. 620-628
-
Low-Cost Supervision for Multiple-Source Attribute Extraction
Joseph Reisinger, Marius Pasca
Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing-2009), Mexico City, Mexico, pp. 382-393
-
Named Entity Transcription with Pair n-Gram Models
Martin Jansche, Richard Sproat
2009 Named Entities Workshop: Shared Task on Transliteration (NEWS 2009), ACL-IJCNLP 2009, pp. 32-35
-
Michael Riley, Cyril Allauzen, Martin Jansche
Proceedings of the North American Chapter of the Association for Computational Linguistics -- Human Language Technologies (NAACL HLT) 2009 conference, Tutorials
-
Outclassing Wikipedia in Open-Domain Information Extraction: Weakly-Supervised Acquisition of Attributes over Conceptual Hierarchies
Proceedings of the 12th Conference of the European Chapter of the Association of Computational Linguistics (EACL-2009), Athens, Greece, pp. 639-647
-
Phrase Clustering for Discriminative Learning
Dekang Lin, Xiaoyun Wu
Proceedings of ACL/IJCNLP, Singapore (2009), pp. 1030-1038
-
Posterior vs. Parameter Sparsity in Latent Variable Models
Joao Graca, Kuzman Ganchev, Ben Taskar, Fernando Pereira
Advances in Neural Information Processing Systems 22 (2009), pp. 664-672
-
Randomized Pruning: Efficiently Calculating Expectations in Large Dynamic Programs
Alexandre Bouchard-Côté, Slav Petrov, Dan Klein
Advances in Neural Information Processing Systems 22 (NIPS '09) (2009)
-
Reconstructing false start errors in spontaneous speech text
Erin Fitzgerald, Keith B. Hall, Frederick Jelinek
Proceedings of the European Chapter of the Association for Computational Linguistics (2009)
-
Semantic Vector Combinations and the Synoptic Gospels
Dominic Widdows, Trevor Cohen
Third International Symposium on Quantum Interaction (2009)
-
Semi-Supervised Polarity Lexicon Induction
Delip Rao, Deepak Ravichandran
EACL - 2009
-
Sentiment Summarization: Evaluating and Learning User Preferences
Kevin Lerman, Sasha Blair-Goldensohn, Ryan McDonald
European Association for Computational Linguistics (2009)
-
The CoNLL-2009 Shared Task: Syntactic and Semantic Dependencies in Multiple Languages
Jan Hajič, Massimiliano Ciaramita, Richard Johansson, Daisuke Kawahara, Maria Antònia Martí, Lluís Màrquez, Adam Meyers, Joakim Nivre, Sebastian Padó, Jan Štepánek, Pavel Straňák, Mihai Surdeanu, Nianwen Xue, Yi Zhang
Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL 2009): Shared Task, Association for Computational Linguistics, 209 N. Eight Street, Stroudsburg, PA 18360, pp. 1-18
-
Using a dependency parser to improve SMT for subject-object-verb languages
Peng Xu, Jaeho Kang, Michael Ringgaard, Franz Och
NAACL '09: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Association for Computational Linguistics, Morristown, NJ, USA, pp. 245-253
-
Using the web for language independent spellchecking and autocorrection
Casey Whitelaw, Ben Hutchinson, Grace Y. Chung, Gerard Ellis
EMNLP '09: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Morristown, NJ, USA, pp. 890-899
-
Web-Derived Resources for Web Information Retrieval: From Conceptual Hierarchies to Attribute Hierarchies
Marius Pasca, Enrique Alfonseca
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR-09), Boston, Massachusetts (2009), pp. 596-603
-
Web-Scale N-gram Models for Lexical Disambiguation
Shane Bergsma, Dekang Lin, Randy Goebel
Proceedings of IJCAI, Los Angeles, CA (2009), pp. 1507-1512
-
A Joint Model of Text and Aspect Ratings for Sentiment Summarization
Ivan Titov, Ryan McDonald
Association for Computational Linguistics (2008)
-
Answering Definition Questions via Temporally-Anchored Text Snippets
Proceedings of the 3rd International Joint Conference on Natural Language Processing (IJCNLP-2008), Hyderabad, India, pp. 411-417
-
Building a Sentiment Summarizer for Local Service Reviews
Sasha Blair-Goldensohn, Kerry Hannan, Ryan McDonald, Tyler Neylon, George Reis, Jeff Reynar
WWW Workshop on NLP Challenges in the Information Explosion Era (NLPIX) (2008)
-
Decompounding query keywords from compounding languages
Enrique Alfonseca, Slaven Bilac, Stefan Pharies
Proceedings of ACL-2008
-
Discriminative learning of selectional preference from unlabeled text
Shane Bergsma, Dekang Lin, Randy Goebel
EMNLP '08: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Morristown, NJ, USA (2008), pp. 59-68
-
Distributional Identification of Non-Referential Pronouns
Shane Bergsma, Dekang Lin, Randy Goebel
Proceedings of ACL-08: HLT, Association for Computational Linguistics, Columbus, Ohio (2008), pp. 10-18
-
Finding Cars, Goddesses and Enzymes: Parametrizable Acquisition of Labeled Instances for Open-Domain Information Extraction
Benjamin Van Durme, Marius Pasca
Proceedings of the 23rd Annual Conference on Artificial Intelligence (AAAI-2008), pp. 1243-1248
-
German decompounding in a difficult corpus
Enrique Alfonseca, Slaven Bilac, Stefan Pharies
Proceedings of CICLING-2008, Lecture Notes in Computer Science, Springer, pp. 128-139
-
Integrating Graph-based and Transition-based Dependency Parsers
Joakim Nivre, Ryan McDonald
Association for Computational Linguistics (2008)
-
Large Scale Acquisition of Paraphrases for Learning Surface Patterns
Rahul Bhagat, Deepak Ravichandran
ACL-2008
-
Randomized Language Models via Perfect Hash Functions
David Talbot, Thorsten Brants
Proceedings of ACL-08: HLT, Association for Computational Linguistics, Columbus, Ohio (2008), pp. 505-513
-
Reading the Markets: Forecasting Public Opinion of Political Candidates by News Analysis
Kevin Lerman, Ari Gilder, Mark Dredze, Fernando Pereira
Conference on Computational Linguistics (Coling) (2008)
-
Semantic Vector Products: Some Initial Investigations
Dominic Widdows
Proceedings of the Second AAAI Symposium on Quantum Interaction, AAAI (2008)
-
Towards Temporal Web Search
Proceedings of the 23rd ACM Symposium on Applied Computing (SAC-2008), Fortaleza, Brazil, pp. 1117-1121
-
Translating Queries into Snippets for Improved Query Expansion
Stefan Riezler, Yi Liu, Alexander Vasserman
Proceedings of the 22nd International Conference on Computational Linguistics (COLING'08), Manchester, England (2008)
-
Turning Web Text and Search Queries into Factual Knowledge: Hierarchical Class Attribute Extraction
Proceedings of the 23rd Annual Conference on Artificial Intelligence (AAAI-2008), pp. 1225-1230
-
Using Structured Text for Large-Scale Attribute Extraction
Proceedings of the 17th ACM Conference on Information and Knowledge Management (CIKM-2008), pp. 1183-1192
-
Weakly-Supervised Acquisition of Labeled Class Instances using Graph Random Walks
Partha Pratim Talukdar, Joseph Reisinger, Marius Pasca, Deepak Ravichandran, Rahul Bhagat, Fernando Pereira
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-08), Association for Computational Linguistics, Honolulu, Hawaii (2008), pp. 582-590
-
Weakly-Supervised Acquisition of Open-Domain Classes and Class Attributes from Web Documents and Query Logs
Marius Pasca, Benjamin Van Durme
Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-2008), pp. 19-27
-
Web-scale named entity recognition
Casey Whitelaw, Alex Kehlenbeck, Nemanja Petrovic, Lyle Ungar
CIKM '08: Proceeding of the 17th ACM conference on Information and knowledge management, ACM, New York, NY, USA (2008), pp. 123-132
-
Wide-Coverage Deep Statistical Parsing using Automatic Dependency Structure Annotation
Aoife Cahill, Ruth O'Donovan, Josef van Genabith, Michael Burke, Stefan Riezler, Andy Way
Computational Linguistics, vol. 34 (1) (2008), pp. 81-124
-
A Study of Global Inference Algorithms in Multi-Document Summarization
European Conference on Information Retrieval (ECIR) (2007)
-
Characterizing the Errors of Data-Driven Dependency Parsers
Ryan McDonald, Joakim Nivre
Empirical Methods in Natural Language Processing (2007)
-
Frustratingly Hard Domain Adaptation for Dependency Parsing
Mark Dredze, John Blitzer, Partha Pratim Talukdar, Kuzman Ganchev, João V. Graça, Fernando Pereira
Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL 2007, pp. 1051-1055
-
Monojit Choudhury, Markose Thomas, Animesh Mukherjee, Niloy Ganguly, Anupam Basu
Textgraphs 2 Workshop, at HLT/NAACL, ACL (2007), pp. 8
-
Inference in Text Understanding
AAAI Spring Symposium on Machine Reading (2007)
-
Lightweight Web-Based Fact Repositories for Textual Question Answering
Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM-2007), Lisboa, Portugal, pp. 87-96
-
N-Gram Statistical Similarities and Differences between Chinese and English
Pei Cao, Stewart Yang, Hongjun Zhu
First IEEE International Conference on Semantic Computing, IEEE (2007)
-
On the Complexity of Non-Projective Data-Driven Dependency Parsing
Ryan McDonald, Giorgio Satta
International Conference on Parsing Technologies (2007), pp. 121-132
-
Organizing and Searching the World Wide Web of Facts - Step Two: Harnessing the Wisdom of the Crowds
Proceedings of the 16th International World Wide Web Conference (WWW-07) (2007), pp. 101-110
-
Reconocimiento de Entidades, Resolución de Correferencia y Extracción de Relaciones
F. Verdejo (ed.), Acceso y viabilidad de la información multilingüe en la red: el rol de la semántica, Fundación Duques de Soria (2007)
-
Simple training of dependency parsers via structured boosting
Qin Iris Wang, Dekang Lin, Dale Schuurmans
IJCAI'07: Proceedings of the 20th international joint conference on Artifical intelligence, Morgan Kaufmann Publishers Inc., San Francisco, CA, USA (2007), pp. 1756-1762
-
Statistical Machine Translation for Query Expansion in Answer Retrieval
Stefan Riezler, Alexander Vasserman, Ioannis Tsochantaridis, Vibhu Mittal, Yi Liu
Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL'07), Prague, Czech Republic (2007)
-
Structured Models for Fine-to-Coarse Sentiment Analysis
Ryan McDonald, Kerry Hannan, Tyler Neylon, Mike Wells, Jeff Reynar
45th Annual Meeting of the Association for Computational Linguistics (ACL 2007)
-
The Role of Documents vs. Queries in Extracting Class Attributes from Text
Marius Pasca, Benjamin Van Durme, Nikesh Garera
Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM-2007), Lisboa, Portugal, pp. 485-494
-
Weakly-Supervised Discovery of Named Entities Using Web Search Queries
Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM-2007), Lisboa, Portugal, pp. 683-690
-
What You Seek is What You Get: Extraction of Class Attributes from Query Logs
Marius Pasca, Benjamin Van Durme
Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI-07) (2007), pp. 2832-2837
-
A Context Pattern Induction Method for Named Entity Extraction
Partha Pratim Talukdar, Thorsten Brants, Mark Liberman, Fernando Pereira
Proceedings of CoNLL-X (2006), pp. 141-148
-
Bootstrapping Path-Based Pronoun Resolution
Shane Bergsma, Dekang Lin
Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Sydney, Australia (2006), pp. 33-40
-
Comparative Experiments on Sentiment Classification for Online Product Reviews
Hang Cui, Vibhu Mittal, Mayur Datar
Proceedings of the 21st National Conference on Artificial Intelligence, AAAI, Boston, MA (2006)
-
Integrating probabilistic extraction models and data mining to discover relations and patterns in text
Aron Culotta, Andrew McCallum, Jonathan Betz
HLT-NAACL, New York, NY (2006), pp. 296-303
-
Names and Similarities on the Web: Fact Extraction in the Fast Lane
Marius Pasca, Dekang Lin, Jeffrey Bigham, Andrei Lifchits, Alpa Jain
Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (COLING-ACL-06), Sydney, Australia (2006), pp. 809-816
-
On a Common Fallacy in Computational Linguistics
A Man of Measure: Festschrift in Honour of Fred Karlsson on this 60th Birthday, SKY Journal of Linguistics, Volume 19 (2006), pp. 432-439
-
Organizing and Searching the World Wide Web of Facts - Step One: the One-Million Fact Extraction Challenge
Marius Pasca, Dekang Lin, Jeffrey Bigham, Andrei Lifchits, Alpa Jain
Proceedings of the 21st National Conference on Artificial Intelligence (AAAI-06), Boston, Massachusetts (2006), pp. 1400-1405
-
Probabilistic Context-Free Grammar Induction Based on Structural Zeros
Proceedings of the Seventh Meeting of the Human Language Technology conference - North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2006), New York, NY
-
Soft Syntactic Constraints for Word Alignment through Discriminative Training
Colin Cherry, Dekang Lin
Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, Association for Computational Linguistics, Sydney, Australia, pp. 105-112
-
Using Encyclopedic Knowledge for Named Entity Disambiguation
Razvan Bunescu, Marius Pasca
Proceedings of the 11th Conference of the European Chapter of the Association of Computational Linguistics (EACL-2006), Trento, Italy, pp. 9-16
-
Aligning Needles in a Haystack: Paraphrase Acquisition Across the Web
Proceedings of the 2nd International Joint Conference on Natural Language Processing (IJCNLP-2005), Jeju Island, Republic of Korea, pp. 119-130
-
Finding Instance Names and Alternative Glosses on the Web: WordNet Reloaded
Proceedings of the 6th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing-2005), Mexico City, Mexico, pp. 280-292
-
Local Grammar Algorithms
Inquiries into Words, Constraints, and Contexts. Festschrift in Honour of Kimmo Koskenniemi on his 60th Birthday, CSLI Publications, Stanford University (2005), pp. 84-93
-
Mining Paraphrases from Self-Anchored Web Sentence Fragments
Proceedings of the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD-2005), Porto, Portugal, pp. 193-204
-
Statistical Natural Language Processing
Applied Combinatorics on Words, Cambridge University Press (2005)
-
Strictly lexical dependency parsing
Qin Iris Wang, Dale Schuurmans, Dekang Lin
Parsing '05: Proceedings of the Ninth International Workshop on Parsing Technology, Association for Computational Linguistics, Morristown, NJ, USA (2005), pp. 152-159
-
The Design Principles and Algorithms of a Weighted Grammar Library
Cyril Allauzen, Mehryar Mohri, Brian Roark
International Journal of Foundations of Computer Science, vol. 16 (2005)
-
Acquisition of Categorized Named Entities for Web Search
Proceedings of the 13th ACM Conference on Information and Knowledge Management (CIKM-04), Washington, D.C. (2004), pp. 137-145
-
Alexander Franz, Brian Milch
Proceedings of the 19th International Conference on Computational Linguistics (COLING) (2002), pp. 1213-1217