Fernando Pereira

Fernando Pereira is research director at Google. His previous appointments include chair of the Computer and Information Science department at the University of Pennsylvania, head of the Machine Learning and Information Retrieval department at AT&T Labs, and research and management positions at SRI International. He received a Ph.D. in Artificial Intelligence from the University of Edinburgh in 1982. His main research interests are in machine-learnable models of language and biological sequences. He has over 100 research publications on computational linguistics, machine learning, bioinformatics, speech recognition, and logic programming, as well as several patents. He was elected Fellow of the American Association for Artificial Intelligence in 1991 for his contributions to computational linguistics and logic programming, and he has served as president of the Association for Computational Linguistics.

Google Publications

  •    

    Large Scale Distributed Acoustic Modeling With Back-off N-grams

    Ciprian Chelba, Peng Xu, Fernando Pereira, Thomas Richardson

    ICSI, Berkeley, California (2013)

  •    

    Large Scale Distributed Acoustic Modeling With Back-off N-grams

    Ciprian Chelba, Peng Xu, Fernando Pereira, Thomas Richardson

    IEEE Transactions on Audio, Speech and Language Processing, vol. 21 (2013), pp. 1158-1169

  •    

    Distributed Acoustic Modeling with Back-off N-grams

    Ciprian Chelba, Peng Xu, Fernando Pereira, Thomas Richardson

    Proceedings of ICASSP 2012, IEEE, pp. 4129-4132

  •    

    Controlling Complexity in Part-of-Speech Induction

    Joao Graca, Kuzman Ganchev, Luisa Coheur, Fernando Pereira, Ben Taskar

    Journal of Artificial Intelligence Research (JAIR), vol. 41 (2011), pp. 527-551

  •    

    Large-Scale Cross-Document Coreference Using Distributed Inference and Hierarchical Models

    Sameer Singh, Amarnag Subramanya, Fernando Pereira, Andrew McCallum

    Association for Computational Linguistics (ACL) (2011)

  •    

    Posterior Sparsity in Dependency Grammar Induction

    Jennifer Gillenwater, Kuzman Ganchev, Joao Graca, Fernando Pereira, Ben Taskar

    Journal of Machine Learning Research, vol. 12 (2011), pp. 455-490

  •    

    A theory of learning from different domains

    Shai Ben-David, John Blitzer, Koby Crammer, Alex Kulesza, Fernando Pereira, Jennifer Vaughan

    Machine Learning, vol. 79 (2010), pp. 151-175

  •   

    Automatically incorporating new sources in keyword search-based data integration

    Partha Pratim Talukdar, Zachary G. Ives, Fernando Pereira

    SIGMOD Conference, ACM Press (2010), pp. 387-398

  •   

    Distributed MAP Inference for Undirected Graphical Models

    Sameer Singh, Amarnag Subramanya, Fernando Pereira, Andrew McCallum

    Workshop on Learning on Cores, Clusters and Clouds (LCCC), Neural Information Processing Systems (NIPS) (2010)

  •   

    Efficient Graph-Based Semi-Supervised Learning of Structured Tagging Models

    Amarnag Subramanya, Slav Petrov, Fernando Pereira

    Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP '10)

  •   

    Experiments in Graph-based Semi-Supervised Learning Methods for Class-Instance Acquisition

    Partha Pratim Talukdar, Fernando Pereira

    48th Annual Meeting of the Association for Computational Linguistics (ACL 2010)

  •   

    Exploiting Feature Covariance in High-Dimensional Online Learning

    Justin Ma, Alex Kulesza, Mark Dredze, Koby Crammer, Lawrence Saul, Fernando Pereira

    Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, JMLR (2010), pp. 493-500

  •   

    Sparsity in Dependency Grammar Induction

    Jennifer Gillenwater, Kuzman Ganchev, João Graça, Fernando Pereira, Ben Taskar

    48th Annual Meeting of the Association for Computational Linguistics (ACL 2010)

  •    

    A transcription factor affinity-based code for mammalian transcription initiation

    M Megraw, F Pereira, ST Jensen, U Ohler, AG Hatzigeorgiou

    Genome Research, vol. 19 (2009), pp. 644-56

  •   

    Gaussian Margin Machines

    Koby Crammer, Mehryar Mohri, Fernando Pereira

    Twelfth International Conference on Artificial Intelligence and Statistics (AISTATS 2009), Clearwater Beach, Florida, pp. 105-112

  •   

    Group Sparse Coding

    Samy Bengio, Fernando Pereira, Yoram Singer, Dennis Strelow

    Advances in Neural Information Processing Systems (2009)

  •    

    Posterior vs. Parameter Sparsity in Latent Variable Models

    Joao Graca, Kuzman Ganchev, Ben Taskar, Fernando Pereira

    Advances in Neural Information Processing Systems 22 (2009), pp. 664-672

  •   

    The Unreasonable Effectiveness of Data

    Alon Halevy, Peter Norvig, Fernando Pereira

    IEEE Intelligent Systems, vol. 24 (2009), pp. 8-12

  •    

    Confidence-Weighted Linear Classification

    Mark Dredze, Koby Crammer, Fernando Pereira

    International Conference on Machine Learning (ICML) (2008)

  •   

    Generating Summary Keywords for Emails Using Topics

    Mark Dredze, Hanna Wallach, Danny Puller, Fernando Pereira

    Proceedings of the 2008 International Conference on Intelligent User Interfaces

  •   

    Intelligent Email: Reply and Attachment Prediction

    Mark Dredze, Tova Brooks, Josh Carroll, Joshua Magarick, John Blitzer, Fernando Pereira

    Proceedings of the 2008 International Conference on Intelligent User Interfaces

  •   

    Learning Bounds for Domain Adaptation

    John Blitzer, Koby Crammer, Alex Kulesza, Fernando Pereira, Jennifer Wortman

    Advances in Neural Information Processing Systems 20, MIT Press, Cambridge, MA (2008)

  •    

    Reading the Markets: Forecasting Public Opinion of Political Candidates by News Analysis

    Kevin Lerman, Ari Gilder, Mark Dredze, Fernando Pereira

    Conference on Computational Linguistics (Coling) (2008)

  •   

    Structured Learning with Approximate Inference

    Alex Kulesza, Fernando Pereira

    Advances in Neural Information Processing Systems 20, MIT Press, Cambridge, MA (2008)

  •   

    Weakly-Supervised Acquisition of Labeled Class Instances using Graph Random Walks

    Partha Pratim Talukdar, Joseph Reisinger, Marius Pasca, Deepak Ravichandran, Rahul Bhagat, Fernando Pereira

    Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-08), Association for Computational Linguistics, Honolulu, Hawaii (2008), pp. 582-590

  •   

    Euclidean Embedding of Co-occurrence Data

    Amir Globerson, Gal Chechik, Fernando Pereira, Naftali Tishby

    Journal of Machine Learning Research, vol. 8 (2007), pp. 2265-2295

  •   

    Frustratingly Hard Domain Adaptation for Dependency Parsing

    Mark Dredze, John Blitzer, Partha Pratim Talukdar, Kuzman Ganchev, João V. Graça, Fernando Pereira

    Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL 2007, pp. 1051-1055

Previous Publications

  •  

    A rate-distortion one-class model and its applications to clustering

    K. Crammer, P. Talukdar, F. Pereira

    Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), Omnipress, pp. 184-191

  •  

    Confidence-weighted linear classification

    M. Dredze, K. Crammer, F. Pereira

    Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), Omnipress, pp. 264-271

  •  

    Evigan: a hidden variable model for integrating gene evidence for eukaryotic gene prediction

    Qian Liu, Aaron J Mackey, David S Roos, Fernando C N Pereira

    Bioinformatics, vol. 24 (2008), pp. 597-605

  •  

    Intelligent Email: Aiding Users with AI

    Mark Dredze, Hanna Wallach, Danny Puller, Tova Brooks, Josh Carroll, Joshua Magarick, John Blitzer, Fernando Pereira

    American National Conference on Artificial Intelligence (AAAI) (2008)

  •  

    Learning to Create Data-Integrating Queries

    Partha Pratim Talukdar, Marie Jacob, M. Salman Mehmood, Koby Crammer, Zachary Ives, Fernando Pereira, Sudipto Guha

    VLDB (2008)

  •  

    Reranking candidate gene models with cross-species comparison for improved gene prediction

    Qian Liu, Koby Crammer, Fernando C. Pereira, David S. Roos

    BMC Bioinformatics, vol. 9 (2008), pp. 433

  •   

    Speech Recognition with Weighted Finite-State Transducers

    Mehryar Mohri, Fernando C. N. Pereira, Michael Riley

    Handbook on Speech Processing and Speech Communication, Part E: Speech recognition, Springer-Verlag, Heidelberg, Germany (2008)

  •   

    Analysis of Representations for Domain Adaptation

    Shai Ben-David, John Blitzer, Koby Crammer, Fernando Pereira

    Advances in Neural Information Processing Systems 20, MIT Press, Cambridge, MA (2007)

  •   

    Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification

    John Blitzer, Mark Dredze, Fernando Pereira

    Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, Association for Computational Linguistics, Prague, Czech Republic (2007), pp. 440-447

  •   

    Global Discriminative Learning for Higher-Accuracy Computational Gene Prediction

    Axel Bernal, Koby Crammer, Artemis Hatzigeorgiou, Fernando Pereira

    PLoS Computational Biology, vol. 3 (2007)

  •   

    Learning to join everything

    Fernando Pereira

    CIKM (2007), pp. 9-10

  •   

    Penn/UMass/CHOP Biocreative II systems

    Kuzman Ganchev, Koby Crammer, Fernando Pereira, Gideon Mann, Kedar Bellare, Andrew McCallum, Steven Carroll, Yang Jin, Peter White

    Proceedings of the Second BioCreative Challenge Evaluation Workshop (2007), pp. 119-124

  •   

    Semi-Automated Named Entity Annotation

    Kuzman Ganchev, Fernando Pereira, Mark Mandel, Steven Carroll, Peter White

    Proceedings of the Linguistic Annotation Workshop, Association for Computational Linguistics (2007), pp. 53-56

  •   

    The Need for Open Source Software in Machine Learning

    Soren Sonnenburg, Mikio L. Braun, Cheng Soon Ong, Samy Bengio, Leon Bottou, Geoff Holmes, Yann LeCun, Klaus-Robert Mueller, Fernando Pereira, Carl-Edward Rasmussen, Gunnar Raetsch, Bernhard Schoelkopf, Alexander Smola, Pascal Vincent, Jason Weston, Robert C. Williamson

    Journal of Machine Learning Research, vol. 8 (2007), pp. 2443-2466

  •   

    Transductive structured classification through constrained min-cuts

    Kuzman Ganchev, Fernando Pereira

    Proceedings of the Second Workshop on TextGraphs: Graph-Based Algorithms for Natural Language Processing, Association for Computational Linguistics (2007), pp. 37-44

  •   

    Speech Recognition with Weighted Finite-State Transducers

    Mehryar Mohri, Fernando C. N. Pereira, Michael Riley

    Handbook on Speech Processing and Speech Communication, Part E: Speech recognition, Springer-Verlag, Heidelberg, Germany (2007)

  •   

    "Sorry I forgot the attachment": Email Attachment Prediction

    Mark Dredze, John Blitzer, Fernando Pereira

    3rd Conference on Email and Anti-Spam, Stanford, CA (2006)

  •   

    A Context Pattern Induction Method for Named Entity Extraction

    Partha Pratim Talukdar, Thorsten Brants, Mark Liberman, Fernando Pereira

    Proceedings of CoNLL-X (2006), pp. 141-148

  •   

    An automated procedure to identify biomedical articles that contain cancer-associated gene variants

    Ryan McDonald, R Scott Winters, Claire K Ankuda, Joan A Murphy, Amy E Rogers, Fernando Pereira, Marc S Greenblatt, Peter S White

    Human Mutation, vol. 27 (2006), pp. 957-64

  •   

    Automated recognition of malignancy mentions in biomedical literature

    Yang Jin, Ryan T. McDonald, Kevin Lerman, Mark A. Mandel, Steven Carroll, Mark Y. Liberman, Fernando C. Pereira, Raymond S. Winters, Peter S. White

    BMC Bioinformatics, vol. 7 (2006), pp. 492

  •   

    Domain Adaptation with Structural Correspondence Learning

    John Blitzer, Ryan McDonald, Fernando Pereira

    EMNLP 2006: 2006 Conference on Empirical Methods in Natural Language Processing, pp. 120-128

  •  

    Embedding Heterogeneous Data Using Statistical Models

    Amir Globerson, Gal Chechik, Fernando Pereira, Naftali Tishby

    AAAI (2006)

  •   

    Multilingual Dependency Parsing with a Two-Stage Discriminative Parser

    Ryan McDonald, Kevin Lerman, Fernando Pereira

    Tenth Conference on Computational Natural Language Learning (CoNLL-X) (2006)

  •   

    Online Learning of Approximate Dependency Parsing Algorithms

    Ryan McDonald, Fernando Pereira

    11th Conference of the European Chapter of the Association for Computational Linguistics: EACL 2006, pp. 81-88

  •  

    Online Learning of Approximate Dependency Parsing Algorithms

    Ryan McDonald, Fernando Pereira

    Proceedings of EACL (2006)

  •   

    A Conditional Random Field for Discriminatively-trained Finite-state String Edit Distance

    Andrew McCallum, Kedar Bellare, Fernando Pereira

    Proceedings of the 21st Conference on Uncertainty in Artificial Intelligence (UAI 2005)

  •  

    Automatically annotating documents with normalized gene lists

    Jeremiah Crim, Ryan McDonald, Fernando Pereira

    BMC Bioinformatics (2005)

  •   

    Distributed Latent Variable Models of Lexical Co-occurrences

    John Blitzer, Amir Globerson, Fernando Pereira

    Tenth International Workshop on Artificial Intelligence and Statistics (2005)

  •  

    Flexible Text Segmentation with Structured Multilabel Classification

    Ryan McDonald, Koby Crammer, Fernando Pereira

    Proceedings of HLT-EMNLP (2005)

  •  

    Identifying gene and protein mentions in text using conditional random fields

    Ryan McDonald, Fernando Pereira

    BMC Bioinformatics (2005)

  •   

    Non-Projective Dependency Parsing using Spanning Tree Algorithms

    Ryan T. McDonald, Fernando Pereira, Kiril Ribarov, Jan Hajic

    HLT/EMNLP (2005)

  •  

    Non-Projective Dependency Parsing using Spanning Tree Algorithms

    Ryan McDonald, Fernando Pereira, Kiril Ribarov, Jan Hajic

    Proceedings of HLT-EMNLP (2005)

  •   

    Online Large-Margin Training of Dependency Parsers

    Ryan McDonald, Koby Crammer, Fernando Pereira

    43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005)

  •  

    Online Large-Margin Training of Dependency Parsers

    Ryan McDonald, Koby Crammer, Fernando Pereira

    Proceedings of ACL (2005)

  •   

    Reply Expectation Prediction for Email Management

    Mark Dredze, John Blitzer, Fernando Pereira

    2nd Conference on Email and Anti-Spam, Stanford, CA (2005)

  •   

    Reply Expectation Prediction for Email Management

    Mark Dredze, John Blitzer, Fernando Pereira

    CEAS (2005)

  •  

    Simple Algorithms for Complex Relation Extraction with Applications to Biomedical IE

    Ryan McDonald, Seth Kulick, Fernando Pereira, Scott Winters, Yang Jin, Pete White

    Proceedings of ACL (2005)

  •   

    Simple Algorithms for Complex Relation Extraction with Applications to Biomedical IE

    Ryan McDonald, Fernando Pereira, Seth Kulick, Scott Winters, Yang Jin, Pete White

    43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005)

  •   

    Weighted Automata in Text and Speech Processing

    Mehryar Mohri, Fernando Pereira, Michael Riley

    arXiv, vol. abs/cs/0503077 (2005)

  •  

    ATDD: An Algorithmic Tool for Domain Discovery in Protein Sequences

    Stanislav Angelov, Sanjeev Khanna, Li Li, Fernando Pereira

    Algorithms in Bioinformatics, 4th International Workshop (WABI 2004), Springer, pp. 206-217

  •  

    An entity tagger for recognizing acquired genomic variations in cancer literature

    Ryan McDonald, Scott Winters, Mark Mandel, Yang Jin, Pete White, Fernando Pereira

    Bioinformatics (2004)

  •   

    Case-Factor Diagrams for Structured Probabilistic Modeling

    David McAllester, Michael Collins, Fernando Pereira

    Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence (2004)

  •   

    Case-Factor Diagrams for Structured Probabilistic Modeling

    David A. McAllester, Michael Collins, Fernando Pereira

    UAI (2004), pp. 382-391

  •   

    Euclidean Embedding of Co-Occurrence Data

    Amir Globerson, Gal Chechik, Fernando C. Pereira, Naftali Tishby

    Advances in Neural Information Processing Systems (NIPS), MIT press, Cambridge, MA (2004), pp. 497-504

  •   

    Hierarchical Distributed Representations for Statistical Language Modeling

    John Blitzer, Kilian Weinberger, Lawrence Saul, Fernando Pereira

    Advances in Neural Information Processing Systems 17, MIT Press, Cambridge, MA (2004)

  •   

    Hierarchical Distributed Representations for Statistical Language Modeling

    John Blitzer, Kilian Q. Weinberger, Lawrence K. Saul, Fernando Pereira

    NIPS (2004)

  •   

    Shallow Parsing with Conditional Random Fields

    Fei Sha, Fernando C. N. Pereira

    HLT-NAACL (2003)

  •  

    Weighted finite-state transducers in speech recognition

    Mehryar Mohri, Fernando Pereira, Michael Riley

    Computer Speech & Language, vol. 16 (2002), pp. 69-88

  •   

    Weighted Finite-State Transducers in Speech Recognition

    Mehryar Mohri, Fernando C. N. Pereira, Michael Riley

    Computer Speech and Language, vol. 16 (2002), pp. 69-88

  •   

    Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

    John Lafferty, Andrew McCallum, Fernando Pereira

    Proceedings of ICML-01 (2001), pp. 282-289

  •   

    Formal Grammar and Information Theory: Together Again?

    Fernando Pereira

    Philosophical Transactions of the Royal Society, vol. 358 (2000), pp. 1239-1253

  •   

    Machine Learning for Efficient Natural-Language Processing

    Fernando C. N. Pereira

    CPM (2000), pp. 11

  •   

    Maximum Entropy Markov Models for Information Extraction and Segmentation

    Andrew McCallum, Dayne Freitag, Fernando Pereira

    Machine Learning: Proceedings of the Seventeenth International Conference (ICML 2000), Stanford, California, pp. 591-598

  •   

    The information bottleneck method

    Naftali Tishby, Fernando C. Pereira, William Bialek

    arXiv, vol. physics/0004057 (2000)

  •   

    The Design Principles of a Weighted Finite-State Transducer Library

    Mehryar Mohri, Fernando C. N. Pereira, Michael Riley

    Theoretical Computer Science, vol. 231 (2000), pp. 17-32

  •   

    Weighted Finite-State Transducers in Speech Recognition

    Mehryar Mohri, Fernando C. N. Pereira, Michael Riley

    Proceedings of the ISCA Tutorial and Research Workshop, Automatic Speech Recognition: Challenges for the new Millenium (ASR2000), Paris, France

  •   

    AT&T at TREC-8

    Amit Singhal, Steven P. Abney, Michiel Bacchiani, Michael Collins, Donald Hindle, Fernando C. N. Pereira

    TREC (1999)

  •  

    An Efficient Extension to Mixture Techniques for Prediction and Decision Trees

    Fernando C. N. Pereira, Yoram Singer

    Machine Learning, vol. 36 (1999), pp. 183-199

  •  

    Declarative Programming for a Messy World

    Fernando C. N. Pereira

    ICLP (1999), pp. 3-5

  •   

    Distributional Similarity Models: Clustering vs. Nearest Neighbors

    Lillian Lee, Fernando Pereira

    37th Annual Meeting of the Association for Computational Linguistics, Morgan Kaufmann, San Francisco, California (1999), pp. 33-40

  •  

    Document Expansion for Speech Retrieval

    Amit Singhal, Fernando C. N. Pereira

    SIGIR (1999), pp. 34-41

  •   

    Efficient General Lattice Generation and Rescoring

    Andrej Ljolje, Fernando Pereira, Michael Riley

    EUROSPEECH 99 (1999), pp. 1251-1254

  •   

    Finding Information in Audio: A New Paradigm for Audio Browsing and Retrieval

    Julia Hirschberg, Steve Whittaker, Don Hindle, Fernando Pereira, Amit Singhal

    Accessing Information in Spoken Audio: Proceedings of the ESCA ETRW Workshop, Cambridge, England (1999), pp. 117-122

  •   

    Multimedia Standards: Present and Future

    Fernando C. N. Pereira

    ICMCS, Vol. 1 (1999), pp. 145-146

  •  

    Quantifiers, Anaphora, and Intensionality

    Mary Dalrymple, John Lamping, Fernando Pereira, Vijay Saraswat

    Semantics and Syntax in Lexical Functional Grammar, MIT Press, Cambridge, Massachusetts (1999), pp. 39-89

  •   

    Relating Probabilistic Grammars and Automata

    Steven Abney, David McAllester, Fernando Pereira

    37th Annual Meeting of the Association for Computational Linguistics, Morgan Kaufmann, San Francisco, California (1999), pp. 542-549

  •   

    Relating Probabilistic Grammars and Automata

    Steven P. Abney, David A. McAllester, Fernando Pereira

    ACL (1999)

  •  

    SCAN: Designing and Evaluating User Interfaces to Support Retrieval From Speech Archives

    Steve Whittaker, Julia Hirschberg, John Choi, Donald Hindle, Fernando C. N. Pereira, Amit Singhal

    SIGIR (1999), pp. 26-33

  •  

    Similarity-Based Models of Word Cooccurrence Probabilities

    Ido Dagan, Lillian Lee, Fernando C. N. Pereira

    Machine Learning, vol. 34 (1999), pp. 43-69

  •   

    The Information Bottleneck Method

    Naftali Z. Tishby, Fernando Pereira, William Bialek

    Proceedings of the 37th Allerton Conference on Communication, Control and Computing, Urbana, Illinois (1999)

  •  

    AT&T at TREC-7

    Amit Singhal, John Choi, Donald Hindle, David D. Lewis, Fernando C. N. Pereira

    TREC (1998), pp. 186-198

  •   

    Dynamic Compilation of Weighted Context-Free Grammars

    Mehryar Mohri, Fernando Pereira

    Proceedings of COLING-ACL '98, Montreal, Canada (1998), pp. 891-897

  •  

    Modelling Divergent Production: A multi-domain approach

    F. Pereira

    ECAI (1998), pp. 131-132

  •  

    A Rational Design for a Weighted Finite-State Transducer Library

    Mehryar Mohri, Fernando C. N. Pereira, Michael Riley

    Proceedings of the Second International Workshop on Implementing Automata (WIA '97), Springer-Verlag, Berlin-NY (1998), pp. 144-158

  •   

    Dynamic Compilation of Weighted Context-Free Grammars

    Mehryar Mohri, Fernando C. N. Pereira

    36th Meeting of the Association for Computational Linguistics (ACL '98), Proceedings of the Conference, Montréal, Québec, Canada (1998), pp. 891-897

  •   

    Full Expansion of Context-Dependent Networks in Large Vocabulary Speech Recognition

    Mehryar Mohri, Michael Riley, Don Hindle, Andrej Ljolje, Fernando C. N. Pereira

    Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP '98), Seattle, Washington (1998)

  •   

    SCAN - Speech Content Based Audio Navigator: A Systems Overview

    John Choi, Don Hindle, Julia Hirschberg, Ivan Magrin-Chagnolleau, Christine Nakatani, Fernando Pereira, Amit Singhal, Steve Whittaker

    Proceedings of the Fifth International Conference on Spoken Language Processing, Sydney (1998)

  •   

    A Rational Design for a Weighted Finite-State Transducer Library

    Mehryar Mohri, Fernando Pereira, Michael Riley

    WIA'97: Proceedings of the Workshop on Implementing Automata, Springer-Verlag (1997)

  •   

    AT&T at TREC-6: SDR Track

    Amit Singhal, John Choi, Donald Hindle, Fernando C. N. Pereira

    TREC (1997), pp. 227-232

  •   

    Aggregate and Mixed-Order Markov Models for Statistical Language Processing

    Lawrence Saul, Fernando Pereira

    Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Somerset, NJ. Distributed by Morgan Kaufmann, San Francisco, CA (1997), pp. 81-89

  •   

    Finite-State Approximation of Phrase-Structure Grammars

    Fernando Pereira, Rebecca N. Wright

    Finite-State Language Processing, MIT Press, Cambridge, Massachusetts (1997), pp. 149-173

  •  

    Quantifiers, Anaphora, and Intensionality

    Mary Dalrymple, John Lamping, Fernando C. N. Pereira, Vijay A. Saraswat

    Journal of Logic, Language, and Information, vol. 6, no. 3 (1997), pp. 219-273

  •   

    Similarity-Based Methods For Word Sense Disambiguation

    Ido Dagan, Lillian Lee, Fernando C. N. Pereira

    arXiv (1997)

  •   

    Similarity-Based Methods For Word Sense Disambiguation

    Ido Dagan, Lillian Lee, Fernando Pereira

    35th Annual Meeting of the Association for Computational Linguistics, Morgan Kaufmann, San Francisco, California (1997), pp. 56-63

  •   

    Speech Recognition by Composition of Weighted Finite Automata

    Fernando Pereira, Michael Riley

    Finite-State Language Processing, MIT Press, Cambridge, Massachusetts (1997), pp. 431-453

  •  

    Transducer Composition for Context-Dependent Network Expansion

    Michael Riley, Fernando Pereira, Mehryar Mohri

    EuroSpeech'97, European Speech Communication Association, Genova, Italy (1997), pp. 1427-1430

  •  

    A Rational Design for a Weighted Finite-State Transducer Library

    Mehryar Mohri, Fernando C. N. Pereira, Michael Riley

    Proceedings of the Workshop on Implementing Automata (WIA '97), London, Ontario, Canada, University of Western Ontario, London, Ontario, Canada (1997)

  •  

    Transducer Composition for Context-Dependent Network Expansion

    Michael Riley, Fernando C. N. Pereira, Mehryar Mohri

    Proceedings of the 5th European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece (1997)

  •   

    A Deductive Account of Quantification in LFG

    Mary Dalrymple, John Lamping, Fernando Pereira, Vijay Saraswat

    Quantifiers, Deduction, and Context, CSLI Publications, Stanford, California (1996), pp. 33-57

  •   

    Intensional Verbs Without Type-Raising or Lexical Ambiguity

    Mary Dalrymple, John Lamping, Fernando Pereira, Vijay Saraswat

    Logic, Language and Computation (Volume 1), CSLI Publications, Stanford, California (1996), pp. 167-182

  •   

    Interactions of Scope and Ellipsis

    Stuart M. Shieber, Fernando Pereira, Mary Dalrymple

    Linguistics and Philosophy, vol. 19 (1996), pp. 527-552

  •  

    Language, Computation and Artificial Intelligence

    Fernando C. N. Pereira

    ACM Computing Surveys, vol. 28 (1996), pp. 9

  •   

    Speech Recognition by Composition of Weighted Finite Automata

    Fernando C. N. Pereira, Michael Riley

    arXiv (1996)

  •  

    Rational Power Series in Text and Speech Processing

    Mehryar Mohri, Fernando C. N. Pereira, Michael Riley

    Graduate course, University of Pennsylvania, Department of Computer Science, Philadelphia, PA (1996)

  •   

    Weighted Automata in Text and Speech Processing

    Mehryar Mohri, Fernando C. N. Pereira, Michael Riley

    Proceedings of the 12th biennial European Conference on Artificial Intelligence (ECAI-96), Workshop on Extended finite state models of language, John Wiley and Sons, Chichester, Budapest, Hungary (1996)

  •   

    Beyond Word N-Grams

    Fernando Pereira, Yoram Singer, Naftali Z. Tishby

    Proceedings of the Third Workshop on Very Large Corpora, Association for Computational Linguistics, Columbus, Ohio (1995), pp. 95-106

  •   

    Design of a Linguistic Postprocessor using Variable Memory Length Markov Models

    Isabelle Guyon, Fernando Pereira

    Proceedings of the Third International Conference on Document Analysis and Recognition, IEEE Computer Society Press, Los Alamitos, California (1995), pp. 454-457

  •   

    Ellipsis and Higher-Order Unification

    Mary Dalrymple, Stuart M. Shieber, Fernando C. N. Pereira

    arXiv (1995)

  •   

    Linear Logic for Meaning Assembly

    Mary Dalrymple, John Lamping, Fernando C. N. Pereira, Vijay A. Saraswat

    arXiv (1995)

  •   

    Principles and Implementation of Deductive Parsing

    Stuart M. Shieber, Yves Schabes, Fernando Pereira

    Journal of Logic Programming, vol. 24 (1995), pp. 3-36

  •  

    The AT&T 60,000 Word Speech-to-Text System

    Michael Riley, Andrej Ljolje, Don Hindle, Fernando Pereira

    Eurospeech'95: ESCA 4th European Conference on Speech Communication and Technology, Madrid, Spain (1995), pp. 207-210

  •   

    Frequencies vs Biases: Machine Learning Problems in Natural Language Processing (Extended Abstract)

    Fernando C. N. Pereira

    COLT (1994), pp. 12

  •  

    Frequencies vs. Biases: Machine Learning Problems in Natural Language Processing - Abstract

    Fernando C. N. Pereira

    ICML (1994), pp. 380

  •   

    Similarity-Based Estimation of Word Cooccurrence Probabilities

    Ido Dagan, Fernando Pereira, Lillian Lee

    32nd Annual Meeting of the Association for Computational Linguistics, Morgan Kaufmann, San Francisco, California (1994), pp. 272-278

  •  

    Weighted Rational Transductions and their Application to Human Language Processing

    Fernando Pereira, Michael Riley, Richard W. Sproat

    Human Language Technology Workshop, Morgan Kaufmann, San Francisco, California (1994), pp. 262-267

  •   

    Distributional Clustering of English Words

    Fernando Pereira, Naftali Z. Tishby, Lillian Lee

    31st Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Columbus, Ohio (1993), pp. 183-190

  •  

    Introduction to Special Issue on Natural Language Processing

    Fernando Pereira, Barbara J. Grosz

    Artificial Intelligence, vol. 63 (1993), pp. 1-15

  •  

    A spoken language translator for restricted-domain context-free languages

    David B. Roe, Pedro J. Moreno, Richard W. Sproat, Fernando Pereira, Michael Riley, Alejandro Macarrón

    Speech Communication, vol. 11 (1992), pp. 311-319

  •  

    Efficient Grammar Processing for a Spoken Language Translation System

    David B. Roe, Pedro J. Moreno, Richard W. Sproat, Fernando Pereira, Michael Riley, Alejandro Macarrón

    Proceedings of ICASSP, IEEE, San Francisco, California (1992), pp. 213-216

  •  

    Empirical Properties of Finite State Approximations for Phrase Structure Grammars

    Fernando Pereira, David B. Roe

    Proceedings of the International Conference on Spoken Language Processing, Banff, Alberta (1992), pp. 261-264

  •  

    Inside-Outside Reestimation from Partially Bracketed Corpora

    Fernando Pereira, Yves Schabes

    30th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Newark, Delaware (1992), pp. 128-135

  •  

    Quantifier Scoping

    Douglas B. Moran, Fernando Pereira

    The Core Language Engine, MIT Press, Cambridge, Massachusetts (1992), pp. 149-172

  •  

    Deductive Interpretation

    Fernando Pereira

    Natural Language and Speech, Springer-Verlag (1991), pp. 116-133

  •   

    Ellipsis and Higher-Order Unification

    Mary Dalrymple, Stuart M. Shieber, Fernando Pereira

    Linguistics and Philosophy, vol. 14 (1991), pp. 399-452

  •  

    Finite-State Approximation of Phrase-Structure Grammars

    Fernando Pereira, Rebecca N. Wright

    29th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Berkeley, California (1991), pp. 246-255

  •  

    Incremental Interpretation

    Fernando Pereira, Martha E. Pollack

    Artificial Intelligence, vol. 50 (1991), pp. 37-82

  •  

    Semantic Interpretation as Higher-Order Deduction

    Fernando Pereira

    Logics in AI: European Workshop JELIA'90, Springer-Verlag, Berlin, Germany, Amsterdam, Holland (1991), pp. 78-96

  •  

    Toward a Spoken Language Translator for Restricted-Domain Context-Free Languages

    David B. Roe, Fernando Pereira, Richard W. Sproat, Michael Riley, Pedro J. Moreno, Alejandro Macarrón

    EUROSPEECH 91 - 2nd European Conference on Speech Communication and Technology, Genova, Italy (1991), pp. 1063-1066

  •  

    Categorial Semantics and Scoping

    Fernando Pereira

    Computational Linguistics, vol. 16 (1990), pp. 1-10

  •  

    Finite-State Approximations of Grammars

    Fernando Pereira

    Proceedings of the Second Speech and Natural Language Workshop (1990), pp. 20-25

  •  

    Prolog and Natural-Language Analysis: into the Third Decade

    Fernando Pereira

    Logic Programming: Proceedings of the 1990 North American Conference, MIT Press, Cambridge, Massachusetts, Austin, Texas (1990), pp. 813-832

  •  

    Semantic-Head-Driven Generation

    Stuart M. Shieber, Gertjan van Noord, Fernando Pereira, Robert C. Moore

    Computational Linguistics, vol. 16 (1990), pp. 30-42

  •  

    A Calculus for Semantic Composition and Scoping

    Fernando Pereira

    27th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, University of British Columbia, Vancouver, Canada (1989), pp. 152-160

  •  

    A Semantic-Head-Driven Generation Algorithm for Unification-Based Formalisms

    Stuart M. Shieber, Gertjan van Noord, Robert C. Moore, Fernando Pereira

    27th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, University of British Columbia, Vancouver, Canada (1989), pp. 7-17

  •  

    Integrating Speech and Natural Language Processing

    Robert C. Moore, Fernando Pereira, Hy Murveit

    First Speech and Natural Language Workshop (1989), pp. 243-247

  •  

    Synergistic Use of Direct Manipulation and Natural Language

    Phil R. Cohen, Mary Dalrymple, Douglas B. Moran, Fernando Pereira, J. W. Sullivan, R. A. Gargan, Jr., J. L. Schlossberg, S. W. Tyler

    Proceedings of CHI'89, Austin, Texas (1989)

  •  

    An Integrated Framework for Semantic and Pragmatic Interpretation

    Martha E. Pollack, Fernando Pereira

    26th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Buffalo, New York (1988), pp. 75-86

  •  

    Grammars and Logics of Partial Information

    Fernando Pereira

    Logic Programming: Proceedings of the Fourth International Conference, MIT Press, Cambridge, Massachusetts, Melbourne, Australia (1987), pp. 989-1013

  •  

    Prolog and Natural-Language Analysis

    Fernando Pereira, Stuart M. Shieber

    Center for the Study of Language and Information, Stanford, California (1987)

  •  

    TEAM: An Experiment in the Design of Transportable Natural Language Interfaces

    Barbara J. Grosz, Douglas E. Appelt, Paul A. Martin, Fernando Pereira

    Artificial Intelligence, vol. 32 (1987), pp. 173-243

  •  

    A Sheaf-Theoretic Model of Concurrency

    Luis F. Monteiro, Fernando Pereira

    Symposium on Logic and Computer Science, IEEE Computer Society Press, Cambridge, Massachusetts (1986), pp. 66-76

  •  

    Can Drawing Be Liberated from the von Neumann Style?

    Fernando Pereira

    Logic Programming and Its Applications, Ablex, Norwood, New Jersey (1986), pp. 175-187

  •  

    TEAM: An Experimental Transportable Natural-Language Interface

    Paul A. Martin, Douglas E. Appelt, Barbara J. Grosz, Fernando C. N. Pereira

    FJCC (1986), pp. 260-267

  •  

    A New Characterization of Attachment Preferences

    Fernando Pereira

    Natural Language Parsing: Psychological, Computational and Theoretical Perspectives, Cambridge University Press, Cambridge, England (1985), pp. 307-319

  •  

    A Structure-Sharing Representation for Unification-Based Grammar Formalisms

    Fernando Pereira

    23rd Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Chicago, Illinois (1985), pp. 137-144

  •  

    An Overview of Automated Reasoning and Related Fields

    L. Wos, Fernando Pereira, Robert Hong, Robert S. Boyer, J Strother Moore, W. W. Bledsoe, L. J. Henschen, Bruce G. Buchanan, Graham Wrightson, Cordell Green

    Journal of Automated Reasoning, vol. 1 (1985), pp. 5-48

  •  

    The Semantics of Grammar Formalisms Seen as Computer Languages

    Fernando Pereira, Stuart M. Shieber

    Proceedings of COLING 84, Association for Computational Linguistics, Stanford, California (1984), pp. 123-129

  •  

    A Fact Dependency System for the Logic Programmer

    Peter S. G. Swinson, Fernando Pereira, Aart Bijl

    Computer-Aided Design, vol. 14 (1983), pp. 235-243

  •  

    Can Drawing Be Liberated From the Von Neumann Style?

    Fernando C. N. Pereira

    Databases for Business and Office Applications (1983), pp. 184-190

  •  

    Parsing as Deduction

    Fernando Pereira, David H. D. Warren

    21st Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Cambridge, Massachusetts (1983), pp. 137-144

  •  

    Transportability and Generality in a Natural-Language Interface System

    Paul A. Martin, Douglas E. Appelt, Fernando Pereira

    Proceedings of the Eighth International Joint Conference on Artificial Intelligence (1983), pp. 573-581

  •  

    An Efficient Easily Adaptable System for Interpreting Natural Language Queries

    David H. D. Warren, Fernando Pereira

    Computational Linguistics, vol. 8 (1982), pp. 110-122

  •  

    Extraposition Grammars

    Fernando Pereira

    Computational Linguistics, vol. 7 (1981), pp. 243-256

  •  

    Definite Clause Grammars for Language Analysis--a Survey of the Formalism and a Comparison with Augmented Transition Networks

    Fernando Pereira, David H. D. Warren

    Artificial Intelligence, vol. 13 (1980), pp. 231-278

  •  

    Prolog - The Language and its Implementation Compared with Lisp

    David H. D. Warren, Luis M. Pereira, Fernando Pereira

    Proceedings of the Symposium on Artificial Intelligence and Programming Languages, Rochester, New York (1977), pp. 109-115