Slav Petrov

I work on problems at the intersection of natural language processing and machine learning. I am particularly interested in multilingual syntactic analysis and its applications to machine translation and information extraction. Here is a video of a recent talk about my work, which has been recognized with best paper awards at NAACL 2012 and ACL 2011.

I completed my PhD at UC Berkeley in 2009, where I worked with Dan Klein. My Master's degree is from the Free University of Berlin, where our team won the RoboCup world championship in robotic soccer. I also spent a year as an exchange student at Duke University.

I grew up in Berlin, Germany, but I am originally from Sofia, Bulgaria. I therefore consider myself a Berliner from Bulgaria. Whenever Bulgaria plays Germany in soccer, I support Bulgaria.

See also my personal webpage for more information (including presentation slides).

Google Publications

  •   

    Enhanced Search with Wildcards and Morphological Inflections in the Google Books Ngram Viewer

    Jason Mann, David Zhang, Lu Yang, Dipanjan Das, Slav Petrov

    Proceedings of the 52th Annual Meeting of the Association for Computational Linguistics (Demonstrations), Association for Computational Linguistics (2014)

  •    

    Source-Side Classifier Preordering for Machine Translation

    Uri Lerner, Slav Petrov

    Proc. of EMNLP '13 (2013) (to appear)

  •   

    Token and Type Constraints for Cross-Lingual Part-of-Speech Tagging

    Oscar Tackstrom, Dipanjan Das, Slav Petrov, Ryan McDonald, Joakim Nivre

    Transactions of the Association for Computational Linguistics (TACL '13) (2013)

  •   

    Universal Dependency Annotation for Multilingual Parsing

    Ryan McDonald, Joakim Nivre, Yoav Goldberg, Yvonne Quirmbach-Brundage, Dipanjan Das, Kuzman Ganchev, Keith Hall, Slav Petrov, Hao Zhang, Oscar Tackstrom, Claudia Bedini, Nuria Bertomeu Castello, Jungmee Lee

    Association for Computational Linguistics (2013)

  •   

    A Universal Part-of-Speech Tagset

    Slav Petrov, Dipanjan Das, Ryan McDonald

    Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC '12) (2012)

  •    

    Google's Hybrid Approach to Research

    Alfred Spector, Peter Norvig, Slav Petrov

    Communications of the ACM, vol. 55 Issue 7 (2012), pp. 34-37

  •   

    Overview of the 2012 Shared Task on Parsing the Web

    Slav Petrov, Ryan McDonald

    Notes of the First Workshop on Syntactic Analysis of Non-Canonical Language (SANCL) (2012)

  •   

    Syntactic Annotations for the Google Books Ngram Corpus

    Yuri Lin, Jean-Baptiste Michel, Erez Lieberman Aiden, Jon Orwant, William Brockman, Slav Petrov

    Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Volume 2: Demo Papers (ACL '12) (2012)

  •   

    Using Search-Logs to Improve Query Tagging

    Kuzman Ganchev, Keith B. Hall, Ryan McDonald, Slav Petrov

    Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Volume 2: Short Papers (ACL '12) (2012)

  •    

    Vine Pruning for Efficient Multi-Pass Dependency Parsing

    Alexander Rush, Slav Petrov

    The 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL '12), Best Paper Award

  •   

    Efficient Parallel CKY Parsing on GPUs

    Youngmin Yi, Chao-Yue Lai, Slav Petrov, Kurt Keutzer

    Proceedings of the International Conference on Parsing Technologies (IWPT '11) (2011)

  •   

    Multi-Source Transfer of Delexicalized Dependency Parsers

    Ryan McDonald, Slav Petrov, Keith B. Hall

    Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP '11)

  •    

    Training a Parser for Machine Translation Reordering

    Jason Katz-Brown, Slav Petrov, Ryan McDonald, Franz Och, David Talbot, Hiroshi Ichikawa, Masakazu Seno

    Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP '11)

  •   

    Training Structured Prediction Models with Extrinsic Loss Functions

    Keith Hall, Ryan McDonald, Slav Petrov

    Domain Adaptation Workshop at NIPS 2011

  •    

    Unsupervised Part-of-Speech Tagging with Bilingual Graph-Based Projections

    Dipanjan Das, Slav Petrov

    Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL '11) (2011), Best Paper Award

  •   

    Efficient Graph-Based Semi-Supervised Learning of Structured Tagging Models

    Amarnag Subramanya, Slav Petrov, Fernando Pereira

    Proceedings of the 2010 Conference on Empirical Methods on Natural Language Processing (EMNLP '10)

  •   

    Learning Better Monolingual Models with Unannotated Bilingual Text

    David Burkett, Slav Petrov, John Blitzer, Dan Klein

    Fourteenth Conference on Computational Natural Language Learning (CoNLL '10) (2010)

  •   

    Products of Random Latent Variable Grammars

    Slav Petrov

    Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL/HLT '10) (2010)

  •   

    Self-training with Products of Latent Variable Grammars

    Zhongqiang Huang, Mary Harper, Slav Petrov

    Proceedings of the 2010 Conference on Empirical Methods on Natural Language Processing (EMNLP '10)

  •   

    Uptraining for Accurate Deterministic Question Parsing

    Slav Petrov, Pi-Chuan Chang, Michael Ringgaard, Hiyan Alshawi

    Proceedings of the 2010 Conference on Empirical Methods on Natural Language Processing (EMNLP '10)

  •   

    Generative and Discriminative Latent Variable Grammars

    Slav Petrov

    The Generative and Discriminative Learning Interface Workshop at NIPS 2009

  •   

    Randomized Pruning: Efficiently Calculating Expectations in Large Dynamic Programs

    Alexandre Bouchard-Côté, Slav Petrov, Dan Klein

    Advances in Neural Information Processing Systems 22 (NIPS '09) (2009)

Previous Publications

  •   

    Coarse-to-Fine Natural Language Processing

    Slav Petrov

    Ph.D. Thesis, University of California at Berkeley (2009)

  •   

    Coarse-to-Fine Syntactic Machine Translation using Language Projections

    Slav Petrov, Aria Haghighi, Dan Klein

    Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics, Honolulu, Hawaii, pp. 108-116

  •   

    Discriminative Log-Linear Grammars with Latent Variables

    Slav Petrov, Dan Klein

    Advances in Neural Information Processing Systems 20 (NIPS), MIT Press, Cambridge, MA (2008), pp. 1153-1160

  •   

    Efficient Sentence Segmentation Using Syntactic Features

    Benoit Favre, Dilek Hakkani-Tür, Slav Petrov, Dan Klein

    Spoken Language Technologies (SLT), Goa, India (2008)

  •   

    Parsing German with Latent Variable Grammars

    Slav Petrov, Dan Klein

    Proceedings of the Workshop on Parsing German at ACL '08, Association for Computational Linguistics, Columbus, Ohio (2008), pp. 33-39

  •   

    Sparse Multi-Scale Grammars for Discriminative Latent Variable Parsing

    Slav Petrov, Dan Klein

    Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics, Honolulu, Hawaii, pp. 867-876

  •   

    Improved Inference for Unlexicalized Parsing

    Slav Petrov, Dan Klein

    Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference, Association for Computational Linguistics, Rochester, New York, pp. 404-411

  •   

    Learning Structured Models for Phone Recognition

    Slav Petrov, Adam Pauls, Dan Klein

    Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp. 897-905

  •   

    Learning and Inference for Hierarchically Split PCFGs

    Slav Petrov, Dan Klein

    AAAI 2007 (Nectar Track)

  •   

    The Infinite PCFG Using Hierarchical Dirichlet Processes

    Percy Liang, Slav Petrov, Michael Jordan, Dan Klein

    Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp. 688-697

  •   

    Detecting Categories in News Video using Acoustic, Speech and Image Features

    Slav Petrov, Arlo Faria, Pascal Michaillat, Alexander Berg, Andreas Stolcke, Dan Klein, Jitendra Malik

    Proceedings of (VIDEO) TREC (TrecVid 2006)

  •   

    Learning Accurate, Compact, and Interpretable Tree Annotation

    Slav Petrov, Leon Barrett, Romain Thibaux, Dan Klein

    Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (ACL/COLING), Association for Computational Linguistics, Sydney, Australia (2006), pp. 433-440

  •   

    Non-Local Modeling with a Mixture of PCFGs

    Slav Petrov, Leon Barrett, Dan Klein

    Proceedings of the Tenth Conference on Computational Natural Language Learning (CoNLL-X), Association for Computational Linguistics, New York City (2006), pp. 14-20

  •   

    3D Tracking = Classification + Interpolation

    Carlo Tomasi, Slav Petrov, Arvind Sastry

    Proceedings of the Ninth IEEE International Conference on Computer Vision (ICCV) (2003)