Cyril Allauzen

Cyril Allauzen is a research scientist at Google in New York. His main research interests are in finite-state methods and their applications to text, speech and natural language processing and machine learning. Before joining Google, he worked as a researcher at AT&T Labs Research and at NYU's Courant Institute of Mathematical Sciences. Cyril received his Ph.D. in computer science from the Université de Marne-la-Vallée in 2001.

Cyril is an author of the OpenFst Library, the OpenKernel Library and the GRM Library.

Google Publications

Previous Publications

  •  

    OpenFst: a General and Efficient Weighted Finite-State Transducer Library

    Cyril Allauzen, Michael Riley, Johan Schalkwyk, Wojciech Skut, Mehryar Mohri

    Proceedings of the 12th International Conference on Implementation and Application of Automata (CIAA 2007), Springer-Verlag, Heidelberg, Germany, Prague, Czech Republic

  •  

    A Unified Construction of the Glushkov, Follow, and Antimirov Automata

    Cyril Allauzen, Mehryar Mohri

    MFCS (2006), pp. 110-121

  •   

    A Unified Construction of the Glushkov, Follow, and Antimirov Automata

    Cyril Allauzen, Mehryar Mohri

    Proceedings of the 31st International Symposium on Mathematical Foundations of Computer Science (MFCS 2006), Springer-Verlag, Heidelberg, Germany, Star\'a Lesn\'a, Slovakia, pp. 110-121

  •   

    The design principles and algorithms of a weighted grammar library

    Cyril Allauzen, Mehryar Mohri, Brian Roark

    Int. J. Found. Comput. Sci., vol. 16 (2005), pp. 403-421

  •   

    A General Weighted Grammar Library

    Cyril Allauzen, Mehryar Mohri, Brian Roark

    Ninth International Conference on Automata (CIAA 2004), Kingston, Canada, July 22-24, 2004, Springer-Verlag, Berlin-NY (2005)

  •   

    The Design Principles and Algorithms of a Weighted Grammar Library

    Cyril Allauzen, Mehryar Mohri, Brian Roark

    International Journal of Foundations of Computer Science, vol. 16 (2005)

  •  

    A General Weighted Grammar Library

    Cyril Allauzen, Mehryar Mohri, Brian Roark

    CIAA (2004), pp. 23-34

  •   

    A Generalized Construction of Integrated Speech Recognition Transducers

    Cyril Allauzen, Mehryar Mohri, Brian Roark, Michael Riley

    Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), Montreal, Canada

  •  

    An optimal pre-determinization algorithm for weighted transducers

    Cyril Allauzen, Mehryar Mohri

    Theor. Comput. Sci., vol. 328 (2004), pp. 3-18

  •   

    Statistical Modeling for Unit Selection in Speech Synthesis

    Cyril Allauzen, Mehryar Mohri, Michael Riley

    42nd Meeting of the Association for Computational Linguistics (ACL 2004), Proceedings of the Conference, Barcelona, Spain

  •   

    A General Weighted Grammar Library

    Cyril Allauzen, Mehryar Mohri, Brian Roark

    Proceedings of the Ninth International Conference on Automata (CIAA 2004), Kingston, Ontario, Canada

  •   

    An Optimal Pre-Determinization Algorithm for Weighted Transducers

    Cyril Allauzen, Mehryar Mohri

    Theoretical Computer Science, vol. 328 (2004)

  •   

    General Indexation of Weighted Automata - Application to Spoken Utterance Retrieval

    Cyril Allauzen, Mehryar Mohri, Murat Saraclar

    Proceedings of the annual meeting of the Human Language Technology conference and North American Chapter of the Association for Computational Linguistics (HLT/NAACL 2004), Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval, Boston, Massachusetts

  •   

    Statistical Modeling for Unit Selection in Speech Synthesis

    Cyril Allauzen, Mehryar Mohri, Michael Riley

    $42$nd Meeting of the Association for Computational Linguistics (ACL 2004), Proceedings of the Conference, Barcelona, Spain

  •   

    An Efficient Pre-determinization Algorithm

    Cyril Allauzen, Mehryar Mohri

    CIAA (2003)

  •  

    Efficient Algorithms for Testing the Twins Property

    Cyril Allauzen, Mehryar Mohri

    Journal of Automata, Languages and Combinatorics, vol. 8 (2003)

  •   

    Finitely Subsequential Transducers

    Cyril Allauzen, Mehryar Mohri

    Int. J. Found. Comput. Sci., vol. 14 (2003)

  •   

    Generalized Algorithms for Constructing Statistical Language Models

    Cyril Allauzen, Mehryar Mohri, Brian Roark

    ACL (2003)

  •   

    $p$-Subsequentiable Transducers

    Cyril Allauzen, Mehryar Mohri

    Seventh International Conference on Automata (CIAA 2002), Tours, France, Springer, Berlin-NY (2003), pp. 24-34

  •   

    An Efficient Pre-Determinization Algorithm

    Cyril Allauzen, Mehryar Mohri

    Eighth International Conference on Automata (CIAA 2003), Santa Barbara, CA, Springer, Berlin-NY, pp. 83-95

  •   

    Finitely Subsequential Transducers

    Cyril Allauzen, Mehryar Mohri

    International Journal of Foundations of Computer Science, vol. 14 (2003), pp. 983-994

  •   

    Generalized Algorithms for Constructing Statistical Language Models

    Cyril Allauzen, Mehryar Mohri, Brian Roark

    $41$st Meeting of the Association for Computational Linguistics (ACL 2003), Proceedings of the Conference, Sapporo, Japan

  •   

    Generalized Optimization Algorithm for Speech Recognition Transducers

    Cyril Allauzen, Mehryar Mohri

    Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong

  •   

    p-Subsequentiable Transducers

    Cyril Allauzen, Mehryar Mohri

    CIAA (2002), pp. 24-34

  •   

    $p$-Subsequentiable Transducers

    Cyril Allauzen, Mehryar Mohri

    Proceedings of the Seventh International Conference on Automata (CIAA 2002), Tours, France

  •   

    On the Determinizability of Weighted Automata and Transducers

    Cyril Allauzen, Mehryar Mohri

    Proceedings of the workshop Weighted Automata: Theory and Applications (WATA), Dresden, Germany (2002)