Cyril Allauzen
Cyril is an author of the OpenFst Library, the OpenKernel Library and the GRM Library.
Google Publications
-
Language Model Verbalization for Automatic Speech Recognition
Hasim Sak, Françoise Beaufays, Kaisuke Nakajima, Cyril Allauzen
Proc ICASSP, IEEE (2013) (to appear)
-
A Pushdown Transducer Extension for the OpenFst Library
CIAA, Springer (2012), pp. 66-77
-
Language Modeling for Automatic Speech Recognition Meets the Web: Google Search by Voice
Ciprian Chelba, Johan Schalkwyk, Boulos Harb, Carolina Parada, Cyril Allauzen, Leif Johnson, Michael Riley, Peng Xu, Preethi Jyothi, Thorsten Brants, Vida Ha, Will Neveitt
University of Toronto (2012)
-
The OpenGrm Open-Source Finite-State Grammar Software Libraries
Brian Roark, Richard Sproat, Cyril Allauzen, Michael Riley, Jeffrey Sorensen, Terry Tai
ACL (System Demonstrations) (2012), pp. 61-66
-
Voice Query Refinement
Cyril Allauzen, Edward Benson, Ciprian Chelba, Michael Riley, Johan Schalkwyk
Interspeech (2012)
-
A Dual Coordinate Descent Algorithm for SVMs Combined with Rational Kernels
Cyril Allauzen, Corinna Cortes, Mehryar Mohri
International Journal of Foundations of Computer Science, vol. 22 (2011), pp. 1761-1779
-
A Filter-based Algorithm for Efficient Composition of Finite-State Transducers
Cyril Allauzen, Michael Riley, Johan Schalkwyk
International Journal of Foundations of Computer Science, vol. 22 (2011), pp. 1781-1795
-
Bayesian Language Model Interpolation for Mobile Speech Input
Interspeech 2011, pp. 1429-1432
-
General Algorithms for Testing the Ambiguity of Finite Automata and the Double-Tape Ambiguity of Finite-State Transducers
Cyril Allauzen, Mehryar Mohri, Ashish Rastogi
International Journal of Foundations of Computer Science, vol. 22 (2011), pp. 883-904
-
Hierarchical Phrase-Based Translation Representations
Gonzalo Iglesias, Cyril Allauzen, William Byrne, Adrià de Gispert, Michael Riley
Proceedings of EMNLP 2011
-
Language Modeling for Automatic Speech Recognition Meets the Web: Google Search by Voice
Ciprian Chelba, Johan Schalkwyk, Boulos Harb, Carolina Parada, Cyril Allauzen, Michael Riley, Peng Xu, Thorsten Brants, Vida Ha, Will Neveitt
OGI/OHSU Seminar Series, Portland, Oregon, USA (2011)
-
Unary Data Structures for Language Models
Jeffrey Sorensen, Cyril Allauzen
Interspeech 2011, International Speech Communication Association, pp. 1425-1428
-
Expected Sequence Similarity Maximization
Cyril Allauzen, Shankar Kumar, Wolfgang Macherey, Mehryar Mohri, Michael Riley
NAACL HLT (2010)
-
Filters for Efficient Composition of Weighted Finite-State Transducers
Cyril Allauzen, Michael Riley, Johan Schalkwyk
CIAA (2010), pp. 28-38
-
Large-Scale Training of SVMs with Automata Kernels
Cyril Allauzen, Corinna Cortes, Mehryar Mohri
CIAA (2010), pp. 17-27
-
On-Demand Language Model Interpolation for Mobile Speech Input
Brandon Ballinger, Cyril Allauzen, Alexander Gruenstein, Johan Schalkwyk
Interspeech (2010), pp. 1812-1815
-
SVM Optimization for Lattice Kernels
Cyril Allauzen, Corinna Cortes, Mehryar Mohri
Mining and Learning with Graphs (2010)
-
A Generalized Composition Algorithm for Weighted Finite-State Transducers
Cyril Allauzen, Michael Riley, Johan Schalkwyk
Interspeech 2009
-
N-Way Composition of Weighted Finite-State Transducers
International Journal of Foundations of Computer Science, vol. 20 (2009), pp. 613-627
-
OpenFst: An Open-Source, Weighted Finite-State Transducer Library and its Applications to Speech and Language
Michael Riley, Cyril Allauzen, Martin Jansche
Proceedings of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009 conference, Tutorials
-
3-Way Composition of Weighted Finite-State Transducers
Proceedings of the 13th International Conference on Implementation and Application of Automata (CIAA 2008), Springer-Verlag, Heidelberg, Germany, San Francisco, California, pp. 262-273
-
General Algorithms for Testing the Ambiguity of Finite Automata
Cyril Allauzen, Mehryar Mohri, Ashish Rastogi
Proceedings of the Twelfth International Conference on Developments in Language Theory (DLT 2008), Springer, Heidelberg, Germany, Kyoto, Japan
-
General Algorithms for Testing the Ambiguity of Finite Automata
Cyril Allauzen, Mehryar Mohri, Ashish Rastogi
DLT 2008, LNCS 5257, Springer, pp. 108-120
-
Linear-Space Computation of the Edit-Distance between a String and a Finite Automaton
London Algorithmics 2008: Theory and Practice, College Publications (to appear)
-
Sequence Kernels for Predicting Protein Essentiality
Cyril Allauzen, Mehryar Mohri, Ameet Talwalkar
Proceedings of ICML 2008
Previous Publications
-
Written-Domain Language Modeling for Automatic Speech Recognition
Hasim Sak, Yun-hsuan Sung, Françoise Beaufays, Cyril Allauzen
Interspeech (2013) (to appear)
-
OpenFst: a General and Efficient Weighted Finite-State Transducer Library
Cyril Allauzen, Michael Riley, Johan Schalkwyk, Wojciech Skut, Mehryar Mohri
Proceedings of the 12th International Conference on Implementation and Application of Automata (CIAA 2007), Springer-Verlag, Heidelberg, Germany, Prague, Czech Republic
-
A Unified Construction of the Glushkov, Follow, and Antimirov Automata
MFCS (2006), pp. 110-121
-
A Unified Construction of the Glushkov, Follow, and Antimirov Automata
Proceedings of the 31st International Symposium on Mathematical Foundations of Computer Science (MFCS 2006), Springer-Verlag, Heidelberg, Germany, Stará Lesná, Slovakia, pp. 110-121
-
The design principles and algorithms of a weighted grammar library
Cyril Allauzen, Mehryar Mohri, Brian Roark
Int. J. Found. Comput. Sci., vol. 16 (2005), pp. 403-421
-
A General Weighted Grammar Library
Cyril Allauzen, Mehryar Mohri, Brian Roark
Ninth International Conference on Automata (CIAA 2004), Kingston, Canada, July 22-24, 2004, Springer-Verlag, Berlin-NY (2005)
-
The Design Principles and Algorithms of a Weighted Grammar Library
Cyril Allauzen, Mehryar Mohri, Brian Roark
International Journal of Foundations of Computer Science, vol. 16 (2005)
-
A General Weighted Grammar Library
Cyril Allauzen, Mehryar Mohri, Brian Roark
CIAA (2004), pp. 23-34
-
A Generalized Construction of Integrated Speech Recognition Transducers
Cyril Allauzen, Mehryar Mohri, Brian Roark, Michael Riley
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), Montreal, Canada
-
An optimal pre-determinization algorithm for weighted transducers
Theor. Comput. Sci., vol. 328 (2004), pp. 3-18
-
Statistical Modeling for Unit Selection in Speech Synthesis
Cyril Allauzen, Mehryar Mohri, Michael Riley
42nd Meeting of the Association for Computational Linguistics (ACL 2004), Proceedings of the Conference, Barcelona, Spain
-
A General Weighted Grammar Library
Cyril Allauzen, Mehryar Mohri, Brian Roark
Proceedings of the Ninth International Conference on Automata (CIAA 2004), Kingston, Ontario, Canada
-
An Optimal Pre-Determinization Algorithm for Weighted Transducers
Theoretical Computer Science, vol. 328 (2004)
-
General Indexation of Weighted Automata - Application to Spoken Utterance Retrieval
Cyril Allauzen, Mehryar Mohri, Murat Saraclar
Proceedings of the annual meeting of the Human Language Technology conference and North American Chapter of the Association for Computational Linguistics (HLT/NAACL 2004), Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval, Boston, Massachusetts
-
Statistical Modeling for Unit Selection in Speech Synthesis
Cyril Allauzen, Mehryar Mohri, Michael Riley
42nd Meeting of the Association for Computational Linguistics (ACL 2004), Proceedings of the Conference, Barcelona, Spain
-
An Efficient Pre-determinization Algorithm
CIAA (2003)
-
Efficient Algorithms for Testing the Twins Property
Journal of Automata, Languages and Combinatorics, vol. 8 (2003)
-
Finitely Subsequential Transducers
Int. J. Found. Comput. Sci., vol. 14 (2003)
-
Generalized Algorithms for Constructing Statistical Language Models
Cyril Allauzen, Mehryar Mohri, Brian Roark
ACL (2003)
-
p-Subsequentiable Transducers
Seventh International Conference on Automata (CIAA 2002), Tours, France, Springer, Berlin-NY (2003), pp. 24-34
-
An Efficient Pre-Determinization Algorithm
Eighth International Conference on Automata (CIAA 2003), Santa Barbara, CA, Springer, Berlin-NY, pp. 83-95
-
Finitely Subsequential Transducers
International Journal of Foundations of Computer Science, vol. 14 (2003), pp. 983-994
-
Generalized Algorithms for Constructing Statistical Language Models
Cyril Allauzen, Mehryar Mohri, Brian Roark
41st Meeting of the Association for Computational Linguistics (ACL 2003), Proceedings of the Conference, Sapporo, Japan
-
Generalized Optimization Algorithm for Speech Recognition Transducers
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong
-
p-Subsequentiable Transducers
CIAA (2002), pp. 24-34
-
p-Subsequentiable Transducers
Proceedings of the Seventh International Conference on Automata (CIAA 2002), Tours, France
-
On the Determinizability of Weighted Automata and Transducers
Proceedings of the workshop Weighted Automata: Theory and Applications (WATA), Dresden, Germany (2002)