Machine Intelligence

Google is at the forefront of innovation in Machine Intelligence, with active research exploring virtually all aspects of machine learning, including deep learning and more classical algorithms. Much of our work on language, speech, translation, visual processing, ranking and prediction relies on Machine Intelligence and explores theory as well as application. In all of those tasks and many others, we gather large volumes of direct or indirect evidence of relationships of interest and apply learning algorithms to understand and generalize.

Machine Intelligence at Google raises deep scientific and engineering challenges, allowing us to contribute to the broader academic research community through technical talks and publications in major conferences and journals. Contrary to the assumptions of much of current theory and practice, the statistics of the data we observe shift rapidly, the features of interest change as well, and the volume of data often requires enormous computational capacity. When learning systems are placed at the core of interactive services in a fast-changing and sometimes adversarial environment, techniques such as deep learning and statistical models need to be combined with ideas from control and game theory.

Recent Publications

InstructPipe: Building Visual Programming Pipelines in Visual Blocks with Human Instructions Using LLMs
Alex Olwal, Mark Sherwood, Jing Jin, Na Li, Jingtao Zhou, Jun Jiang, Ram Iyengar, Zhongyi Zhou, Yiyi Huang, Kristen Wright, Xiuxiu Yuan, Jason Mayes
Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (CHI), ACM, pp. 23

USM-SCD: USM-Based Multilingual Speaker Change Detection
Yu Zhang, Yongqiang Wang, Jason Pelecanos, Yiling Huang, Han Lu
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 11801-11805

Artificial Intelligence in Healthcare: A Perspective from Google
Lily Peng, Lisa Lehmann
Artificial Intelligence in Healthcare, Elsevier (2024)

FP-Fed: Privacy-Preserving Federated Detection of Browser Fingerprinting
Meenatchi Sundaram Muthu Selva Annamalai, Emiliano De Cristofaro
Network and Distributed System Security (NDSS) Symposium (2024)

Multimodal Modeling for Spoken Language Identification
Yu Zhang, Wei Han, Shikhar Bharadwaj, Sriram (Sri) Ganapathy, Sid Dalmia
Proceedings of 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024) (2024)