Aren Jansen

I am currently a Research Scientist at Google, working in the Sound Understanding Group on machine learning for speech, music and audio processing. Before joining Google, I was a Research Scientist at the Johns Hopkins University Human Language Technology Center of Excellence, an Assistant Research Professor in the John Hopkins Department of Electrical and Computer Engineering, and a faculty member of the Center for Language and Speech Processing. My research has explored a wide range of ML topics that involve unsupervised/semi-supervised representation learning, information retrieval, content-based recommendation, latent structure discovery, time series modeling and analysis, and scalable algorithms for big data applications. See my personal website or my Google scholar page for a full list of publications.

Research Areas

Authored Publications

Google Publications

Other Publications

MusicLM: Generating Music From Text

Andrea Agostinelli

Timo Denk

Zalán Borsos

Jesse Engel

Mauro Verzetti

Antoine Caillon

Qingqing Huang

Aren Jansen

Adam Roberts

Marco Tagliasacchi

Matt Sharifi

Neil Zeghidour

Christian Frank

under review (2023)

A Machine-Learning Based Objective Measure for ALS disease progression

Fernando Viera

Subhashini Venugopalan

Alan S Premasiri

Maeve McNally

Aren Jansen

Kevin James Fleming McCloskey

Michael Brenner

Steven Perrin

npj Digital Medicine (2022)

MuLan: A Joint Embedding of Music Audio and Natural Language

Qingqing Huang

Aren Jansen

Joonseok Lee

Ravi Ganti

Judith Yue Li

Daniel P. W. Ellis

Proceedings of the the 23rd International Society for Music Information Retrieval Conference (ISMIR) (2022) (to appear)

Shared computational principles for language processing in humans and deep language models

Ariel Goldstein

Zaid Zada

Eliav Buchnik

Mariano Schain

Amy Price

Bobbi Aubrey

Samuel A. Nastase

Amir Feder

Dotan Emanuel

Alon Cohen

Aren Jansen

Harshvardhan Gazula

Gina Choe

Aditi Rao

Catherine Kim

Colton Casto

Lora Fanda

Werner Doyle

Daniel Friedman

Patricia Dugan

Lucia Melloni

Roi Reichart

Sasha Devore

Adeen Flinker

Liat Hasenfratz

Omer Levy,

Avinatan Hassidim

Michael Brenner

Yossi Matias

Kenneth A. Norman

Orrin Devinsky

Uri Hasson

Nature Neuroscience (2022)

Universal Paralinguistic Speech Representations Using Self-Supervised Conformers

Aren Jansen

Daniel S. Park

Joel Shor

Wei Han

Yu Zhang

ICASSP 2022 (2022)

Self-Supervised Learning from Automatically Separated Sound Scenes

Eduardo Fonseca

Aren Jansen

Dan Ellis

Scott Wisdom

Marco Tagliasacchi

John Hershey

Manoj Plakal

Shawn Hershey

R. Channing Moore

Xavier Serra

WASPAA 2021 (2021)

Attention Bottlenecks for Multimodal Fusion

Arsha Nagrani

Shan Yang

Anurag Arnab

Aren Jansen

Cordelia Schmid

Chen Sun

(2021)

A Convolutional Neural Network for Automated Detection of Humpback Whale Song in a Diverse, Long-Term Passive Acoustic Dataset

Ann N. Allen

Matt Harvey

Lauren Harrell

Aren Jansen

Karlina P. Merkens

Carrie C. Wall

Julie Cattiau

Erin M. Oleson

Frontiers in Marine Science, vol. 8 (2021), pp. 165

Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds

Efthymios Tzinis

Scott Wisdom

Aren Jansen

Shawn Hershey

Tal Remez

Daniel P. W. Ellis

John R. Hershey

International Conference on Learning Representations (ICLR) 2021

The Benefit of Temporally-Strong Labels in Audio Event Classification

Shawn Hershey

Dan Ellis

Eduardo Fonseca

Aren Jansen

Caroline Liu

R. Channing Moore

Manoj Plakal

Proceedings of ICASSP 2021 (2021)

Sparse, Efficient, and Semantic MixIT: Taming In-the-Wild Unsupervised Sound Separation

Scott Wisdom

Aren Jansen

Ron J. Weiss

Hakan Erdogan

John Hershey

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (2021)

Improving Universal Sound Separation Using Sound Classification

Efthymios Tzinis

Scott Wisdom

John Hershey

Aren Jansen

Dan Ellis

ICASSP (2020)

Self-Supervised Audio-Visual Separation of On-Screen Sounds from Unlabeled Videos

Efthymios Tzinis

Scott Wisdom

Aren Jansen

Shawn Hershey

Tal Remez

Dan Ellis

John Hershey

NeurIPS 2020 Workshop on Self-Supervised Learning for Speech and Audio Processing

Large-Scale Weakly-Supervised Content Embeddingsfor Music Recommendation and Tagging

Qingqing Huang

Aren Jansen

Li Zhang

Dan Ellis

Rif A. Saurous

John Roberts Anderson

ICASSP 2020 (2020)

Towards Learning a Universal Non-Semantic Representation of Speech

Joel Shor

Aren Jansen

Ronnie Zvi Maor

Oran Lang

Omry Tuval

Félix de Chaumont Quitry

Marco Tagliasacchi

Ira Shavitt

Dotan Emanuel

Proc. Interspeech 2020 (2020)

Coincidence, Categorization, and Consolidation: Learning to Recognize Sounds with Minimal Supervision

Aren Jansen

Dan Ellis

Shawn Hershey

R. Channing Moore

Manoj Plakal

Ashok Popat

Rif A. Saurous

Proceedings of ICASSP 2020 (2020) (to appear)

Unsupervised Learning of Semantic Audio Representations

Aren Jansen

Manoj Plakal

Ratheet Pandya

Dan Ellis

Shawn Hershey

Jiayang Liu

Channing Moore

Rif A. Saurous

Proceedings of ICASSP 2018 (to appear)

A Segmental Framework for Fully-Unsupervised Large-Vocabulary Speech Recognition

Herman Kamper

Aren Jansen

Sharon Goldwater

Computer Speech and Language (2017) (to appear)

Large-Scale Audio Event Discovery in One Million YouTube Videos

Aren Jansen

Jort F. Gemmeke

Daniel P. W. Ellis

Xiaofeng Liu

Wade Lawrence

Dylan Freedman

Proceedings of ICASSP (2017) (to appear)

Audio Set: An ontology and human-labeled dataset for audio events

Jort F. Gemmeke

Daniel P. W. Ellis

Dylan Freedman

Aren Jansen

Wade Lawrence

R. Channing Moore

Manoj Plakal

Marvin Ritter

Proc. IEEE ICASSP 2017, New Orleans, LA (to appear)

Towards Learning Semantic Audio Representations from Unlabeled Data

Aren Jansen

Manoj Plakal

Ratheet Pandya

Dan Ellis

Shawn Hershey

Jiayang Liu

Channing Moore

Rif A. Saurous

NIPS Workshop on Machine Learning for Audio Signal Processing (ML4Audio) (2017) (to appear)

CNN Architectures for Large-Scale Audio Classification

Shawn Hershey

Sourish Chaudhuri

Daniel P. W. Ellis

Jort F. Gemmeke

Aren Jansen

Channing Moore

Manoj Plakal

Devin Platt

Rif A. Saurous

Bryan Seybold

Malcolm Slaney

Ron Weiss

Kevin Wilson

International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE (2017)

Unsupervised Word Segmentation and Lexicon Discovery Using Acoustic Word Embeddings

Herman Kamper

Aren Jansen

Sharon Goldwater

IEEE Transactions on Audio, Speech, and Language Processing (2016)

No Results Found

Search on Google Scholar

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Aren Jansen

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Aren Jansen

Research Areas

Filter by:

Year

Team

Research Area

Join us

AI/ML Foundations  & Capabilities