Artificial Intelligence and Machine Learning

376 Publications

  •    

    A Discriminative Latent Variable Model for Online Clustering

    Rajhans Samdani, Kai-Wei Chang, Dan Roth

    International Conference on Machine Learning (2014) (to appear)

  •    

    Affinity Weighted Embedding

    Jason Weston, Ron Weiss, Hector Yee

    International Conference on Machine Learning (2014)

  •   

    Applications of Maximum Entropy Rankers to Problems in Spoken Language Processing

    Richard Sproat, Keith Hall

    Interspeech 2014, International Speech Communications Association (to appear)

  •    

    Asynchronous Stochastic Optimization for Sequence Training of Deep Neural Networks

    Georg Heigold, Erik McDermott, Vincent Vanhoucke, Andrew Senior, Michiel Bacchiani

    Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Firenze, Italy (2014)

  •    

    Cicada: Predictive Guarantees for Cloud Network Bandwidth

    Katrina LaCurts, Jeffrey C Mogul, Hari Balakrishnan, Yoshio Turner

    MIT (2014), MIT-CSAIL-TR-2014-004

  •  

    Deep Convolutional Ranking for Multilabel Image Annotation

    Yunchao Gong, Yangqing Jia, Alexander Toshev, Thomas Leung, Sergey Ioffe

    International Conference on Learning Representations (2014) (to appear)

  •   

    Enhanced Search with Wildcards and Morphological Inflections in the Google Books Ngram Viewer

    Jason Mann, David Zhang, Lu Yang, Dipanjan Das, Slav Petrov

    Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Demonstrations), Association for Computational Linguistics (2014)

  •    

    Frame-Semantic Parsing

    Dipanjan Das, Desai Chen, André F. T. Martins, Nathan Schneider, Noah A. Smith

    Computational Linguistics, vol. 40:1 (2014), pp. 9-56

  •    

    Insulin Resistance: Regression and Clustering

    Sangho Yoon

    PLoS ONE, vol. 9(6) (2014)

  •    

    Intriguing properties of neural networks

    Christian Szegedy, Wojciech Zaremba, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian Goodfellow, Rob Fergus

    International Conference on Learning Representations (2014)

  •    

    Large-scale Video Classification with Convolutional Neural Networks

    Andrej Karpathy, George Toderici, Sanketh Shetty, Thomas Leung, Rahul Sukthankar, Li Fei-Fei

    Proceedings of International Computer Vision and Pattern Recognition (CVPR 2014), IEEE

  •    

    Machine Learning Applications for Data Center Optimization

    Jim Gao

    Google (2014)

  •    

    Machine Learning in an Auction Environment

    Patrick Hummel, Preston McAfee

    Proceedings of the 23rd International Conference on the World Wide Web (WWW) (2014), pp. 7-18

  •   

    Projecting the Knowledge Graph to Syntactic Parsing

    Andrea Gesmundo, Keith Hall

    EACL 2014: 15th Conference of the European Chapter of the Association for Computational Linguistics

  •   

    Reducing the Sampling Complexity of Topic Models

    Aaron Li, Amr Ahmed, Sujith Ravi, Alexander J Smola

    ACM Conference on Knowledge Discovery and Data Mining (KDD) (2014) (to appear)

  •    

    Scalable Hierarchical Multitask Learning Algorithms for Conversion Optimization in Display Advertising

    Amr Ahmed, Abhimanyu Das, Alexander J. Smola

    ACM International Conference on Web Search And Data Mining (WSDM) (2014)

  •    

    Semantic Frame Identification with Distributed Word Representations

    Karl Moritz Hermann, Dipanjan Das, Jason Weston, Kuzman Ganchev

    Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (2014)

  •   

    Sequence Discriminative Distributed Training of Long Short-Term Memory Recurrent Neural Networks

    Hasim Sak, Oriol Vinyals, Georg Heigold, Andrew Senior, Erik McDermott, Rajat Monga, Mark Mao

    Interspeech (2014) (to appear)

  •   

    Small-Footprint Keyword Spotting using Deep Neural Networks

    Guoguo Chen, Carolina Parada, Georg Heigold

    ICASSP, IEEE (2014)

  •    

    Statistical Parametric Speech Synthesis

    Heiga Zen

    UKSpeech Conference, Edinburgh, UK (2014)

  •    

    Taxonomy Discovery for Personalized Recommendation

    Yuchen Zhang, Amr Ahmed, Vanja Josifovski, Alexander J Smola

    ACM International Conference on Web Search And Data Mining (WSDM) (2014)

  •    

    Training Highly Multi-class Linear Classifiers

    Maya R. Gupta, Samy Bengio, Jason Weston

    Journal of Machine Learning Research (JMLR) (2014), pp. 1461-1492

  •   

    Unconstrained Online Linear Learning in Hilbert Spaces: Minimax Algorithms and Normal Approximations

    H. Brendan McMahan, Francesco Orabona

    Proceedings of the 27th Annual Conference on Learning Theory (COLT) (2014) (to appear)

  •    

    Word Embeddings for Speech Recognition

    Samy Bengio, Georg Heigold

    Proceedings of the 15th Conference of the International Speech Communication Association, Interspeech (2014) (to appear)

  •    

    Zero-Shot Learning by Convex Combination of Semantic Embeddings

    Mohammad Norouzi, Tomas Mikolov, Samy Bengio, Yoram Singer, Jonathon Shlens, Andrea Frome, Greg Corrado, Jeffrey Dean

    International Conference on Learning Representations (2014)

  •    

    Local Collaborative Ranking

    Joonseok Lee, Samy Bengio, Seungyeon Kim, Guy Lebanon, Yoram Singer

    International World Wide Web Conference, WWW (2014)

  •    

    3DNN: Viewpoint Invariant 3D Geometry Matching for Scene Understanding

    Scott Satkin, Martial Hebert

    Proceedings of the International Conference on Computer Vision (ICCV) (2013) (to appear)

  •   

    A Generic Technique for Synthesizing Bounded Finite-State Controllers

    Yuxiao Hu, Giuseppe De Giacomo

    Proceedings of the International Conference on Automated Planning and Scheduling, Association for the Advancement of Artificial Intelligence (2013), pp. 109-116

  •    

    A Method for Measuring Online Audiences

    Jim Koehler, Evgeny Skvortsov, Wiesner Vos

    Google Inc (2013), pp. 1-24 (to appear)

  •  

    A Semantic Matching Energy Function for Learning with Multi-relational Data

    Xavier Glorot, Antoine Bordes, Jason Weston, Yoshua Bengio

    International Conference on Learning Representations (2013)

  •    

    Ad Click Prediction: a View from the Trenches

    H. Brendan McMahan, Gary Holt, D. Sculley, Michael Young, Dietmar Ebner, Julian Grady, Lan Nie, Todd Phillips, Eugene Davydov, Daniel Golovin, Sharat Chikkerur, Dan Liu, Martin Wattenberg, Arnar Mar Hrafnkelsson, Tom Boulos, Jeremy Kubica

    Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) (2013)

  •   

    Affinity Weighted Embedding

    Jason Weston, Ron Weiss, Hector Yee

    International Conference on Learning Representations (2013)

  •   

    An Empirical study of learning rates in deep neural networks for speech recognition

    Andrew Senior, Georg Heigold, Marc'Aurelio Ranzato, Ke Yang

    Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, CA (2013) (to appear)

  •    

    Bayes and Big Data: The Consensus Monte Carlo Algorithm

    Steven L. Scott, Alexander W. Blocker, Fernando V. Bonassi

    Bayes 250 (2013) (to appear)

  •   

    Breaking Out of Local Optima with Count Transforms and Model Recombination: A Study in Grammar Induction

    Valentin I. Spitkovsky, Daniel Jurafsky, Hiyan Alshawi

    2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013; Best Paper Award)

  •    

    Classifying with Confidence From Incomplete Test Data

    Nathan Parrish, Hyrum S. Anderson, Maya R. Gupta, Dun Yu Hsiao

    Journal of Machine Learning Research (JMLR), vol. 14 (2013) (to appear)

  •    

    Cluster forest

    Donghui Yan, Aiyou Chen, Michael I Jordan

    Computational Statistics and Data Analysis, vol. 66 (2013), pp. 178-192

  •   

    Comparative study of classifiers to mitigate intersymbol interference in diffuse indoor optical wireless communication links

    Sujan Rajbhandari, Joe Faith, Zabih Ghassemlooy

    Optik - International Journal for Light and Electron Optics (2013)

  •  

    Connecting Language and Knowledge Bases with Embedding Models for Relation Extraction

    Jason Weston, Antoine Bordes, Oksana Yakhnenko, Nicolas Usunier

    Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing.

  •    

    Cross-Lingual Discriminative Learning of Sequence Models with Posterior Regularization

    Kuzman Ganchev, Dipanjan Das

    Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing

  •    

    Data Fusion: Resolving Conflicts from Multiple Sources

    Xin Luna Dong, Laure Berti-Equille, Divesh Srivastava

    WAIM (2013), pp. 64-76 (to appear)

  •    

    DeViSE: A Deep Visual-Semantic Embedding Model

    Andrea Frome, Greg Corrado, Jonathon Shlens, Samy Bengio, Jeffrey Dean, Marc’Aurelio Ranzato, Tomas Mikolov

    Neural Information Processing Systems (NIPS) (2013)

  •    

    Deep Learning in Speech Synthesis

    Heiga Zen

    8th ISCA Speech Synthesis Workshop, Barcelona, Spain (2013)

  •  

    Deep Learning via Semi-Supervised Embedding

    Jason Weston, Frederic Ratle, Hossein Mobahi, Ronan Collobert

    Neural Networks Tricks of the Trade, Reloaded, Springer (2013)

  •  

    Deep Neural Networks for Object Detection

    Christian Szegedy, Alexander Toshev, Dumitru Erhan

    Advances in Neural Information Processing Systems (2013)

  •    

    Discriminative Segment Annotation in Weakly Labeled Video

    Kevin Tang, Rahul Sukthankar, Jay Yagnik, Li Fei-Fei

    Proceedings of International Conference on Computer Vision and Pattern Recognition (CVPR 2013)

  •   

    Distributed Large-scale Natural Graph Factorization

    Amr Ahmed, Nino Shervashidze, Shravan Narayanamurthy, Vanja Josifovski, Alexander J Smola

    Proceedings of the 22nd International World Wide Web Conference (WWW 2013) (to appear)

  •    

    Efficient Estimation of Word Representations in Vector Space

    Tomas Mikolov, Kai Chen, Greg S. Corrado, Jeffrey Dean

    International Conference on Learning Representations (2013)

  •    

    Efficient Learning of Sparse Ranking Functions

    Mark Stevens, Samy Bengio, Yoram Singer

    Empirical Inference, Springer (2013)

  •    

    Estimation, Optimization, and Parallelism when Data is Sparse

    John C. Duchi, Michael I. Jordan, H. Brendan McMahan

    Advances in Neural Information Processing Systems (NIPS) (2013)

  •    

    Fast, Accurate Detection of 100,000 Object Classes on a Single Machine

    Thomas Dean, Mark Ruzon, Mark Segal, Jonathon Shlens, Sudheendra Vijayanarasimhan, Jay Yagnik

    Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE Computer Society, Washington, DC, USA (2013)

  •    

    Fastfood - Approximating Kernel Expansions in Loglinear Time

    Quoc Le, Tamas Sarlos, Alex Smola

    30th International Conference on Machine Learning (ICML), Omnipress (2013)

  •   

    Focused Matrix Factorization for Audience Selection in Display Advertising

    Bhargav Kanagal, Amr Ahmed, Sandeep Pandey, Vanja Josifovski, Lluis Garcia-Pueyo, Jeff Yuan

    Proceedings of the 29th International Conference on Data Engineering (ICDE) (2013) (to appear)

  •   

    Guest editors' introduction: Special section on learning deep architectures

    Samy Bengio, Li Deng, Hugo Larochelle, Honglak Lee, Ruslan Salakhutdinov

    IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), vol. 35 (2013), pp. 1795-1797

  •   

    Hierarchical Geographical Modeling of User locations from Social Media Posts

    Amr Ahmed, Liangjie Hong, Alexander J Smola

    Proceedings of the 22nd International World Wide Web Conference (WWW 2013) (to appear)

  •   

    Image Annotation in Presence of Noisy Labels

    Chandrashekhar V., Shailesh Kumar, C. V. Jawahar

    International Conference on Pattern Recognition and Machine Intelligence (2013) (to appear)

  •  

    KDD tutorial: The Dataminer Guide to Scalable Mixed-Membership and Nonparametric Bayesian Models

    Amr Ahmed, Alexander J Smola

    ACM Conference on Knowledge Discovery and Data Mining (KDD) (2013) (to appear)

  •   

    Label Partitioning for Sublinear Ranking

    Jason Weston, Ameesh Makadia, Hector Yee

    International Conference on Machine Learning (2013)

  •   

    Language-Independent Discriminative Parsing of Temporal Expressions

    Gabor Angeli, Jakob Uszkoreit

    The 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013) (to appear)

  •    

    Large Scale Distributed Acoustic Modeling With Back-off N-grams

    Ciprian Chelba, Peng Xu, Fernando Pereira, Thomas Richardson

    IEEE Transactions on Audio, Speech and Language Processing, vol. 21 (2013), pp. 1158-1169

  •   

    Large Scale SVD and Manifold Learning

    Ameet Talwalkar, Sanjiv Kumar, Mehryar Mohri, Henry A. Rowley

    Journal of Machine Learning Research (JMLR) (2013)

  •    

    Large-Scale Learning with Less RAM via Randomization

    Daniel Golovin, D. Sculley, H. Brendan McMahan, Michael Young

    Proceedings of the 30th International Conference on Machine Learning (ICML) (2013), pp. 10

  •   

    Latent Factor Models with Additive Hierarchically-smoothed User Preferences

    Amr Ahmed, Bhargav Kanagal, Sandeep Pandey, Vanja Josifovski, Lluis Garcia-Pueyo

    Proceedings of The 6th ACM International Conference on Web Search and Data Mining (WSDM) (2013) (to appear)

  •    

    Learning Binary Codes for High Dimensional Data Using Bilinear Projections

    Yunchao Gong, Sanjiv Kumar, Henry Rowley, Svetlana Lazebnik

    IEEE Computer Vision and Pattern Recognition (2013)

  •    

    Learning Multiple Non-Linear Sub-Spaces using K-RBMs

    Siddhartha Chandra, Shailesh Kumar, C. V. Jawahar

    Computer Vision and Pattern Recognition (2013)

  •    

    Learning Prices for Repeated Auctions with Strategic Buyers

    Kareem Amin, Afshin Rostamizadeh, Umar Syed

    Neural Information Processing Systems (2013)

  •  

    Learning Semantic Representations Of Objects And Their Parts.

    G Mesnil, Antoine Bordes, Jason Weston, Gal Chechik, Yoshua Bengio

    Special Issue on Learning Semantics in Machine Learning Journal (2013) (to appear)

  •    

    Learning kernels using local rademacher complexity

    Corinna Cortes, Marius Kloft, Mehryar Mohri

    Advances in Neural Information Processing Systems (NIPS 2013), MIT Press.

  •    

    Learning to Rank Recommendations with the k-Order Statistic Loss

    Jason Weston, Hector Yee, Ron Weiss

    ACM International Conference on Recommender Systems (RecSys) (2013)

  •   

    Making touchscreen keyboards adaptive to keys, hand postures, and individuals: a hierarchical spatial backoff model approach

    Ying Yin, Tom Ouyang, Kurt Partridge, Shumin Zhai

    Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI 2013), ACM, New York, NY, pp. 2775-2784

  •   

    Measurement and modeling of eye-mouse behavior

    Vidhya Navalpakkam, LaDawn Jentzsch, Rory Sayres, Sujith Ravi, Amr Ahmed, Alex J. Smola

    Proceedings of the 22nd International World Wide Web Conference (2013)

  •    

    Minimax Optimal Algorithms for Unconstrained Linear Optimization

    H. Brendan McMahan, Jacob Abernethy

    Advances in Neural Information Processing Systems (NIPS) (2013)

  •    

    Multi-Armed Recommendation Bandits for Selecting State Machine Policies for Robotic Systems

    Pyry Matikainen, P. Michael Furlong, Rahul Sukthankar, Martial Hebert

    Proceedings of International Conference on Robotics and Automation (ICRA 2013)

  •   

    Multi-class classification with maximum margin multiple kernel

    Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

    Proceedings of the Thirtieth International Conference on Machine Learning (ICML 2013)

  •   

    Multiframe Deep Neural Networks for Acoustic Modeling

    Vincent Vanhoucke, Matthieu Devin, Georg Heigold

    Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, CA (2013)

  •   

    Multilingual acoustic models using distributed deep neural networks

    Georg Heigold, Vincent Vanhoucke, Andrew Senior, Patrick Nguyen, Marc'Aurelio Ranzato, Matthieu Devin, Jeff Dean

    Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, CA (2013)

  •    

    Neighborhood Preserving Codes for Assigning Point Labels: Applications to Stochastic Search

    Shumeet Baluja, Michele Covell

    Procedia Computer Science: 2013 International Conference on Computational Science, Elsevier, pp. 956-965

  •    

    Nonlinear Latent Factorization by Embedding Multiple User Interests

    Jason Weston, Ron Weiss, Hector Yee

    ACM International Conference on Recommender Systems (RecSys) (2013)

  •    

    On Rectified Linear Units For Speech Processing

    M.D. Zeiler, M. Ranzato, R. Monga, M. Mao, K. Yang, Q.V. Le, P. Nguyen, A. Senior, V. Vanhoucke, J. Dean, G.E. Hinton

    38th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver (2013)

  •    

    One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling

    Ciprian Chelba, Tomas Mikolov, Mike Schuster, Qi Ge, Thorsten Brants, Philipp Koehn, Tony Robinson

    ArXiv, Google (2013)

  •  

    POMDP-Based Control of Workflows for Crowdsourcing

    Peng Dai, Christopher H. Lin, Mausam, Daniel S. Weld

    Artificial Intelligence, vol. 202 (2013), pp. 52-85

  •    

    PRIME: Probabilistic Initial 3D Model Generation for Single-Particle Cryo-Electron Microscopy

    Hans Elmlund, Dominika Elmlund, Samy Bengio

    Structure, vol. 21 (2013), pp. 1299-1306

  •    

    Parallel Boosting with Momentum

    Indraneel Mukherjee, Kevin Canini, Rafael Frongillo, Yoram Singer

    ECML PKDD 2013, Part III, LNAI 8190, Springer, Heidelberg, pp. 17-32 (to appear)

  •    

    Point Representation for Local Optimization: Towards Multi-Dimensional Gray Codes

    Shumeet Baluja, Michele Covell

    Proceedings IEEE Congress on Evolutionary Computation, IEEE (2013)

  •    

    ReFr: An Open-Source Reranker Framework

    Daniel M. Bikel, Keith B. Hall

    Interspeech 2013, pp. 756-758

  •    

    Recurrent Neural Networks for Voice Activity Detection

    Thad Hughes, Keir Mierle

    ICASSP, IEEE (2013), pp. 7378-7382

  •    

    Restricted Transfer learning for Text Categorization

    Rajhans Samdani, Gideon Mann

    NIPS Workshop (2013) (to appear)

  •   

    Russian Stress Prediction using Maximum Entropy Ranking

    Keith Hall, Richard Sproat

    EMNLP, ACL (2013)

  •   

    Scalable Decipherment for Machine Translation via Hash Sampling

    Sujith Ravi

    Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL) (2013)

  •   

    Scalable Dynamic Nonparametric Bayesian Models of Content and Users

    Amr Ahmed, Eric P. Xing

    International Joint Conference on Artificial Intelligence (IJCAI - Best paper track) (2013) (to appear)

  •    

    Similarity-based Clustering by Left-Stochastic Matrix Factorization

    Raman Arora, Maya R. Gupta, Amol Kapila, Maryam Fazel

    Journal of Machine Learning Research (JMLR), vol. 14 (2013), pp. 1715-1746

  •    

    Spatiotemporal Deformable Part Models for Action Detection

    Yicong Tian, Rahul Sukthankar, Mubarak Shah

    Proceedings of International Conference on Computer Vision and Pattern Recognition (CVPR 2013)

  •   

    The Nested Chinese Restaurant Franchise Process: User Tracking and Document Modeling

    Amr Ahmed, Liangjie Hong, Alexander J Smola

    International Conference on Machine Learning (ICML) (2013) (to appear)

  •    

    Transfer Learning In MIR: Sharing Learned Latent Representations For Music Audio Classification And Similarity

    Philippe Hamel, Matthew E. P. Davies, Kazuyoshi Yoshii, Masataka Goto

    14th International Conference on Music Information Retrieval (ISMIR '13) (2013) (to appear)

  •  

    Translating Embeddings for Modeling Multi-relational Data.

    Antoine Bordes, Nicolas Usunier, A. Garcia-Duran, Jason Weston, Oksana Yakhnenko

    Neural Information Processing Systems (2013)

  •    

    Using Web Co-occurrence Statistics for Improving Image Categorization

    Samy Bengio, Jeffrey Dean, Dumitru Erhan, Eugene Ie, Quoc Le, Andrew Rabinovich, Jonathon Shlens, Yoram Singer

    arXiv (2013)

  •   

    pSVM for Learning with Label Proportions

    F. Yu, D. Liu, S. Kumar, T. Jebara, S. F. Chang

    International Conference on Machine Learning (ICML) (2013)

  •   

    A Disambiguation Algorithm for Finite Automata and Functional Transducers

    Mehryar Mohri, Andres Munoz Medina

    CIAA (2012), pp. 265-277

  •    

    Accuracy at the Top

    Stephen Boyd, Corinna Cortes, Mehryar Mohri, Ana Radovanovic

    NIPS: Neural Information Processing Systems Foundation (2012)

  •   

    Algorithms for Learning Kernels Based on Centered Alignment

    Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

    Journal of Machine Learning Research, vol. 13 (2012), pp. 795-828

  •   

    Angular Quantization-based Binary Codes for Fast Similarity Search

    Yunchao Gong, Sanjiv Kumar, Vishal Verma, Svetlana Lazebnik

    Neural Information Processing Systems (NIPS) (2012)

  •    

    Application Of Pretrained Deep Neural Networks To Large Vocabulary Speech Recognition

    Navdeep Jaitly, Patrick Nguyen, Andrew Senior, Vincent Vanhoucke

    Proceedings of Interspeech 2012

  •    

    Building high-level features using large scale unsupervised learning

    Quoc Le, Marc'Aurelio Ranzato, Rajat Monga, Matthieu Devin, Kai Chen, Greg Corrado, Jeff Dean, Andrew Ng

    International Conference on Machine Learning (2012)

  •    

    Building adaptive dialogue systems via Bayes-adaptive POMDP

    Shaowei Png, Joelle Pineau, B. Chaib-draa

    IEEE Journal of Selected Topics in Signal Processing, vol. 6(8) (2012), pp. 917-927

  •    

    Compact Hyperplane Hashing with Bilinear Functions

    Wei Liu, Jun Wang, Yadong Mu, Sanjiv Kumar, Shih-Fu Chang

    International Conference on Machine Learning (ICML) (2012)

  •    

    Deep Neural Networks for Acoustic Modeling in Speech Recognition

    Geoffrey Hinton, Li Deng, Dong Yu, George Dahl, Abdel-rahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara Sainath, Brian Kingsbury

    Signal Processing Magazine (2012)

  •   

    Distributed Gibbs sampling for latent variable models

    Arthur Asuncion, Padhraic Smyth, Max Welling, David Newman, Ian Porteous, Scott Triglia

    Scaling up Machine Learning, Cambridge (2012) (to appear)

  •   

    FastEx: Hash Clustering with Exponential Families

    Amr Ahmed, Sujith Ravi, Shravan Narayanamurthy, Alex Smola

    Proceedings of the 26th Conference on Neural Information Processing Systems (NIPS) (2012)

  •    

    Hokusai - Sketching Streams in Real Time

    Sergiy Matusevych, Alex Smola, Amr Ahmed

    Proceedings of the 28th Conference on Uncertainty in Artificial Intelligence (UAI) (2012)

  •    

    Human Computation Must Be Reproducible

    Praveen Paritosh

    WWW 2012, Lyon.

  •    

    Joint Image and Word Sense Discrimination For Image Retrieval

    Aurelien Lucchi, Jason Weston

    ECCV (2012)

  •    

    Large Scale Distributed Deep Networks

    Jeffrey Dean, Greg S. Corrado, Rajat Monga, Kai Chen, Matthieu Devin, Quoc V. Le, Mark Z. Mao, Marc’Aurelio Ranzato, Andrew Senior, Paul Tucker, Ke Yang, Andrew Y. Ng

    NIPS (2012)

  •   

    Large Scale Visual Semantic Extraction

    Samy Bengio

    Frontiers of Engineering - Reports on Leading-Edge Engineering from the 2011 Symposium, The National Academies Press, Washington, D.C. (2012), pp. 61-68

  •    

    Latent Collaborative Retrieval

    Jason Weston, Chong Wang, Ron Weiss, Adam Berenzweig

    International Conference on Machine Learning (2012)

  •    

    Latent Structured Ranking

    Jason Weston, John Blitzer

    UAI (2012)

  •    

    Learning Hierarchical Bag of Words Using Naive Bayes Clustering

    Siddhartha Chandra, Shailesh Kumar, C. V. Jawahar

    Asian Conference on Computer Vision (2012), pp. 382-395

  •    

    Linear classifiers are nearly optimal when hidden variables have diverse effects

    Nader H. Bshouty, Philip M. Long

    Machine Learning, vol. 86 (2012), pp. 209-231

  •    

    Machine learning: a probabilistic perspective

    Kevin P Murphy

    MIT Press, Cambridge, MA (2012)

  •    

    MedLDA: Maximum Margin Supervised Topic Models

    Jun Zhu, Amr Ahmed, Eric P. Xing

    Journal of Machine Learning Research (2012) (to appear)

  •  

    Minimizing Uncertainty in Pipelines

    Nilesh N. Dalvi, Aditya Parameswaran, Vibhor Rastogi

    NIPS (2012) (to appear)

  •    

    Model Recommendation for Action Recognition

    Pyry Matikainen, Rahul Sukthankar, Martial Hebert

    IEEE International Conference on Computer Vision and Pattern Recognition (CVPR'12) (2012)

  •   

    New Analysis and Algorithm for Learning with Drifting Distributions

    Mehryar Mohri, Andres Munoz Medina

    ALT (2012), pp. 124-138

  •    

    No-Regret Algorithms for Unconstrained Online Convex Optimization

    Matthew Streeter, H. Brendan McMahan

    Advances in Neural Information Processing Systems (NIPS) (2012)

  •    

    On Using Nearly-Independent Feature Families for High Precision and Confidence

    Omid Madani, Manfred Georg, David Ross

    Fourth Asian Machine Learning Conference, JMLR workshop and conference proceedings (2012), pp. 269-284

  •    

    On the Difficulty of Nearest Neighbor Search

    Junfeng He, Sanjiv Kumar, Shih-Fu Chang

    International Conference on Machine Learning (ICML) (2012)

  •  

    Online-to-Confidence-Set Conversions and Application to Sparse Stochastic Bandits

    Yasin Abbasi-Yadkori, Dávid Pál, Csaba Szepesvári

    AISTATS 2012

  •    

    Open Problem: Better Bounds for Online Logistic Regression

    H. Brendan McMahan, Matthew Streeter

    COLT/ICML Joint Open Problem Session, JMLR: Workshop and Conference Proceedings (2012)

  •    

    Robust Local Search for Solving RCPSP/max with Durational Uncertainty

    Na Fu, Hoong Chuin Lau, Pradeep Varakantham, Fei Xiao

    Journal of Artificial Intelligence Research, vol. 43 (2012), pp. 43-86

  •   

    Sampling Methods for the Nystrom Method

    Sanjiv Kumar, Mehryar Mohri, Ameet Talwalkar

    Journal of Machine Learning Research (JMLR) (2012)

  •   

    Scalable Active Learning for Multi-Class Image Classification

    Ajay J. Joshi, Fatih Porikli, Nikolaos Papanikolopoulos

    IEEE Transactions on Pattern Analysis and Machine Intelligence (2012)

  •    

    Spectral Intersections for Non-Stationary Signal Separation

    Trausti Kristjansson, Thad Hughes

    Proceedings of InterSpeech 2012, Portland, OR

  •   

    Spectral Learning of General Weighted Automata via Constrained Matrix Completion

    Borja Balle, Mehryar Mohri

    NIPS (2012), pp. 2168-2176

  •    

    Student-t based Robust Spatio-Temporal Prediction

    Yang Chen, Feng Chen, Jing Dai, T. Charles Clancy, Yao-Jan Wu

    IEEE 12th International Conference on Data Mining, IEEE, Brussels, Belgium (2012), pp. 151-160

  •   

    The Foundations of Machine Learning

    Mehryar Mohri, Afshin Rostamizadeh, Ameet Talwalkar

    MIT Press (2012)

  •   

    The multi-iterative closest point tracker: An online algorithm for tracking multiple interacting targets

    Adam Feldman, Maria Hybinette, Tucker Balch

    Journal of Field Robotics, vol. 29.2 (2012), pp. 258-276

  •   

    The word-gesture keyboard: reimagining keyboard interaction (CACM Research Highlight)

    Shumin Zhai, Per Ola Kristensson

    Communications of the ACM, vol. 55, no. 9 (2012), pp. 91-101

  •    

    Three Controversial Hypotheses Concerning Computation in the Primate Cortex

    Thomas Dean, Greg Corrado, Jonathon Shlens

    Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, AAAI Press (2012)

  •    

    Unsupervised Learning for Graph Matching

    Marius Leordeanu, Rahul Sukthankar, Martial Hebert

    International Journal of Computer Vision, vol. 96 (2012), pp. 28-45

  •    

    Weakly Supervised Learning of Object Segmentations from Web-Scale Video

    Glenn Hartmann, Matthias Grundmann, Judy Hoffman, David Tsai, Vivek Kwatra, Omid Madani, Sudheendra Vijayanarasimhan, Irfan Essa, James Rehg, Rahul Sukthankar

    ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part I, Springer-Verlag, Berlin, Heidelberg (2012), pp. 198-208

  •    

    Web-Scale Multi-Task Feature Selection for Behavioral Targeting

    Amr Ahmed, Mohamed Aly, Abhimanyu Das, Alex Smola, Tasos Anastasakos

    Proceedings of The 21st ACM International Conference on Information and Knowledge Management (CIKM), ACM (2012) (to appear)

  •   

    A Dual Coordinate Descent Algorithm for SVMs Combined with Rational Kernels

    Cyril Allauzen, Corinna Cortes, Mehryar Mohri

    International Journal of Foundations of Computer Science, vol. 22 (2011), pp. 1761-1779

  •   

    Algorithms and hardness results for parallel large margin learning

    Philip M. Long, Rocco A. Servedio

    NIPS (2011)

  •   

    Artificial General Intelligence. Proceedings of the 4th International Conference

    Jürgen Schmidhuber, Kristinn Thorisson, Moshe Looks

    Springer Lecture Notes in Artificial Intelligence (2011)

  •   

    Can matrix coherence be efficiently and accurately estimated?

    Mehryar Mohri, Ameet Talwalkar

    Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2011)

  •  

    Combinatorial and Algorithmic Aspects of Sequence Processing (Dagstuhl Seminar 11081)

    Maxime Crochemore, Lila Kari, Mehryar Mohri, Dirk Nowotka

    Dagstuhl Reports, vol. 1 (2011), pp. 47-66

  •    

    Controlling Complexity in Part-of-Speech Induction

    Joao Graca, Kuzman Ganchev, Luisa Coheur, Fernando Pereira, Ben Taskar

    Journal of Artificial Intelligence Research (JAIR), vol. 41 (2011), pp. 527-551

  •    

    Domain Adaptation with Coupled Subspaces

    John Blitzer, Sham Kakade, Dean Foster

    Artificial Intelligence and Statistics (2011)

  •   

    Domain adaptation in regression

    Corinna Cortes, Mehryar Mohri

    Proceedings of The 22nd International Conference on Algorithmic Learning Theory, ALT 2011, Springer, Heidelberg, Germany

  •   

    Ensemble Nystrom

    Sanjiv Kumar, Mehryar Mohri, Ameet Talwalkar

    A book chapter in Ensemble Machine Learning: Theory and Applications, Springer (2011)

  •   

    Ensembles of Kernel Predictors

    Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

    Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence (UAI 2011)

  •    

    Feature Seeding for Action Recognition

    Pyry Matikainen, Rahul Sukthankar, Martial Hebert

    International Conference on Computer Vision (ICCV) (2011)

  •    

    Follow-the-Regularized-Leader and Mirror Descent: Equivalence Theorems and L1 Regularization

    H. Brendan McMahan

    Proceedings of the 14th International Conference on Artificial Intelligence and Statistics (AISTATS) (2011)

  •   

    Hashing with Graphs

    Wei Liu, Jun Wang, Sanjiv Kumar, Shih-Fu Chang

    International Conference on Machine Learning (ICML) (2011)

  •    

    History Dependent Domain Adaptation

    Allen Lavoie, Matthew Eric Otey, Nathan Ratliff

    Domain Adaptation Workshop at NIPS '11 (2011)

  •    

    Improved Time Series Prediction and Symbolic Regression with Affine Arithmetic

    Cassio Pennachin, Moshe Looks, J. A. de Vasconcelos

    Genetic Programming Theory and Practice IX, Springer, 233 Spring Street, New York, NY 10013 (2011), pp. 97-112

  •    

    L1 and L2 Regularization for Multiclass Hinge Loss Models

    Robert C. Moore, John DeNero

    Symposium on Machine Learning in Speech and Natural Language Processing (2011)

  •   

    Large-Scale Image Annotation using Visual Synset

    David Tsai, Yushi Jing, Henry Rowley, Yi Liu, Sergey Ioffe, James Rehg

    Proc. International Conference on Computer Vision (ICCV) (2011)

  •    

    Large-Scale Music Annotation and Retrieval: Learning to Rank in Joint Semantic Spaces

    Jason Weston, Samy Bengio, Philippe Hamel

    Journal of New Music Research (2011)

  •   

    Lateen EM: Unsupervised Training with Multiple Objectives, Applied to Dependency Grammar Induction

    Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jurafsky

    2011 Conference on Empirical Methods in Natural Language Processing (EMNLP 2011)

  •   

    Learning Highlights in Sports Videos Using a Semi-Supervised Approach: Cricket as a Test Case

    Hao Tang, Vivek Kwatra, Mehmet Emre Sargin, Ullas Gargi

    ICME 2011

  •  

    Learning Structured Embeddings of Knowledge Bases

    Antoine Bordes, Jason Weston, Ronan Collobert, Yoshua Bengio

    Proceedings of the 25th Conference on Artificial Intelligence (AAAI) (2011)

  •   

    Learning large-margin halfspaces with more malicious noise

    Philip M. Long, Rocco A. Servedio

    NIPS (2011)

  •    

    Managing Crowdsourced Human Computation

    Panagiotis G. Ipeirotis, Praveen K. Paritosh

    20th International World Wide Web Conference, WWW 2011

  •    

    Models for Neural Spike Computation and Cognition

    David H. Staelin, Carl H. Staelin

    CreateSpace, Seattle, WA (2011), pp. 142

  •    

    On the necessity of irrelevant variables

    David P. Helmbold, Philip M. Long

    ICML (2011)

  •    

    Online Learning in the Manifold of Low-Rank Matrices

    Gal Chechik, Daphna Weinshall, Uri Shalit

    Neural Information Processing Systems (NIPS 23), Curran Associates, Inc. (2011), pp. 2128-2136

  •    

    Posterior Sparsity in Dependency Grammar Induction

    Jennifer Gillenwater, Kuzman Ganchev, Joao Graca, Fernando Pereira, Ben Taskar

    Journal of Machine Learning Research, vol. 12 (2011), pp. 455-490

  •   

    Temporal pooling and multiscale learning for automatic annotation and ranking of music audio

    Philippe Hamel, Simon Lemieux, Yoshua Bengio, Douglas Eck

    International Society for Music Information Retrieval (ISMIR 2011)

  •    

    Wsabie: Scaling Up To Large Vocabulary Image Annotation

    Jason Weston, Samy Bengio, Nicolas Usunier

    Proceedings of the International Joint Conference on Artificial Intelligence, IJCAI (2011)

  •    

    A theory of learning from different domains

    Shai Ben-David, John Blitzer, Koby Crammer, Alex Kulesza, Fernando Pereira, Jennifer Vaughan

    Machine Learning, vol. 79 (2010), pp. 151-175

  •    

    Active Tuples-based Scheme for Bounding Posterior Beliefs

    Bozhena Bidyuk, Rina Dechter, Emma Rollon

    JAIR, vol. 39 (2010), pp. 335-371

  •    

    Adaptive Bound Optimization for Online Convex Optimization

    H. Brendan McMahan, Matthew Streeter

    Proceedings of the 23rd Annual Conference on Learning Theory (COLT) (2010)

  •   

    Algorithms for Learning Kernels Based on Centered Alignment

    Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

    Journal of Machine Learning Research, vol. 13 (2010), pp. 795-828

  •  

    Bayesian Robot System Identification with Input and Output Noise

    Jo-Anne Ting, Aaron D'Souza, Stefan Schaal

    Neural Networks (2010) (to appear)

  •    

    Beyond Heuristics: Learning to Classify Vulnerabilities and Predict Exploits

    Mehran Bozorgi, Lawrence Saul, Stefan Savage, Geoffrey M. Voelker

    Proceedings of the Sixteenth ACM Conference on Knowledge Discovery and Data Mining (KDD-2010), pp. 105-113

  •    

    Compression Progress, Pseudorandomness, & Hyperbolic Discounting

    Moshe Looks

    The Third Conference on Artificial General Intelligence, Atlantis Press, http://www.atlantis-press.com (2010), pp. 186-187

  •   

    Distributed Training Strategies for the Structured Perceptron

    Ryan McDonald, Keith Hall, Gideon Mann

    North American Chapter of the Association for Computational Linguistics (NAACL) (2010)

  •  

    Efficient Learning and Feature Selection in High-Dimensional Regression

    Jo-Anne Ting, Aaron D'Souza, Stefan Schaal

    Neural Computation, vol. 22(4) (2010), pp. 831-886

  •   

    Exploiting Feature Covariance in High-Dimensional Online Learning

    Justin Ma, Alex Kulesza, Mark Dredze, Koby Crammer, Lawrence Saul, Fernando Pereira

    Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, JMLR (2010), pp. 493-500

  •    

    Finding Meaning on YouTube: Tag Recommendation and Category Discovery

    George Toderici, Hrishikesh Aradhye, Marius Pasca, Luciano Sbaiz, Jay Yagnik

    Computer Vision and Pattern Recognition, IEEE (2010)

  •   

    Finding planted partitions in nearly linear time using arrested spectral clustering

    Nader H. Bshouty, Philip M. Long

    ICML (2010)

  •    

    Generalization Bounds for Learning Kernels

    Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

    Proceedings of the 27th Annual International Conference on Machine Learning (ICML 2010)

  •  

    Generalized Expectation Criteria for Semi-supervised Learning with Weakly Labeled Data

    Gideon Mann, Andrew McCallum

    JMLR, vol. 11 (2010)

  •   

    Graphical Models of the Visual Cortex

    Thomas Dean

    Heuristics, Probability and Causality, College Publications, King's College London, Strand, London WC2R 2LS, UK (2010), pp. 121-142

  •    

    Half Transductive Ranking

    Bing Bai, Jason Weston, David Grangier, Ronan Collobert, Corinna Cortes, Mehryar Mohri

    Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2010)

  •   

    Hilbert Space Embeddings of Hidden Markov Models

    Le Song, Byron Boots, Sajid Siddiqi, Geoffrey J. Gordon, Alex Smola

    Proceedings of the International Conference on Machine Learning (ICML) (2010)

  •   

    Label Embedding Trees for Large Multi-Class Tasks

    Samy Bengio, Jason Weston, David Grangier

    Neural Information Processing Systems (NIPS) (2010)

  •  

    Label Ranking under Ambiguous Supervision: An Application for Learning Semantic Correspondences

    Nicolas Usunier, Antoine Bordes, Jason Weston

    ICML (2010)

  •    

    Large Scale Image Annotation: Learning to Rank with Joint Word-Image Embeddings

    Jason Weston, Samy Bengio, Nicolas Usunier

    European Conference on Machine Learning (2010)

  •    

    Large Scale Online Learning of Image Similarity Through Ranking

    Gal Chechik, Varun Sharma, Uri Shalit, Samy Bengio

    Journal of Machine Learning Research, JMLR (2010), pp. 1109-1135

  •   

    Large scale image annotation: learning to rank with joint word-image embeddings

    Jason Weston, Samy Bengio, Nicolas Usunier

    Machine Learning, vol. 81, Issue 1 (2010), pp. 21

  •   

    Large-Scale Training of SVMs with Automata Kernels

    Cyril Allauzen, Corinna Cortes, Mehryar Mohri

    CIAA (2010), pp. 17-27

  •   

    Learning Bounds for Importance Weighting

    Corinna Cortes, Yishay Mansour, Mehryar Mohri

    Advances in Neural Information Processing Systems (NIPS 2010), MIT Press, Vancouver, Canada

  •    

    Learning with Global Cost in Stochastic Environments

    Eyal Even-Dar, Shie Mannor, Yishay Mansour

    Proceedings of the 23rd Annual Conference on Learning Theory (COLT) (2010)

  •    

    Mahout in Action

    Robin Anil, Sean Owen, Ted Dunning, Ellen Friedman

    Manning, Manning Publications Co. Sound View Ct. #3B Greenwich, CT 06830 (2010), pp. 350

  •    

    MapReduce/Bigtable for Distributed Optimization

    Keith B. Hall, Scott Gilpin, Gideon Mann

    Neural Information Processing Systems Workshop on Learning on Cores, Clusters, and Clouds (2010)

  •   

    Natural Language Processing (almost) from Scratch

    Ronan Collobert, Jason Weston, Leon Bottou, Michael Karlen, Koray Kavukcuoglu, Pavel Kuksa

    Journal of Machine Learning Research (2010)

  •  

    On the Estimation of Coherence

    Mehryar Mohri, Ameet Talwalkar

    CoRR, vol. abs/1009.0861 (2010)

  •    

    On the Impact of Kernel Approximation on Learning Accuracy

    Corinna Cortes, Mehryar Mohri, Ameet Talwalkar

    Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2010)

  •  

    Parallel Spectral Clustering in Distributed Systems

    Wen-Yen Chen, Yangqiu Song, Hongjie Bai, Chih-Jen Lin, Edward Y. Chang

    IEEE Transactions on Pattern Analysis and Machine Intelligence (2010)

  •    

    Prediction of Advertiser Churn for Google AdWords

    Sangho Yoon, Jim Koehler, Adam Ghobarah

    JSM Proceedings, American Statistical Association (2010) (to appear)

  •    

    Preference-Based Learning to Rank

    Nir Ailon, Mehryar Mohri

    Machine Learning Journal, vol. 8 (2010), pp. 189-211

  •   

    Random classification noise defeats all convex potential boosters

    Philip M. Long, Rocco A. Servedio

    Machine Learning, vol. 78 (2010), pp. 287-304

  •    

    Regret Minimization with Concept Drift

    Koby Crammer, Eyal Even-Dar, Yishay Mansour, Jennifer Wortman Vaughan

    Proceedings of the 23rd Annual Conference on Learning Theory (COLT) (2010)

  •   

    Restricted Boltzmann Machines are hard to approximately evaluate or simulate

    Philip M. Long, Rocco A. Servedio

    ICML (2010)

  •    

    Robust Symbolic Regression with Affine Arithmetic

    Cassio Pennachin, Moshe Looks, João A. de Vasconcelos

    Genetic and Evolutionary Computation Conference (GECCO) (2010)

  •   

    SPEC Hashing: Similarity Preserving algorithm for Entropy-based Coding

    Ruei-Sung Lin, David A. Ross, Jay Yagnik

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2010)

  •   

    SVM Optimization for Lattice Kernels

    Cyril Allauzen, Corinna Cortes, Mehryar Mohri

    Mining and Learning with Graphs (2010)

  •  

    Semi-Supervised Abstraction-Augmented String Kernel for Multi-Level Bio-Relation Extraction

    Pavel Kuksa, Yanjun Qi, Bing Bai, Ronan Collobert, Jason Weston

    ECML (2010)

  •   

    Sequential Projection Learning for Hashing with Compact Codes

    Jun Wang, Sanjiv Kumar, Shih-Fu Chang

    International Conference on Machine Learning (ICML) (2010)

  •    

    Showing Relevant Ads via Lipschitz Context Multi-Armed Bandits

    Tyler Lu, Dávid Pál, Martin Pál

    Thirteenth International Conference on Artificial Intelligence and Statistics, Journal of Machine Learning Research (2010)

  •    

    Sparse Spatiotemporal Coding for Activity Recognition

    Thomas Dean, Greg Corrado, Rich Washington

    Brown University (2010)

  •    

    Stability Bounds for Stationary φ-mixing and β-mixing Processes

    Mehryar Mohri, Afshin Rostamizadeh

    Journal of Machine Learning Research (JMLR), vol. 11 (2010), pp. 798-814

  •    

    Star Quality: Aggregating Reviews to Rank Products and Merchants

    Mary McGlohon, Natalie Glance, Zach Reiter

    Proceedings of Fourth International Conference on Weblogs and Social Media (ICWSM), AAAI (2010)

  •   

    The Learning Behind Gmail Priority Inbox

    Douglas Aberdeen, Ondrey Pacovsky, Andrew Slater

    LCCC: NIPS 2010 Workshop on Learning on Cores, Clusters and Clouds

  •  

    The YouTube video recommendation system

    James Davidson, Benjamin Liebald, Junning Liu, Palash Nandy, Taylor Van Vleet, Ullas Gargi, Sujoy Gupta, Yu He, Mike Lambert, Blake Livingston, Dasarathi Sampath

    Fourth ACM conference on Recommender systems (2010)

  •   

    Theoretical Convergence Guarantees for Cooperative Coevolutionary Algorithms

    Liviu Panait

    Evolutionary Computation Journal (2010)

  •  

    Towards Understanding Situated Natural Language

    Antoine Bordes, Nicolas Usunier, Jason Weston

    Artificial Intelligence and Statistics (AISTATS) (2010)

  •    

    Training and Testing Low-degree Polynomial Data Mappings via Linear SVM

    Yin-Wen Chang, Cho-Jui Hsieh, Kai-Wei Chang, Michael Ringgaard, Chih-Jen Lin

    Journal of Machine Learning Research, vol. 11(Apr) (2010), pp. 1471-1490

  •    

    Two-Stage Learning Kernel Algorithms

    Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

    Proceedings of the 27th Annual International Conference on Machine Learning (ICML 2010)

  •    

    Why does Unsupervised Pre-training Help Deep Learning?

    Dumitru Erhan, Yoshua Bengio, Aaron Courville, Pierre-Antoine Manzagol, Pascal Vincent, Samy Bengio

    Journal of Machine Learning Research (2010), pp. 625-660

  •    

    An Online Algorithm for Large Scale Image Similarity Learning

    Gal Chechik, Varun Sharma, Uri Shalit, Samy Bengio

    Advances in Neural Information Processing Systems (2009)

  •  

    Artificial Intelligence: A Modern Approach

    Stuart Russell, Peter Norvig

    Prentice Hall Press, Upper Saddle River, NJ, USA (2009)

  •    

    Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods

    Joseph Keshet, Samy Bengio

    Wiley (2009)

  •   

    Baum's algorithm learns intersections of halfspaces with respect to log-concave distributions

    Adam R. Klivans, Philip M. Long, Alex K. Tang

    RANDOM (2009)

  •   

    Boosting with structural sparsity

    John Duchi, Yoram Singer

    ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning, ACM, New York, NY, USA (2009), pp. 297-304

  •    

    Cooperative Coevolution and Univariate Estimation of Distribution Algorithms

    Christopher Vo, Liviu Panait, Sean Luke

    Foundations of Genetic Algorithms (2009)

  •   

    Des algorithmes d'apprentissage pour mieux classifier

    Corinna Cortes, Patrick Haffner, Mehryar Mohri

    Pour la Science, vol. 386 (2009)

  •    

    Discriminative Keyword Spotting

    Joseph Keshet, David Grangier, Samy Bengio

    Speech Communication (2009), pp. 317-329

  •   

    Domain Adaptation with Multiple Sources

    Yishay Mansour, Mehryar Mohri, Afshin Rostamizadeh

    Advances in Neural Information Processing Systems (NIPS 2008), MIT Press, Vancouver, Canada (2009)

  •   

    Domain Adaptation: Learning Bounds and Algorithms

    Yishay Mansour, Mehryar Mohri, Afshin Rostamizadeh

    Proceedings of The 22nd Annual Conference on Learning Theory (COLT 2009), Omnipress, Montréal, Canada

  •   

    Efficient Large-Scale Distributed Training of Conditional Maximum Entropy Models

    Gideon Mann, Ryan McDonald, Mehryar Mohri, Nathan Silberman, Daniel Walker IV

    Neural Information Processing Systems (NIPS) (2009)

  •    

    Emotional Memory and Adaptive Personalities

    Anthony Francis, Manish Mehta, Ashwin Ram

    Handbook of Synthetic Emotions and Sociable Robotics, Information Science Reference, an imprint of IGI Global, www.info-sci-ref.com (2009), pp. 391-412

  •   

    Ensemble Nystrom Method

    Sanjiv Kumar, Mehryar Mohri, Ameet Talwalkar

    Neural Information Processing Systems (NIPS) (2009)

  •   

    Entropic Graph Regularization in Non-Parametric Semi-Supervised Classification

    Amarnag Subramanya, Jeff Bilmes

    NIPS 2009

  •    

    Finding Images and Line Drawings in Document-Scanning Systems

    Shumeet Baluja, Michele Covell

    Proc. International Conference on Document Analysis and Retrieval, IAPR (2009)

  •   

    Gaussian Margin Machines

    Koby Crammer, Mehryar Mohri, Fernando Pereira

    Twelfth International Conference on Artificial Intelligence and Statistics (AISTATS 2009), Clearwater Beach, Florida, pp. 105-112

  •   

    Group Sparse Coding

    Samy Bengio, Fernando Pereira, Yoram Singer, Dennis Strelow

    Advances in Neural Information Processing Systems (2009)

  •   

    Introduction

    Samy Bengio, Joseph Keshet

    Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, Wiley (2009)

  •   

    Invited talk: Can learning kernels help performance?

    Corinna Cortes

    ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning, ACM, New York, NY, USA (2009), pp. 1-1

  •    

    Kernel Based Text-Independent Speaker Verification

    Johnny Mariethoz, Yves Grandvalet, Samy Bengio

    Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, Wiley (2009)

  •   

    L2 Regularization for Learning Kernels

    Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

    Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence (UAI 2009), Montréal, Canada

  •    

    Large Scale Graph Transduction

    Amarnag Subramanya, Jeff Bilmes

    NIPS 2009 Workshop on Large-Scale Machine Learning: Parallelism and Massive Datasets, NIPS

  •   

    Large Scale Learning to Rank

    D. Sculley

    NIPS 2009 Workshop on Advances in Ranking

  •    

    Large Scale Online Learning of Image Similarity Through Ranking: Extended Abstract

    Gal Chechik, Varun Sharma, Uri Shalit, Samy Bengio

    4th Iberian Conference on Pattern Recognition and Image Analysis, IbPRIA (2009)

  •   

    Learning Halfspaces with Malicious Noise

    Adam R. Klivans, Philip M. Long, Rocco A. Servedio

    JMLR, vol. 10 (2009), pp. 2715-2740

  •   

    Learning non-linear combinations of kernels

    Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

    NIPS 2009, Advances in Neural Information Processing Systems, MIT Press

  •  

    Multiple Source Adaptation and the Renyi Divergence

    Yishay Mansour, Mehryar Mohri, Afshin Rostamizadeh

    UAI (2009), pp. 367-374

  •   

    On Sampling-Based Approximate Spectral Decomposition

    Sanjiv Kumar, Mehryar Mohri, Ameet Talwalkar

    International Conference on Machine Learning (ICML) (2009)

  •  

    Parallel Large Scale Feature Selection for Logistic Regression

    Sameer Singh, Jeremy Kubica, Scott Larsen, Daria Sorokina

    SIAM International Conference on Data Mining (SDM) (2009)

  •   

    Polynomial semantic indexing

    Bing Bai, Jason Weston, David Grangier, Ronan Collobert, Kunihiko Sadamasa, Yanjun Qi, Corinna Cortes, Mehryar Mohri

    Advances in Neural Information Processing Systems (NIPS 2009), MIT Press

  •    

    Posterior vs. Parameter Sparsity in Latent Variable Models

    Joao Graca, Kuzman Ganchev, Ben Taskar, Fernando Pereira

    Advances in Neural Information Processing Systems 22 (2009), pp. 664-672

  •    

    Probabilistic Models for Melodic Prediction

    Jean-Francois Paiement, Samy Bengio, Douglas Eck

    Artificial Intelligence Journal, vol. 173 (2009), pp. 1266-1274

  •    

    Program Representation for General Intelligence

    Moshe Looks, Ben Goertzel

    The Second Conference on Artificial General Intelligence (2009)

  •   

    Quantum Annealing for Clustering

    Kenichi Kurihara, Shu Tanaka, Seiji Miyashita

    Proceedings of the 25th Annual Conference on Uncertainty in Artificial Intelligence, AUAI Press (2009) (to appear)

  •   

    Quantum Annealing for Variational Bayes Inference

    Issei Sato, Kenichi Kurihara, Shu Tanaka, Seiji Miyashita, Hiroshi Nakagawa

    Proceedings of the 25th Annual Conference on Uncertainty in Artificial Intelligence, AUAI Press (2009) (to appear)

  •   

    Rademacher Complexity Bounds for Non-I.I.D. Processes

    Mehryar Mohri, Afshin Rostamizadeh

    Advances in Neural Information Processing Systems (NIPS 2008), MIT Press, Vancouver, Canada (2009)

  •    

    Recursive Sparse Spatiotemporal Coding

    Thomas Dean, Greg Corrado, Richard Washington

    Proceedings of the Fifth IEEE International Workshop on Multimedia Information Processing and Retrieval, IEEE Computer Society (2009)

  •   

    Sampling Techniques for the Nystrom Method

    Sanjiv Kumar, Mehryar Mohri, Ameet Talwalkar

    Artificial Intelligence and Statistics (AISTATS) (2009)

  •    

    Semi-supervised Learning of Dependency Parsers using Generalized Expectation Criteria

    Gregory Druck, Gideon S. Mann, Andrew McCallum

    IJCNLP-ACL (2009)

  •   

    Simple Risk Bounds for Position-Sensitive Max-Margin Ranking Algorithms

    Stefan Riezler, Fabio De Bona

    Proceedings of NIPS'09 Workshop on "Advances in Ranking" (2009)

  •   

    Sleeping Experts and Bandits with Stochastic Action Availability and Adversarial Rewards

    Varun Kanade, H. Brendan McMahan, Brent Bryan

    Proceedings of the 12th International Conference on Artificial Intelligence and Statistics (AISTATS) (2009)

  •  

    Suggesting email view filters for triage and search

    Mark Dredze, Bill N. Schilit, Peter Norvig

    IJCAI'09: Proceedings of the 21st International Joint Conference on Artificial Intelligence, Morgan Kaufmann Publishers Inc., San Francisco, CA, USA (2009), pp. 1414-1419

  •    

    Symmetric Splitting in the General Theory of Stable Models

    Paolo Ferraris, Joohyung Lee, Vladimir Lifschitz, Ravi Palla

    Proceedings of the Twenty-first International Joint Conference on Artificial Intelligence (IJCAI '09) (2009), pp. 797-803

  •    

    The Difficulty of Training Deep Architectures and the Effect of Unsupervised Pre-Training

    Dumitru Erhan, Pierre-Antoine Manzagol, Yoshua Bengio, Samy Bengio, Pascal Vincent

    Twelfth International Conference on Artificial Intelligence and Statistics (AISTATS), JMLR Workshop and Conference Proceedings (2009), pp. 153-160

  •   

    Tighter Bounds for Multi-Armed Bandits with Expert Advice

    H. Brendan McMahan, Matthew Streeter

    Proceedings of the 22nd Annual Conference on Learning Theory (COLT) (2009)

  •   

    Using the Doubling Dimension to Analyze the Generalization of Learning Algorithms

    Nader H. Bshouty, Yi Li, Philip M. Long

    JCSS (2009)

  •  

    YouTube Scale, Large Vocabulary Video Annotation

    Nick Morsillo, Chris Pal, Gideon Mann

    Video Search and Mining (2009)

  •   

    A Bayesian Approach to Empirical Local Linearization for Robotics

    Jo-Anne Ting, Aaron D'Souza, Sethu Vijayakumar, Stefan Schaal

    International Conference on Robotics and Automation (ICRA2008)

  •    

    A Discriminative Kernel-based Approach to Retrieval Images from Text Queries

    David Grangier, Samy Bengio

    IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30 (2008), pp. 1371-1384

  •    

    A Distance Model for Rhythms

    Jean-Francois Paiement, Yves Grandvalet, Samy Bengio, Douglas Eck

    International Conference on Machine Learning (ICML) (2008)

  •    

    A Generative Model for Rhythms

    Jean-Francois Paiement, Samy Bengio, Yves Grandvalet, Doug Eck

    Neural Information Processing Systems, Workshop on Brain, Music and Cognition (2008)

  •   

    A Machine Learning Framework for Spoken-Dialog Classification

    Corinna Cortes, Patrick Haffner, Mehryar Mohri

    Handbook on Speech Processing and Speech Communication, Part E: Speech recognition, Springer-Verlag, Heidelberg, Germany (2008)

  •  

    Actively Learning Level-Sets of Composite Functions

    Brent Bryan, Jeff Schneider

    ICML 2008: International Conference on Machine Learning

  •   

    Adaptive Martingale Boosting

    Philip M. Long, Rocco A. Servedio

    NIPS (2008)

  •   

    An Efficient Reduction of Ranking to Classification

    Nir Ailon, Mehryar Mohri

    Proceedings of The 21st Annual Conference on Learning Theory (COLT 2008), Springer, Heidelberg, Germany, Helsinki, Finland

  •   

    Boosted Bayesian Network Classifier

    Yushi Jing, Vladimir Pavlovic, James M. Rehg

    Machine Learning Journal (2008)

  •    

    Confidence-Weighted Linear Classification

    Mark Dredze, Koby Crammer, Fernando Pereira

    International Conference on Machine Learning (ICML) (2008)

  •    

    Delay Learning and Polychronization for Reservoir Computing

    Hélène Paugam-Moisy, Régis Martinez, Samy Bengio

    Neurocomputing, vol. 71 (2008), pp. 1143-1158

  •   

    Efficient projections onto the l1-ball for learning in high dimensions

    John Duchi, Shai Shalev-Shwartz, Yoram Singer, Tushar Chandra

    ICML '08: Proceedings of the 25th international conference on Machine learning, ACM, New York, NY, USA (2008), pp. 272-279

  •    

    Forecasting Web Page Views: Methods and Observations

    Jia Li, Andrew Moore

    JMLR, vol. 9(Oct) (2008), pp. 2217-2250

  •   

    Generalized Expectation Criteria for Semi-Supervised Learning of Conditional Random Fields

    Gideon Mann, Andrew McCallum

    ACL (2008)

  •   

    Intelligent Email: Reply and Attachment Prediction

    Mark Dredze, Tova Brooks, Josh Carroll, Joshua Magarick, John Blitzer, Fernando Pereira

    Proceedings of the 2008 International Conference on Intelligent User Interfaces

  •   

    Kernel Methods for Learning Languages

    Leonid Kontorovich, Corinna Cortes, Mehryar Mohri

    Theoretical Computer Science, vol. 405 (2008), pp. 223-236

  •    

    Large Scale Content-Based Audio Retrieval from Text Queries

    Gal Chechik, Eugene Ie, Martin Rehn, Samy Bengio, Richard F. Lyon

    ACM International Conference on Multimedia Information Retrieval (MIR), ACM (2008)

  •   

    Learning Bounds for Domain Adaptation

    John Blitzer, Koby Crammer, Alex Kulesza, Fernando Pereira, Jennifer Wortman

    Advances in Neural Information Processing Systems 20, MIT Press, Cambridge, MA (2008)

  •   

    Learning Multiple Graphs for Document Recommendations

    Ding Zhou, Shenghuo Zhu, Kai Yu, Xiaodan Song, Belle L. Tseng, Hongyuan Zha, C. Lee Giles

    Proc. 17th International Conference on World Wide Web, ACM, Beijing (2008), pp. 141-150

  •   

    Learning sequence kernels

    Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

    Proceedings of IEEE International Workshop on Machine Learning for Signal Processing (2008)

  •    

    Learning to hash: forgiving hash functions and applications

    Shumeet Baluja, Michele Covell

    Data Mining and Knowledge Discovery (2008)

  •   

    Learning with weighted transducers

    Corinna Cortes, Mehryar Mohri

    Proceedings of the Seventh International Workshop Finite-State Methods and Natural Language Processing (2008)

  •  

    Online Learning of Complex Prediction Problems Using Simultaneous Projections

    Yonatan Amit, Shai Shalev-Shwartz, Yoram Singer

    J. Mach. Learn. Res., vol. 9 (2008), pp. 1399-1435

  •  

    Robust Submodular Observation Selection

    Andreas Krause, H. Brendan McMahan, Carlos Guestrin, Anupam Gupta

    Journal of Machine Learning Research (JMLR), vol. 9 (2008), pp. 2761-2801

  •   

    Sample Selection Bias Correction Theory

    Corinna Cortes, Mehryar Mohri, Michael Riley, Afshin Rostamizadeh

    Proceedings of The 19th International Conference on Algorithmic Learning Theory (ALT 2008), Springer, Heidelberg, Germany, Budapest, Hungary

  •    

    Sequence Kernels for Predicting Protein Essentiality

    Cyril Allauzen, Mehryar Mohri, Ameet Talwalkar

    Proceedings of ICML 2008

  •   

    Stability Bounds for Non-i.i.d. Processes

    Mehryar Mohri, Afshin Rostamizadeh

    Advances in Neural Information Processing Systems (NIPS 2007), MIT Press, Vancouver, Canada (2008)

  •   

    Stability of Transductive Regression Algorithms

    Corinna Cortes, Mehryar Mohri, Dmitry Pechyony, Ashish Rastogi

    Proceedings of the Twenty-fifth International Conference on Machine Learning (ICML 2008), Helsinki, Finland

  •   

    Structured Learning with Approximate Inference

    Alex Kulesza, Fernando Pereira

    Advances in Neural Information Processing Systems 20, MIT Press, Cambridge, MA (2008)

  •    

    Theoretical Advantages of Lenient Learners: An Evolutionary Game Theoretic Perspective

    Liviu Panait, Karl Tuyls, Sean Luke

    Journal of Machine Learning Research (2008)

  •   

    Web Page Language Identification Based on URLs

    Eda Baykan, Monika Henzinger, Ingmar Weber

    34th International Conference on Very Large Data Bases (VLDB), ACM Press, New York (2008), pp. 176-188

  •   

    A General Regression Framework for Learning String-to-String Mappings

    Corinna Cortes, Mehryar Mohri, Jason Weston

    Predicting Structured Data, The MIT Press (2007)

  •    

    A Generative Model for Distance Patterns in Music

    Jean-Francois Paiement, Yves Grandvalet, Samy Bengio, Douglas Eck

    NIPS Workshop on Music, Brain and Cognition (2007)

  •   

    A Primal-Dual Perspective of Online Learning Algorithms

    Shai Shalev-Shwartz, Yoram Singer

    Machine Learning, vol. 69, no. 2-3 (2007), pp. 115-142

  •  

    Automatic outlier detection: A Bayesian approach

    Jo-Anne Ting, Aaron D'Souza, Stefan Schaal

    International Conference on Robotics and Automation (ICRA 2007)

  •    

    Biometric Person Authentication IS A Multiple Classifier Problem

    Samy Bengio, Johnny Mariéthoz

    7th International Workshop on Multiple Classifier Systems (2007)

  •    

    Boosting the area under the ROC curve

    Philip M. Long, Rocco A. Servedio

    NIPS (2007)

  •    

    Discriminative learning can succeed where generative learning fails

    Philip M. Long, Rocco A. Servedio, Hans Ulrich Simon

    Information Processing Letters, vol. 103(4) (2007), pp. 131-135

  •   

    Euclidean Embedding of Co-occurrence Data

    Amir Globerson, Gal Chechik, Fernando Pereira, Naftali Tishby

    Journal of Machine Learning Research, vol. 8 (2007), pp. 2265-2295

  •   

    Improving Embeddings by Flexible Exploitation of Side Information

    Ali Ghodsi, Finnegan Southey, Dana Wilkinson

    Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI-07) (2007)

  •   

    Inferring Complex Agent Motions from Partial Trajectory Observations

    Finnegan Southey, Wesley Loh, Dana Wilkinson

    Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI-07) (2007)

  •  

    Kernel Methods for Learning Languages

    Leonid Kontorovich, Corinna Cortes, Mehryar Mohri

    Theoretical Computer Science (2007) (to appear)

  •   

    Lp Distance and Equivalence of Probabilistic Automata

    Corinna Cortes, Mehryar Mohri, Ashish Rastogi

    International Journal of Foundations of Computer Science, vol. 18 (2007)

  •   

    Learning Forgiving Hash Functions: Algorithms and Large Scale Tests

    Shumeet Baluja, Michele Covell

    IJCAI-07: International Joint Conference on Artificial Intelligence (2007)

  •  

    Learning and Inferring Transportation Routines

    Lin Liao, Don Patterson, Dieter Fox, Henry Kautz

    Artificial Intelligence, vol. 171 (2007), pp. 311-331

  •    

    Learning the Inter-frame Distance for Discriminative Template-based Keyword Detection

    David Grangier, Samy Bengio

    Proceedings of the International Conference Interspeech-Eurospeech (2007)

  •   

    Learning to verify branching time properties

    Abhay Vardhan, Mahesh Viswanathan

    Formal Methods in System Design, vol. 31, no. 1 (2007), pp. 35-61

  •    

    On the Prospects for Building a Working Model of the Visual Cortex

    Thomas Dean, Glenn Carroll, Richard Washington

    Proceedings of AAAI-07, MIT Press, Cambridge, Massachusetts (2007), pp. 1597-1600

  •   

    One-pass boosting

    Zafer Barutcuoglu, Philip M. Long, Rocco A. Servedio

    NIPS (2007)

  •   

    Online learning of multiple tasks with a shared loss

    Ofer Dekel, Philip M. Long, Yoram Singer

    JMLR, vol. 8 (2007), pp. 2233-2264

  •   

    Recursive Attribute Factoring

    David Cohn, Deepak Verma, Karl Pfleger

    Advances in Neural Information Processing Systems 19 (2007)

  •   

    Selecting Observations Against Adversarial Objectives

    Andreas Krause, H. Brendan McMahan, Carlos Guestrin, Anupam Gupta

    Advances in Neural Information Processing Systems (NIPS 2007)

  •    

    Studies in Lower Bounding Probability of Evidence using the Markov Inequality

    Vibhav Gogate, Bozhena Bidyuk, Rina Dechter

    UAI, Morgan Kaufmann (2007)

  •   

    Supervised Learning of Semantic Classes for Image Annotation and Retrieval

    Gustavo Carneiro, Antoni B. Chan, Pedro J. Moreno, Nuno Vasconcelos

    IEEE Transactions on Pattern Analysis and Machine Intelligence (2007), pp. 394-410

  •    

    The Need for Open Source Software in Machine Learning

    Soren Sonnenburg, Mikio L. Braun, Cheng Soon Ong, Samy Bengio, Leon Bottou, Geoff Holmes, Yann LeCun, Klaus-Robert Mueller, Fernando Pereira, Carl-Edward Rasmussen, Gunnar Raetsch, Bernhard Schoelkopf, Alexander Smola, Pascal Vincent, Jason Weston, Robert C. Williamson

    Journal of Machine Learning Research, vol. 8 (2007), pp. 2443-2466

  •    

    The War Against Spam: A report from the front line

    Brad Taylor, Dan Fingal, Douglas Aberdeen

    NIPS 2007 Workshop on Machine Learning in Adversarial Environments for Computer Security

  •   

    Theoretical Advantages of Lenient Learners in Multiagent Systems

    Liviu Panait, Karl Tuyls

    Proceedings of the Sixth International Conference on Autonomous Agents and Multi-agent Systems (AAMAS-07), ACM (2007)

  •  

    Training Conditional Random Fields using Virtual Evidence Boosting

    Lin Liao, Tanzeem Choudhury, Dieter Fox, Henry Kautz

    Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) (2007)

  •   

    A Machine Learning Framework for Spoken-Dialog Classification

    Corinna Cortes, Patrick Haffner, Mehryar Mohri

    Handbook on Speech Processing and Speech Communication, Part E: Speech recognition, Springer-Verlag, Heidelberg, Germany (2007)

  •  

    An Alternative Ranking Problem for Search Engines

    Corinna Cortes, Mehryar Mohri, Ashish Rastogi

    Proceedings of the 6th Workshop on Experimental Algorithms (WEA 2007), Springer-Verlag, Heidelberg, Germany, Rome, Italy, pp. 1-21

  •  

    Learning Languages with Rational Kernels

    Corinna Cortes, Leonid Kontorovich, Mehryar Mohri

    Proceedings of The 20th Annual Conference on Computational Learning Theory (COLT 2007), Springer, Heidelberg, Germany, San Diego, California

  •   

    Magnitude-Preserving Ranking Algorithms

    Corinna Cortes, Mehryar Mohri, Ashish Rastogi

    Proceedings of the Twenty-fourth International Conference on Machine Learning (ICML 2007), Oregon State University, Corvallis, OR

  •   

    On Transductive Regression

    Corinna Cortes, Mehryar Mohri

    Advances in Neural Information Processing Systems (NIPS 2006), MIT Press, Vancouver, Canada (2007)

  •   

    Attribute-efficient learning of linear threshold functions under unconcentrated distributions

    Philip M. Long, Rocco A. Servedio

    NIPS (2006)

  •  

    Bayesian Regression with Input Noise for High-Dimensional Data

    Jo-Anne Ting, Aaron D'Souza, Stefan Schaal

    Proceedings of the 23rd International Conference on Machine Learning, ACM Press (2006)

  •  

    Building Personal Maps from GPS Data

    Lin Liao, Don Patterson, Dieter Fox, Henry Kautz

    Annals of the New York Academy of Sciences, vol. 1093 (2006), pp. 249-265

  •   

    Clustering graphs by weighted substructure mining

    Koji Tsuda, Taku Kudo

    Proceedings of the 23rd International Conference on Machine Learning, ACM (2006), pp. 953-960

  •   

    Data Fusion and Multicue Data Matching by Diffusion Maps

    Stéphane Lafon, Yosi Keller, Ronald R. Coifman

    IEEE Trans. Pattern Anal. Mach. Intell., vol. 28 (2006), pp. 1784-1797

  •   

    Dependency trees in sub-linear time and bounded memory

    Dan Pelleg, Andrew W. Moore

    VLDB J., vol. 15 (2006), pp. 250-262

  •   

    Efficient Learning of Label Ranking by Soft Projections onto Polyhedra

    S. Shalev-Shwartz, Y. Singer

    Journal of Machine Learning Research (2006)

  •  

    Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields

    Lin Liao, Dieter Fox, Henry Kautz

    International Journal of Robotics Research, vol. 26 (2006), pp. 119-134

  •    

    Learning Invariant Features Using Inertial Priors

    Thomas Dean

    Annals of Mathematics and Artificial Intelligence, vol. 47 (2006), pp. 223-250

  •   

    Online Learning meets Optimization in the Dual

    S. Shalev-Shwartz, Y. Singer

    Proceedings of the Nineteenth Annual Conference on Computational Learning Theory (2006)

  •   

    Online Multiclass Learning by Interclass Hypothesis Sharing

    Michael Fink, Shai Shalev-Shwartz, Yoram Singer, Shimon Ullman

    Proceedings of the 23rd International Conference on Machine Learning (2006)

  •   

    Online Passive Aggressive Algorithms

    K. Crammer, O. Dekel, J. Keshet, S. Shalev-Shwartz, Y. Singer

    Journal of Machine Learning Research, vol. 7 (2006)

  •  

    PAC Learning Mixtures of Gaussians with No Separation Assumption

    Jon Feldman, Ryan O'Donnell, Rocco A. Servedio

    Proc. 19th Annual Conference on Learning Theory (COLT) (2006)

  •   

    Predicting Electricity Distribution Feeder Failures Using Machine Learning Susceptibility Analysis

    Philip Gross, Albert Boulanger, Marta Arias, David L. Waltz, Philip M. Long, Charles Lawson, Roger Anderson, Matthew Koenig, Mark Mastrocinque, William Fairechio, John A. Johnson, Serena Lee, Frank Doherty, Arthur Kressner

    IAAI (2006)

  •   

    Reasoning about Partially Observed Actions

    Megan Nance, Adam Vogel, Eyal Amir

    AAAI (2006)

  •    

    Scalable Inference in Hierarchical Generative Models

    Thomas Dean

    Proceedings of the Ninth International Symposium on Artificial Intelligence and Mathematics (2006)

  •   

    Learning Linearly Separable Languages

    Leonid Kontorovich, Corinna Cortes, Mehryar Mohri

    Proceedings of The 17th International Conference on Algorithmic Learning Theory (ALT 2006), Springer, Heidelberg, Germany

  •   

    A Computational Model of the Cerebral Cortex

    Thomas Dean

    Proceedings of AAAI-05, MIT Press, Cambridge, Massachusetts (2005), pp. 938-943

  •   

    A General Regression Technique for Learning Transductions

    Corinna Cortes, Mehryar Mohri, Jason Weston

    Proceedings of the Twenty-Second International Conference on Machine Learning (ICML 2005), Bonn, Germany

  •   

    A New Perspective on an Old Perceptron Algorithm

    S. Shalev-Shwartz, Y. Singer

    Proceedings of the Eighteenth Annual Conference on Computational Learning Theory (2005)

  •   

    Data-Driven Online to Batch Conversions

    Ofer Dekel, Yoram Singer

    NIPS (2005)

  •   

    Efficient discriminative learning of Bayesian network classifier

    Yushi Jing, Vladimir Pavlovic, James M. Rehg

    Proc. International Conference on Machine Learning (Best student paper) (2005)

  •  

    Loss Bounds for Online Category Ranking

    K. Crammer, Y. Singer

    Proceedings of the Eighteenth Annual Conference on Computational Learning Theory (2005)

  •   

    Online Multiclass Learning with k-Way Limited Feedback and an Application to Utterance Classification

    Hiyan Alshawi

    Machine Learning, vol. 60 (2005)

  •  

    Online Ranking by Projecting

    K. Crammer, Y. Singer

    Neural Computation, vol. 17 (2005)

  •   

    Phoneme Alignment Based on Discriminative Learning

    J. Keshet, S. Shalev-Shwartz, Y. Singer, D. Chazan

    Interspeech (2005)

  •   

    Semi-Supervised Self-Training of Object Detection Models

    Chuck Rosenberg, Martial Hebert, Henry Schneiderman

    WACV/MOTION (2005), pp. 29-36

  •   

    Special Review Issue

    Donald Perlis, Peter Norvig

    Artif. Intell., vol. 169 (2005), pp. 103-212

  •   

    The Forgetron: A Kernel-Based Perceptron on a Fixed Budget

    Ofer Dekel, Shai Shalev-Shwartz, Yoram Singer

    NIPS (2005)

  •   

    A Comparison of Classifiers for Detecting Emotion from Speech

    Izhak Shafran, Mehryar Mohri

    Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania

  •   

    Confidence Intervals for the Area under the ROC Curve

    Corinna Cortes, Mehryar Mohri

    Advances in Neural Information Processing Systems (NIPS 2004), MIT Press, Vancouver, Canada (2005)

  •   

    Margin-Based Ranking Meets Boosting in the Middle

    Cynthia Rudin, Corinna Cortes, Mehryar Mohri, Robert E. Schapire

    Proceedings of The 18th Annual Conference on Computational Learning Theory (COLT 2005), Springer, Heidelberg, Germany, Bertinoro, Italy, pp. 63-78

  •   

    Moment Kernels for Regular Distributions

    Corinna Cortes, Mehryar Mohri

    Machine Learning, vol. 60 (2005), pp. 117-134

  •   

    Multi-Armed Bandit Algorithms and Empirical Evaluation

    Joannès Vermorel, Mehryar Mohri

    Proceedings of the 16th European Conference on Machine Learning (ECML 2005), Springer, Heidelberg, Germany, Porto, Portugal

  •   

    Distribution Kernels Based on Moments of Counts

    Corinna Cortes, Mehryar Mohri

    Proceedings of the Twenty-First International Conference on Machine Learning (ICML 2004), Banff, Alberta, Canada

  •   

    Rational Kernels: Theory and Algorithms

    Corinna Cortes, Patrick Haffner, Mehryar Mohri

    Journal of Machine Learning Research (JMLR), vol. 5 (2004), pp. 1035-1062

  •   

    A Retrospective on "Paradigms of AI Programming"

    Peter Norvig

    Vivek (A Quarterly in Artificial Intelligence), vol. 15 (2003)

  •   

    Artificial Intelligence: A Modern Approach

    Stuart Russell, Peter Norvig

    Prentice Hall (2002)

  •   

    Intelligent Help Systems for UNIX

    Stephen J. Hegner, Paul McKevitt, Peter Norvig, Robert Wilensky

    Springer (2001)