Machine Perception

226 Publications

  •   

    Auto-Rectification of User Photos

    Krishnendu Chaudhury (aka Krish Chaudhury), Stephen DiVerdi, Sergey Ioffe

    proceedings of IEEE International Conference on Image Processing, ICIP 2014 (to appear)

  •   

    Co-Segmentation of Textured 3D Shapes with Sparse Annotations

    M. Ersin Yumer, Ameesh Makadia

    Computer Vision and Pattern Recognition (CVPR) (2014)

  •    

    DaMN – Discriminative and Mutually Nearest: Exploiting Pairwise Category Proximity for Video Action Recognition

    Rui Hou, Amir Roshan Zamir, Rahul Sukthankar, Mubarak Shah

    Proceedings of European Conference on Computer Vision (2014)

  •   

    DeepPose: Human Pose Estimation via Deep Neural Networks

    Alexander Toshev, Christian Szegedy

    Computer Vision and Pattern Recognition (2014) (to appear)

  •  

    Large-Scale Object Classification Using Label Relation Graphs

    Jia Deng, Nan Ding, Yangqing Jia, Andrea Frome, Kevin Murphy, Samy Bengio, Yuan Li, Hartmut Neven, Hartwig Adam

    European Conference on Computer Vision (2014) (to appear)

  •    

    Large-scale Video Classification with Convolutional Neural Networks

    Andrej Karpathy, George Toderici, Sanketh Shetty, Thomas Leung, Rahul Sukthankar, Li Fei-Fei

    Proceedings of International Computer Vision and Pattern Recognition (CVPR 2014), IEEE

  •    

    Learning Fine-grained Image Similarity with Deep Ranking

    Jiang Wang, Yang Song, Thomas Leung, Chuck Rosenberg, Jingbin Wang, James Philbin, Bo Chen, Ying Wu

    CVPR'2014, IEEE

  •    

    Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks

    Ian Goodfellow, Yaroslav Bulatov, Julian Ibarz, Sacha Arnoud, Vinay Shet

    ICLR2014, ICLR2014 (to appear)

  •    

    Recognition of Complex Events: Exploiting Temporal Dynamics between Underlying Concepts

    Subhabrata Bhattacharya, Mahdi M. Kalayeh, Rahul Sukthankar, Mubarak Shah

    Proceedings of International Computer Vision and Pattern Recognition (CVPR 2014), IEEE

  •   

    SUPER 4PCS Fast Global Pointcloud Registration via Smart Indexing

    Nicolas Mellado, Dror Aiger, Niloy Mitra

    Eurographics Symposium on Geometry Processing 2014

  •  

    Scalable Object Detection using Deep Neural Networks

    Dumitru Erhan, Christian Szegedy, Alexander Toshev, Dragomir Anguelov

    Computer Vision and Pattern Recognition (2014) (to appear)

  •  

    Sinusoidal Interpolation Across Missing Data

    W. Bastiaan Kleijn, Turaj Zakizadeh Shabestary, Jan Skoglund

    International Workshop on Acoustic Signal Enhancement 2014 (IWAENC 2014), pp. 71-75

  •    

    Temporal Synchronization of Multiple Audio Signals

    Julius Kammerl, Neil Birkbeck, Sasi Inguva, Damien Kelly, Andy Crawford, Hugh Denman, Anil Kokaram, Caroline Pantofaru

    Proceedings of the International Conference on Signal Processing (ICASSP), Florence, Italy (2014)

  •    

    Training Highly Multi-class Linear Classifiers

    Maya R. Gupta, Samy Bengio, Jason Weston

    Journal Machine Learning Research (JMLR) (2014), 1461-−1492

  •  

    Unsupervised Discovery of Object Classes with a Mobile Robot

    Julian Mason, Bhaskara Marthi, Ronald Parr

    ICRA 2014

  •    

    Video Object Discovery and Co-segmentation with Extremely Weak Supervision

    Le Wang, Gang Hua, Rahul Sukthankar, Jianru Xue, Nanning Zheng

    Proceedings of European Conference on Computer Vision (2014)

  •    

    Video Quality Assessment for Web Content Mirroring

    Ye He, Kevin Fei, Gus Fernandez, Edward J. Delp

    Imaging and Multimedia Analytics in a Web and Mobile World 2014, IS&T/SPIE Electronic Imaging, San Francisco, California, pp. 9027-11

  •    

    Zero-Shot Learning by Convex Combination of Semantic Embeddings

    Mohammad Norouzi, Tomas Mikolov, Samy Bengio, Yoram Singer, Jonathon Shlens, Andrea Frome, Greg Corrado, Jeffrey Dean

    International Conference on Learning Representations (2014)

  •    

    3DNN: Viewpoint Invariant 3D Geometry Matching for Scene Understanding

    Scott Satkin, Martial Hebert

    Proceedings of the International Conference on Computer Vision (ICCV) (2013) (to appear)

  •    

    A Butterfly Structured Design of The Hybrid Transform Coding Scheme

    Jingning Han, Yaowu Xu, Debargha Mukherjee

    Picture Coding Symposium, IEEE (2013), pp. 1-4 (to appear)

  •   

    A Discriminative Model for Learning Semantic and Geometric Interactions in Indoor Scenes

    Wongun Choi, Yu-Wei Chao, Caroline Pantofaru, Silvio Savarese

    Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Scene Understanding Workshop (SUNw) (2013)

  •   

    Accelerating defocus blur magnification

    Florian Kriener, Thomas Binder, Manuel Wille

    Proceedings SPIE Vol. 8667 (Multimedia Content and Mobile Devices), SPIE (2013)

  •   

    Category-Independent Object-level Saliency Detection

    Yangqing Jia, Mei Han

    International Conference on Computer Vision (2013)

  •    

    DeViSE: A Deep Visual-Semantic Embedding Model

    Andrea Frome, Greg Corrado, Jonathon Shlens, Samy Bengio, Jeffrey Dean, Marc’Aurelio Ranzato, Tomas Mikolov

    Neural Information Processing Systems (NIPS) (2013)

  •  

    Deep Neural Networks for Object Detection

    Christian Szegedy, Alexander Toshev, Dumitru Erhan

    Advances in Neural Information Processing Systems (2013)

  •    

    Design of user interfaces for selective editing of digital photos on touchscreen devices

    Thomas Binder, Meikel Steiding, Manuel Wille, Nils Kokemohr

    Proceedings SPIE 8667 (Multimedia Content and Mobile Devices), SPIE (2013)

  •    

    Discriminative Segment Annotation in Weakly Labeled Video

    Kevin Tang, Rahul Sukthankar, Jay Yagnik, Li Fei-Fei

    Proceedings of International Conference on Computer Vision and Pattern Recognition (CVPR 2013)

  •    

    Fast, Accurate Detection of 100,000 Object Classes on a Single Machine

    Thomas Dean, Mark Ruzon, Mark Segal, Jonathon Shlens, Sudheendra Vijayanarasimhan, Jay Yagnik

    Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE Computer Society, Washington, DC, USA (2013)

  •    

    Fast, Accurate Detection of 100,000 Object Classes on a Single Machine: Technical Supplement

    Thomas Dean, Mark Ruzon, Mark Segal, Jonathon Shlens, Sudheendra Vijayanarasimhan, Jay Yagnik

    Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE Computer Society, Washington, DC, USA (2013)

  •    

    Handling Packet Loss in WebRTC

    Stefan Holmer, Mikhal Shemer, Marco Paniconi

    International Conference on Image Processing (ICIP 2013), IEEE, pp. 1860-1864

  •    

    High-Resolution Global Maps of 21st-Century Forest Cover Change

    Rebecca Moore, Matt Hancher, David Thau

    Science, vol. 342 (2013), pp. 850-853

  •   

    Image Annotation in Presence of Noisy Labels

    Chandrashekhar V., Shailesh Kumar, C. V. Jawahar

    International Conference on Pattern Recognition and Machine Intelligence (2013) (to appear)

  •  

    Image Compression via Colorization Using Semi-Regular Color Samples

    Chenguang Zhang, Hui Fang

    Data Compression Conference (2013)

  •   

    Joint Noise Level Estimation from Personal Photo Collections

    YiChang Shih, Vivek Kwatra, Troy Chinen, Hui Fang, Sergey Ioffe

    ICCV 2013 (to appear)

  •    

    Learning Binary Codes for High Dimensional Data Using Bilinear Projections

    Yunchao Gong, Sanjiv Kumar, Henry Rowley, Svetlana Lazebnik

    IEEE Computer Vision and Pattern Recognition (2013)

  •    

    Learning Multiple Non-Linear Sub-Spaces using K-RBMs

    Siddhartha Chandra, Shailesh Kumar, C. V. Jawahar

    Computer Vision and Pattern Recognition (2013)

  •    

    Learning Part-based Templates from Large Collections of 3D Shapes

    Vladimir Kim, Wilmot Li, Niloy Mitra, Siddhartha Chaudhuri, Stephen DiVerdi, Thomas Funkhouser

    ACM Transactions on Graphics (TOG) - SIGGRAPH 2013 Conference Proceedings, vol. 32, no. 4 (2013), 70:1-70:12

  •    

    Learning Query-Specific Distance Functions for Large-Scale Web Image Search

    Yushi Jing, Michele Covell, David Tsai, James M. Rehg

    IEEE Transactions on Multimedia, vol. 15 (2013), pp. 2022-2034

  •    

    Random Grids: Fast Approximate Nearest Neighbors and Range Searching for Image Search

    Dror Aiger, Efi Kokiopoulou, Ehud Rivlin

    ICCV 2013

  •   

    Rate-Distortion Optimization for Multichannel Audio Compression

    Minyue Li, Jan Skoglund, W. Bastiaan Kleijn

    2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

  •    

    RealBrush: Painting with Examples of Physical Media

    Jingwan Lu, Connelly Barnes, Stephen DiVerdi, Adam Finkelstein

    ACM Transactions on Graphics (TOG) - SIGGRAPH 2013 Conference Proceedings, vol. 32, no. 4 (2013), 117:1-117:12

  •    

    Rendering Fur in Life of Pi

    Ivan Neulander, Toshi Kato, Kevin Beason

    ACM, New York, NY, USA

  •   

    Reporting Neighbors in High-Dimensional Euclidean Space

    Dror Aiger, Haim Kaplan, Micha Sharir

    SODA (2013)

  •    

    Spatiotemporal Deformable Part Models for Action Detection

    Yicong Tian, Rahul Sukthankar, Mubarak Shah

    Proceedings of International Conference on Computer Vision and Pattern Recognition (CVPR 2013)

  •    

    Street View Motion-from-Structure-from-Motion

    Bryan Klingner, David Martin, James Roseborough

    Proceedings of the International Conference on Computer Vision, IEEE (2013)

  •    

    The Intervalgram: An Audio Feature for Large-Scale Cover-Song Recognition

    Thomas C. Walters, David A. Ross, Richard F. Lyon

    From Sounds to Music and Emotions: 9th International Symposium, CMMR 2012, London, UK, June 19-22, 2012, Revised Selected Papers, Springer Berlin Heidelberg (2013), pp. 197-213

  •    

    Tracking Large-Scale Video Remix in Real-World Events

    Lexing Xie, Apostol Natsev, Xuming He, John R. Kender, Matthew L. Hill, John R. Smith

    IEEE Transactions on Multimedia, vol. 15, no. 6 (2013), pp. 1244-1254

  •    

    Understanding Indoor Scenes using 3D Geometric Phrases

    Wongun Choi, Yu-Wei Chao, Caroline Pantofaru, Silvio Savarese

    Proceedings of International Conference on Computer Vision and Pattern Recognition (CVPR 2013)

  •    

    Using Web Co-occurrence Statistics for Improving Image Categorization

    Samy Bengio, Jeffrey Dean, Dumitru Erhan, Eugene Ie, Quoc Le, Andrew Rabinovich, Jonathon Shlens, Yoram Singer

    arXiv (2013)

  •    

    A QCQP Approach to Triangulation

    Chris Aholt, Rekha Thomas, Sameer Agarwal

    European Conference on Computer Vision, Springer Verlag (2012)

  •    

    All Smiles : Automatic Photo Enhancement by Facial Expression Analysis

    Rajvi Shah, Vivek Kwatra

    Conference for Visual Media Production (CVMP 2012) [Best Paper]

  •   

    Apparel silhouette attributes recognition

    Wei Zhang, Emilio Antunez, Salih Gokturk, Baris Sumengen

    Proceedings of the 2012 IEEE Workshop on the Applications of Computer Vision, IEEE Computer Society, Washington, DC, USA, pp. 489-496

  •    

    Automatically Discovering Talented Musicians with Acoustic Analysis of YouTube Videos

    Eric Nichols, Charles DuHadway, Hrishikesh Aradhye, Richard F. Lyon

    Proceedings of the 2012 IEEE 12th International Conference on Data Mining (ICDM), IEEE Computer Society, Washington, DC, USA, pp. 559-565

  •    

    Building high-level features using large scale unsupervised learning

    Quoc Le, Marc'Aurelio Ranzato, Rajat Monga, Matthieu Devin, Kai Chen, Greg Corrado, Jeff Dean, Andrew Ng

    International Conference in Machine Learning (2012)

  •    

    Calibration-Free Rolling Shutter Removal

    Matthias Grundmann, Vivek Kwatra, Daniel Castro, Irfan Essa

    International Conference on Computational Photography [Best Paper], IEEE (2012)

  •    

    Capturing Indoor Scenes with Smartphones

    Aditya Sankar, Steve Seitz

    Proc. UIST, 651 N. 34th St. (2012) (to appear)

  •  

    Coherent image selection using a fast approximation to the generalized traveling salesman problem

    Meng Wang, Prakash Ishwar, Janusz Konrad, Cenk Gazen, Rohit Saboo

    Proceedings of the 20th ACM international conference on Multimedia, ACM, New York, NY, USA (2012), pp. 981-984

  •    

    D-Nets: Beyond Patch-Based Image Descriptors

    Felix von Hundelshausen, Rahul Sukthankar

    IEEE International Conference on Computer Vision and Pattern Recognition (CVPR'12) (2012)

  •    

    Efficient Closed-Form Solution to Generalized Boundary Detection

    Marius Leordeanu, Rahul Sukthankar, Crisitian Sminchisescu

    Proceedings of European Conference on Computer Vision (ECCV'12) (2012)

  •  

    Efficient model based single and double thresholding for real time recognition

    Dror Aiger, Silvio Guimarães

    ACCV Workshop on Detection and Tracking in Challenging Environments (2012)

  •    

    Embedded Voxel Colouring with Adaptive Threshold Selection Using Globally Minimal Surfaces

    Carlos Leung, Ben Appleton, Mitchell Buckley, Changming Sun

    IJCV, vol. 99 (2012), pp. 215-231

  •    

    General and Nested Wiberg Minimization

    Dennis Strelow

    Computer Vision and Pattern Recognition, IEEE (2012)

  •    

    General and nested Wiberg minimization: L2 and maximum likelihood

    Dennis Strelow

    European Conference on Computer Vision, Springer (2012)

  •   

    IMPROVED PREDICTION OF NEARLY-PERIODIC SIGNALS

    Bastiaan Kleijn, Jan Skoglund

    International Workshop on Acoustic Signal Enhancement 2012 (IWAENC2012)

  •    

    Improving Book OCR by Adaptive Language and Image Models

    Dar-Shyang Lee, Ray Smith

    Proceedings of 2012 10th IAPR International Workshop on Document Analysis Systems, IEEE, pp. 115-119

  •    

    Joint Image and Word Sense Discrimination For Image Retrieval

    Aurelien Lucchi, Jason Weston

    ECCV (2012)

  •    

    Learning Hierarchical Bag of Words Using Naive Bayes Clustering

    Siddhartha Chandra, Shailesh Kumar, C. V. Jawahar

    Asian Conference on Computer Vision (2012), pp. 382-395

  •    

    MEASURING NOISE CORRELATION FOR IMPROVED VIDEO DENOISING

    Anil Kokaram, Damien Kelly, Hugh Denman, Andrew Crawford

    IEEE International Conference on Image Processing, IEEE, 1600 Amphitheatre Parkway (2012)

  •    

    Mobile Music Modeling, Analysis and Recognition

    Pavel Golik, Boulos Harb, Ananya Misra, Michael Riley, Alex Rudnick, Eugene Weinstein

    International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2012)

  •    

    Model Recommendation for Action Recognition

    Pyry Matikainen, Rahul Sukthankar, Martial Hebert

    IEEE International Conference on Computer Vision and Pattern Recognition (CVPR'12) (2012)

  •  

    Modelling the Distortion Produced by Cochlear Compression

    Roy D. Patterson, Timothy Ives, Thomas C. Walters, Richard F. Lyon

    16th International Symposium on Hearing (2012)

  •  

    Molli: Interactive Visualization for Exploratory Protein Analysis

    Sara L. Su, Connor Gramazio, Megan Strait, Caitlin Crumm, Daniela Extrum-Fernandez, Matt Menke, Lenore Cowen

    IEEE Computer Graphics & Applications, vol. 32 (2012), pp. 62-69

  •    

    Multi-component Models for Object Detection

    Chunhui Gu, Pablo Arbelaez, Yuanqing Lin, Kai Yu, Jitendra Malik

    European Conference on Computer Vision, Springer (2012), Volume 4, 445-458

  •   

    Multimedia Semantics: Interactions Between Content and Community

    Hari Sundaram, Lexing Xie, Munmun De Choudhury, Yu-Ru Lin, Apostol Natsev

    Proceedings of the IEEE, vol. 100, no. 9 (2012)

  •    

    On Using Nearly-Independent Feature Families for High Precision and Confidence

    Omid Madani, Manfred Georg, David Ross

    Fourth Asian Machine Learning Conference, JMLR workshop and conference proceedings (2012), pp. 269-284

  •   

    Photo Tours

    Avanish Kushal, Ben Self, Yasutaka Furukawa, David Gallup, Carlos Hernandez, Brian Curless, Steve Seitz

    3DimPVT 2012 (to appear)

  •    

    Real-Time Human Pose Tracking from Range Data

    Varun Ganapathi, Christian Plagemann, Daphne Koller, Sebastian Thrun

    Proceedings of the European Conference on Computer Vision (ECCV) (2012)

  •    

    Reconstructing the World's Museums

    Jianxiong Xiao, Yasutaka Furukawa

    European Conference on Computer Vision (2012) (to appear)

  •    

    Refractive Height Fields from Single and Multiple Images

    Qi Shan, Sameer Agarwal, Brian Curless

    IEEE Conference on Computer Vision and Pattern Recognition, IEEE (2012)

  •   

    Repetition Maximization based Texture Rectification

    Dror Aiger, Niloy Mitra, Daniel Cohen-Or

    EUROGRAPHICS 2012

  •   

    Scene Aligned Pooling for Complex Video Recognition

    Liangliang Cao, Yadong Mu, Apostol Natsev, Shih-Fu Chang, Gang Hua, John R. Smith

    ECCV (2012), pp. 688-701

  •    

    Schematic Surface Reconstruction

    Changchang Wu, Sameer Agarwal, Brian Curless, Steven M. Seitz

    IEEE Conference on Computer Vision and Pattern Recognition, IEEE (2012)

  •    

    Semantic Segmentation Using Regions and Parts

    Pablo Arbelaez, Bharath Hariharan, Chunhui Gu, Saurabh Gupta, Lubomir Bourdev, Jitendra Malik

    Computer Vision and Pattern Recognition, IEEE Computer Society Washington, DC, USA (2012), pp. 3378-3385

  •   

    Semi-Supervised Hashing for Large Scale Search

    Jun Wang, Sanjiv Kumar, Shih-Fu Chang

    IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) (2012)

  •    

    Shadow Removal for Aerial Imagery by Information Theoretic Intrinsic Image Analysis

    Vivek Kwatra, Mei Han, Shengyang Dai

    International Conference on Computational Photography, IEEE (2012)

  •    

    Size Matters: Exhaustive Geometric Verification for Image Retrieval

    Henrik Stewenius, Steinar H. Gunderson, Julien Pilet

    12th European Conference on Computer Vision (ECCV), Springer (2012), pp. 674-687

  •   

    Street view goes indoors: Automatic pose estimation from uncalibrated unordered spherical panoramas

    Mohamed Aly, Jean-Yves Bouguet

    Proceedings of the 2012 IEEE Workshop on the Applications of Computer Vision, IEEE Computer Society, Washington, DC, USA, pp. 1-8

  •    

    Unsupervised Learning for Graph Matching

    Marius Leordeanu, Rahul Sukthankar, Martial Hebert

    International Journal of Computer Vision, vol. 96 (2012), pp. 28-45

  •   

    VISQOL: THE VIRTUAL SPEECH QUALITY OBJECTIVE LISTENER

    Andrew Hines, Jan Skoglund, Anil Kokaram, Naomi Harte

    International Workshop on Acoustic Signal Enhancement 2012 (IWAENC2012)

  •    

    Video Description Length Guided Constant Quality Video Coding with Bitrate Constraint

    Lei Yang, Debargha Mukherjee, Dapeng Wu

    Multimedia and Expo Workshops (ICMEW), 2012 IEEE International Conference on, IEEE, 2001 L Street, NW. Suite 700 Washington, DC 20036-4910 USA, pp. 366-371

  •    

    Visibility Based Preconditioning for Bundle Adjustment

    Avanish Kushal, Sameer Agarwal

    IEEE Conference on Computer Vision and Pattern Recognition, IEEE (2012)

  •    

    Weakly Supervised Learning of Object Segmentations from Web-Scale Video

    Glenn Hartmann, Matthias Grundmann, Judy Hoffman, David Tsai, Vivek Kwatra, Omid Madani, Sudheendra Vijayanarasimhan, Irfan Essa, James Rehg, Rahul Sukthankar

    ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part I, Springer-Verlag, Berlin, Heidelberg (2012), pp. 198-208

  •    

    A Hierarchical Conditional Random Field Model for Labeling and Images of Street Scenes

    Qixing Huang, Mei Han, Bo Wu, Sergey Ioffe

    International Conference on Computer Vision and Pattern Recognition (2011)

  •    

    A Pole-Zero Filter Cascade Provides Good Fits to Human Masking Data and to Basilar Membrane and Neural Data

    Richard F. Lyon

    Mechanics of Hearing (2011)

  •   

    Aesthetics and Emotions in Images

    Dhiraj Joshi, Ritendra Datta, Elena Fedorovskaya, Quang-Tuan Luong, James Z. Wang, Jia Li, Jiebo Luo

    IEEE Signal Processing Magazine, vol. vol. 28, no. 5 (2011), pp. 94-115

  •    

    Auditory Sparse Coding

    Steven R. Ness, Thomas Walters, Richard F. Lyon

    Music Data Mining, CRC Press/Chapman Hall (2011)

  •    

    Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths

    Matthias Grundmann, Vivek Kwatra, Irfan Essa

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2011)

  •   

    Automatic Language Identification in Music Videos with Low Level Audio and Visual Features

    Vijay Chandrasekhar, Mehmet Emre Sargin, David A. Ross

    Proceedings of the IEEE Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2011)

  •  

    Boosting Video Classification Using Cross-Video Signals

    Mehmet Emre Sargin, Hrishikesh Aradhye

    Proceedings of the IEEE Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2011) (to appear)

  •   

    Building Rome in a day

    Sameer Agarwal, Yasutaka Furukawa, Noah Snavely, Ian Simon, Brian Curless, Steven M. Seitz, Rick Szeliski

    Communications of the ACM, vol. 54 (2011), pp. 105-112

  •    

    Cascades of two-pole–two-zero asymmetric resonators are good models of peripheral auditory function

    Richard F. Lyon

    Journal of the Acoustical Society of America, vol. 130 (2011), pp. 3893-3904

  •    

    Crowdsourcing Event Detection in YouTube Videos

    Thomas Steiner, Ruben Verborgh, Rik Van de Walle, Michael Hausenblas, Joaquim Gabarro

    Detection, Representation, and Exploitation of Events in the Semantic Web (DeRiVE 2011), Bonn, Germany

  •    

    Discrete Point Based Signatures and Applications to Document Matching

    Nemanja Spasojevic, Guillaume Poncin, Dan Bloomberg

    ICIAP 2011

  •    

    Discriminative Tag Learning on YouTube Videos with Latent Sub-tags

    Weilong Yang, George Toderici

    Computer Vision and Pattern Recognition, IEEE (2011)

  •    

    Dynamic Stylized Shading Primitives

    David Vanderhaeghe, Romain Vergne, Pascal Barla, William Baxter

    Proc. Symposium on NonPhotorealistic Animation and Rendering (NPAR 2011), ACM

  •    

    Exploring Photobios

    Ira Kemelmacher-Shlizerman, Eli Shechtman, Rahul Garg, Steven Seitz

    ACM Trans. on Graphics (Proc. SIGGRAPH), vol. 30(4) (2011) (to appear)

  •    

    Feature Seeding for Action Recognition

    Pyry Matikainen, Rahul Sukthankar, Martial Hebert

    International Conference on Computer Vision (ICCV) (2011)

  •    

    Geometric Overpass Extraction from Vector Road Data and DSMs

    Joshua Schpok

    Proceedings of the 19th ACM SIGSPATIAL international Conference on Advances in Geographic information Systems, 2011 (to appear)

  •   

    Handling Label Noise in Video Classification via Multiple Instance Learning

    Thomas Leung, Yang Song, John Zhang

    ICCV'2011, IEEE

  •   

    Image Saliency: From Local to Global Context

    Meng Wang, Janusz Konrad, Prakash Ishwar, Yushi Jing, Henry Rowley

    Proc. Conference on Computer Vision and Pattern Recognition (CVPR) (2011)

  •   

    Improving Video Classification via YouTube Video Co-Watch Data

    John Zhang, Yang Song, Thomas Leung

    ACM Workshop on Social and Behavioural Networked Media Access at ACM MM 2011, ACM

  •    

    Kernelized Structural SVM Learning for Supervised Object Segmentation

    Luca Bertelli, Tianli Yu, Diem Vu, Burak Gokturk

    Proceedings of IEEE Conference on Computer Vision and Pattern Recognition 2011

  •   

    Large-Scale Image Annotation using Visual Synset

    David Tsai, Yushi Jing, Henry Rowley, Yi Liu, Sergey Ioffe, James Rehg

    Proc. International Conference on Computer Vision (ICCV) (2011)

  •    

    Limits on the Application of Frequency-based Language Models to OCR

    Ray Smith

    ICDAR, IEEE (2011), pp. 538-542

  •    

    Multicore Bundle Adjustment

    Changchang Wu, Sameer Agarwal, Brian Curless, Steven Seitz

    Proc. IEEE Conf. on Computer Vision and Pattern Recognition (2011), pp. 3057-3064

  •    

    Privacy protection and face recognition

    Andrew Senior, Sharat Pankanti

    Handbook of Face recognition, Springer, 236 Gray's Inn Road | Floor 6 London | WC1X 8HL | UK (2011), pp. 671-692

  •    

    Reading Digits in Natural Images with Unsupervised Feature Learning

    Yuval Netzer, Tao Wang, Adam Coates, Alessandro Bissacco, Bo Wu, Andrew Y. Ng

    NIPS Workshop on Deep Learning and Unsupervised Feature Learning 2011

  •    

    Sparse coding of auditory features for machine hearing in interference

    Richard F. Lyon, Gal Chechik, Jay Ponte

    Proc. ICASSP, IEEE (2011)

  •    

    Summary of Opus listening test results

    Christian Hoene, Jean-Marc Valin, Koen Vos, Jan Skoglund

    IETF, IETF (2011)

  •    

    Survey and Evaluation of Audio Fingerprinting Schemes for Mobile Query-By-Example Applications

    Vijay Chandrasekhar, Matt Sharifi, David Ross

    12th International Society for Music Information Retrieval Conference (ISMIR) (2011)

  •   

    Technical Overview of VP8, an open source video codec for the web

    Jim Bankoski, Paul Wilkins, Yaowu Xu

    2011 International Workshop on Acoustics and Video Coding and Communication, IEEE, Barcelona, Spain (to appear)

  •    

    The Power of Comparative Reasoning

    Jay Yagnik, Dennis Strelow, David Ross, Ruei-Sung Lin

    International Conference on Computer Vision, IEEE (2011)

  •    

    Using a Cascade of Asymmetric Resonators with Fast-Acting Compression as a Cochlear Model for Machine-Hearing Applications

    Richard F. Lyon

    Autumn Meeting of the Acoustical Society of Japan (2011), pp. 509-512

  •    

    Visual and Semantic Similarity in ImageNet

    Thomas Deselaers, Vittorio Ferrari

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2011), pp. 1777-1784

  •    

    Where's Waldo: Matching People in Images of Crowds

    Rahul Garg, Deva Ramanan, Steven M. Seitz, Noah Snavely

    Proc. IEEE Conf. on Computer Vision and Pattern Recognition (2011), pp. 1793-1800

  •   

    YouTubeEvent: On Large-Scale Video Event Classification

    Bingbing Ni, Yang Song, Ming Zhao

    The 3rd International Workshop on Video Event Categorization, Tagging and Retrieval for Real-World Applications at IEEE ICCV'2011

  •   

    A Large-Scale Taxonomic Classification System for Web-based Videos

    Yang Song, Ming Zhao, Reto Strobl, John Zhang, Jay Yagnik

    the 11th European Conference on Computer Vision (ECCV 2010)

  •   

    Baselines for Image Annotation

    Ameesh Makadia, Vladimir Pavlovic, Sanjiv Kumar

    International Journal on Computer Vision (IJCV) (2010)

  •    

    Beyond “Near-Duplicates”: Learning Hash Codes for Efficient Similar-Image Retrieval

    Shumeet Baluja, Michele Covell

    20th International Conference on Pattern Recognition 2010

  •    

    Comparison of Clustering Approaches for Summarizing Large Populations of Images

    Yushi Jing, Michele Covell, Henry A. Rowley

    Proceedings ICME VCIDS, IEEE, Singapore (2010)

  •    

    Discontinuous Seam-Carving for Video Retargeting

    Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan Essa

    Computer Vision and Pattern Recognition (CVPR 2010)

  •   

    Document Image Analysis (Chapter 18)

    Dan Bloomberg, Luc Vincent

    Mathematical morphology: theory and applications, ISTE-Wiley (2010), pp. 425-438

  •    

    Efficient Hierarchical Graph-Based Video Segmentation

    Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan Essa

    Computer Vision and Pattern Recognition (CVPR 2010)

  •    

    Example-based Image Compression

    Jing-Yu Cui, Saurabh Mathur, Michele Covell, Vivek Kwatra, Mei Han

    International Conference on Image Processing (ICIP 2010)

  •    

    Fast Covariance Computation and Dimensionality Reduction for Sub-Window Features in Images

    Vivek Kwatra, Mei Han

    European Conference on Computer Vision (ECCV 2010)

  •   

    Feature Tracking for Wide-Baseline Image Retrieval

    Ameesh Makadia

    European Conference on Computer Vision (ECCV) (2010)

  •    

    Google Street View: Capturing the World at Street Level

    Dragomir Anguelov, Carole Dulong, Daniel Filip, Christian Frueh, Stéphane Lafon, Richard Lyon, Abhijit Ogale, Luc Vincent, Josh Weaver

    Computer, vol. 43 (2010)

  •    

    History and Future of Auditory Filter Models

    Richard F. Lyon, Andreas G. Katsiamis, Emmanuel M. Drakakis

    Proc. ISCAS, IEEE (2010), pp. 3809-3812

  •    

    Improved Consistent Sampling, Weighted Minhash and L1 Sketching

    Sergey Ioffe

    ICDM (2010) (to appear)

  •   

    Looking for Pieces of Needles in Millions of Haystacks: Finding Distorted Audio/Video Snippets

    Michele Covell, Shumeet Baluja

    International Workshop on Computer Vision (2010)

  •    

    Machine Hearing: An Emerging Field

    Richard F. Lyon

    IEEE Signal Processing Magazine, vol. 27 (2010), pp. 131-139

  •    

    SemWebVid - Making Video a First Class Semantic Web Citizen and a First Class Web Bourgeois - Semantic Web Challenge

    Thomas Steiner, Michael Hausenblas

    9th International Semantic Web Conference (ISWC 2010)

  •   

    Semi-Supervised Hashing for Scalable Image Retrieval

    Jun Wang, Sanjiv Kumar, Shih-Fu Chang

    IEEE Conf on Computer Vision and Pattern Recognition (CVPR) (2010)

  •    

    Sound Retrieval and Ranking Using Sparse Auditory Representations

    Richard F Lyon, Martin Rehn, Samy Bengio, Thomas C. Walters, Gal Chechik

    Neural Computation, vol. 22 (2010), pp. 2390-2416

  •    

    Table Detection in Heterogeneous Documents

    Faisal Shafait, Ray Smith

    Document Analysis Systems 2010, ACM International Conference Proceedings series

  •   

    Taxonomic Classification for Web-based Videos

    Yang Song, Ming Zhao, Jay Yagnik, Xiaoyun Wu

    IEEE Conf on Computer Vision and Pattern Recognition (CVPR), IEEE (2010)

  •   

    YouTubeCat: Learning to Categorize Wild Web Videos

    Zheshen Wang, Ming Zhao, Yang Song, Sanjiv Kumar, Baoxin Li

    IEEE Conf on Computer Vision and Pattern Recognition (CVPR) (2010)

  •    

    A Biomimetic, 4.5 µW, 120+dB, Log-domain Cochlea Channel with AGC

    Andreas G. Katsiamis, Emmanuel M. Drakakis, Richard F. Lyon

    IEEE JSSC (Journal of Solid-State Circuits), vol. 44 (2009), pp. 1006-1022

  •    

    Adapting the Tesseract Open Source OCR Engine for Multilingual OCR

    Ray Smith, Daria Antonova, Dar-Shyang Lee

    MOCR '09: Proceedings of the International Workshop on Multilingual OCR (2009)

  •   

    Adaptive, selective, automatic tonal enhancement of faces

    Hrishikesh Aradhye, George D. Toderici, Jay Yagnik

    ACM Multimedia, ACM, New York, NY, USA (2009), pp. 677-680

  •   

    Audiovisual Celebrity Recognition in Unconstrained Web Videos

    Mehmet Emre Sargin, Hrishikesh Aradhye, Pedro Moreno, Ming Zhao

    Proceedings of the IEEE Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2009)

  •    

    Automatic, Efficient, Temporally-Coherent Video Enhancement for Large Scale Applications

    George Toderici, Jay Yagnik

    ACM Multimedia, ACM (2009), pp. 609-612

  •    

    Combined Orientation and Script Detection using the Tesseract OCR Engine

    Ranjith Unnikrishnan, Ray Smith

    Workshop on Multilingual OCR (MOCR), Proc. 10th Intl. Conf. on Document Analysis and Recognition (ICDAR), (2009)

  •  

    Computer Vision Interfaces for Interactive Art

    Andrew Senior, Alejandro Jaimes

    Human-Centric Interfaces for Ambient Intelligence, Elsevier (2009)

  •   

    Efficient and Robust Music Identification with Weighted Finite-State Transducers

    Mehryar Mohri, Pedro Moreno, Eugene Weinstein

    IEEE Transactions on Audio, Speech, and Language Processing, vol. to appear (2009)

  •   

    Flight patterns

    Aaron Koblin

    SIGGRAPH ASIA '09: ACM SIGGRAPH ASIA 2009 Art Gallery & Emerging Technologies: Adaptation, ACM, New York, NY, USA, pp. 29-29

  •   

    Google Newspaper Search – Image Processing and Analysis Pipeline

    Krishnendu Chaudhury, Ankur Jain, Sriram Thirthala, Vivek Sahasranaman, Shobhit Saxena, Selvam Mahalingam

    10th International Conference on Document Analysis and Recognition, ICDAR 2009, pp. 621-625

  •    

    Hybrid Page Layout Analysis via Tab-Stop Detection

    Ray Smith

    Proceedings of the 10th international conference on document analysis and recognition, IEEE (2009)

  •    

    Image Reconstruction in the Gigavision Camera

    Feng Yang, Luciano Sbaiz, Edoardo Charbon, Sabine Susstrunk, Martin Vetterli

    ICCV workshop OMNIVIS 2009

  •    

    LSH Banding for Large-Scale Retrieval with Memory and Recall Constraints

    Michele Covell, Shumeet Baluja

    International Conference on Acoustics, Speech, and Signal Processing, IEEE (2009)

  •    

    Large-scale Privacy Protection in Google Street View

    Andrea Frome, German Cheung, Ahmad Abdulkader, Marco Zennaro, Bo Wu, Alessandro Bissacco, Hartwig Adam, Hartmut Neven, Luc Vincent

    IEEE International Conference on Computer Vision (2009)

  •    

    Low Cost Correction of OCR Errors Using Learning in a Multi-Engine Environment

    Ahmad Abdulkader, Matthew R. Casey

    Proceedings of the 10th international conference on document analysis and recognition, IEEE (2009)

  •   

    Media on the web, in post-production and broadcasting: the practitioner day of the ACM 2009 International Conference on Image and Video Retrieval

    S\'{e}bastien Marcel, Roelof van Zwol, Ricardo Baeza-Yates, Oliver Heckmann, Jan Erik Solem, Johan Oomen, Hans van Gageldonk, Jean-Pierre Gehrig, Xavier Vives, Baris Sumengen

    CIVR '09: Proceeding of the ACM International Conference on Image and Video Retrieval, ACM, New York, NY, USA (2009), pp. 1-5

  •   

    Models for patch-based image restoration

    Mithun Das Gupta, Shyamsundar Rajaram, Nemanja Petrovic, Thomas S. Huang

    J. Image Video Process., vol. 2009 (2009), pp. 1-12

  •    

    Predictive Models for Music

    Jean-Francois Paiement, Yves Grandvalet, Samy Bengio

    Connection Science, vol. 21 (2009), pp. 253-272

  •   

    Privacy Protection in Video Surveillance

    Andrew W. Senior

    Springer (2009)

  •   

    SD-VBS: The San Diego Vision Benchmark Suite

    Sravanthi Kota Venkata, Ikkjin Ahn, Donghwan Jeon, Anshuman Gupta, Christopher Louie, Saturnino Garcia, Serge Belongie, Michael Bedford Taylor

    IEEE Workload Characterization Symposium, vol. 0 (2009), pp. 55-64

  •   

    Shape-based Object Recognition in Videos Using 3D Synthetic Object Models

    Alexander Toshev, Ameesh Makadia, Kostas Daniilidis

    Computer Vision and Pattern Recognition (2009)

  •   

    Softcuts: A Soft Edge Smoothness Prior for Color Image Super Resolution

    Shengyang Dai, Mei Han, Wei Xu, Ying Wu, Yihong Gong, Aggelos K. Katsaggelos

    IEEE Transactions on Image Processing (T-IP), vol. 18 (2009), pp. 969-981

  •    

    Sound Ranking Using Auditory Sparse-Code Representations

    Martin Rehn, Richard F. Lyon, Samy Bengio, Thomas C. Walters, Gal Chechik

    ICML 2009 Workshop on Sparse Method for Music Audio

  •   

    State of the Art in Example-based Texture Synthesis

    Li-Yi Wei, Sylvain Lefebvre, Vivek Kwatra, Greg Turk

    Eurographics 2009, State of the Art Report, EG-STAR, Eurographics Association

  •   

    Tour the World: building a web-scale landmark recognition engine

    Yantao Zheng, Ming Zhao, Yang Song, Hartwig Adam, Ulrich Buddemeier, Alessandro Bissacco, Fernando Brucher, Tat-Seng Chua, Hartmut Neven

    International Conference on Computer Vision and Pattern Recognition (CVPR) (2009)

  •   

    Tree detection from aerial imagery

    Lin Yang, Xiaqing Wu, Emil Praun, Xiaoxu Ma

    Proceedings of the 17th ACM SIGSPATIAL international Conference on Advances in Geographic information Systems, Seattle, Washington (2009)

  •   

    Visualizing Web Images via Google Image Swirl

    Yushi Jing, Henry A. Rowley, Chuck Rosenberg, Jingbin Wang, Michele Covell

    NIPS Workshop on Statistical Machine Learning for Visual Analytics (2009)

  •   

    A New Baseline For Image Annotation

    Ameesh Makadia, Vladimir Pavlovic, Sanjiv Kumar

    European Conference on Computer Vision (ECCV) (2008)

  •   

    Beyond Sliding Windows: Object Localization by Efficient Subwindow Search

    Christoph H. Lampert, Matthew B. Blaschko, Thomas Hofmann

    IEEE Computer Vision and Pattern Recognition (CVPR), Anchorage, AK (2008)

  •   

    Coordinated Multi-Device Presentations: Ambient-Audio Identification

    Michael Fink, Michele Covell, Shumeet Baluja

    Encyclopedia of Wireless and Mobile Communications, Taylor & Francis (2008), pp. 274-285

  •    

    Estimating the Spectral Reflectance of Natural Imagery Using Color Image Features

    Josh Hyman, Mark Hansen, Eric Graham, Deborah Estrin

    Workshop on Applications, Systems, and Algorithms for Image Sensing (2008)

  •    

    Face Tracking and Recognition with Visual Constraints in Real-World Videos

    Minyoung Kim, Sanjiv Kumar, Vladimir Pavlovic, Henry A. Rowley

    IEEE Computer Vision and Pattern Recognition (CVPR) (2008)

  •   

    Fluid in Video: Augmenting Real Video with Simulated Fluids

    Vivek Kwatra, Philippos Mordohai, Rahul Narain, Sashi Kumar Penta, Mark Carlson, Marc Pollefeys, Ming C. Lin

    Comput. Graph. Forum (Proc. Eurographics), vol. 27 (2008), pp. 487-496

  •  

    Large Scale Learning and Recognition of Faces in Web Videos

    Ming Zhao, Jay Yagnik, Hartwig Adam, David Bau

    FG2008

  •    

    Large-Scale Manifold Learning

    Ameet Talwalkar, Sanjiv Kumar, Henry A. Rowley

    Computer Vision and Pattern Recognition (CVPR) (2008)

  •   

    Linear Time Maximally Stable Extremal Regions

    David Nist{\'e}r, Henrik Stewénius

    Proc. 10th Europ. Conf. Comput. Vision (2008), pp. 183-196

  •    

    Markovian Mixture Face Recognition with discriminative face alignment

    Ming Zhao

    automatic face and gesture recognition, ieee (2008)

  •   

    Mass Personalization: Social and Interactive Applications using Sound-Track Identification

    Michael Fink, Michele Covell, Shumeet Baluja

    Journal of Multimedia Tools and Applications, vol. 36 (2008), pp. 115-132

  •   

    PageRank for Product Image Search

    Yushi Jing, Shumeet Baluja

    WWW-2008

  •   

    Permutation Grouping: Intelligent Hash Function Design for Audio & Image Retrieval

    Shumeet Baluja, Michele Covell, Sergey Ioffe

    International Conference on Acoustics, Speech and Signal Processing (ICASSP-2008)

  •    

    Reducing Photon Mapping Bandwidth by Query Reordering

    Joshua Steinhurst, Greg Coombe, Anselmo Lastra

    IEEE Transactions on Visualization and Computer Graphics, vol. 14 (2008)

  •   

    Solving the label resolution problem in supervised video content classification

    Ullas Gargi, Jay Yagnik

    MIR '08: Proceeding of the 1st ACM international conference on Multimedia information retrieval, ACM, New York, NY, USA (2008), pp. 276-282

  •  

    Stereo Matching with Color-weighted Correlation, Hierarchical Belief Propagation and Occlusion Handling

    Qingxiong Yang, Liang Wang, Ruigang Yang, Henrik Stewénius, David Nistér

    IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) (2008)

  •  

    Visual Synset: Towards a Higher-level Visual Representation

    Yantao Zheng, Ming Zhao, Shi-Yong Neo, Tat-Seng Chua, Qi Tian

    CVPR (2008)

  •    

    VisualRank: Applying PageRank to Large-Scale Image Search

    Yushi Jing, Shumeet Baluja

    IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30 (2008), pp. 1877-1890

  •   

    Waveprint: Efficient Wavelet-Based Audio Fingerprinting

    Shumeet Baluja, Michele Covell

    Pattern Recognition (2008)

  •   

    Web-scale Image Annotation

    Jiakai Liu, Rong Hu, Meihong Wang, Yi Wang, Edward Chang

    Pacific-Rim Conference on Multimedia (2008) (to appear)

  •    

    An Overview of the Tesseract OCR Engine

    Ray Smith

    Proc. Ninth Int. Conference on Document Analysis and Recognition (ICDAR), IEEE Computer Society (2007), pp. 629-633

  •   

    Audio Fingerprinting: Combining Computer Vision & Data Stream Processing

    Shumeet Baluja, Michele Covell

    Proceedings of the 2007 International Conference on Acoustics, Speech, and Signal Processing

  •   

    Automated Image Orientation Detection: A Scalable Boosting Approach

    Shumeet Baluja

    Pattern Analysis and Applications (2007)

  •   

    Automatic Alignment of Large-scale Aerial Rasters to Road-maps

    James Xiaqing Wu, Rodrigo Carceroni, Hui Fang, Steve Zelinka, Andrew Kirmse

    ACM GIS 2007, ACM

  •   

    Boosting Sex Identification Performance

    Shumeet Baluja, Henry A. Rowley

    International Journal of Computer Vision, vol. 71 (2007), pp. 111-119

  •   

    Canonical Image Selection from the Web

    Yushi Jing, Shumeet Baluja, Henry A. Rowley

    ACM International Conference on Image and Video Retrieval (2007)

  •   

    Classification of Weakly-Labeled Data with Partial Equivalence Relations

    Sanjiv Kumar, Henry A. Rowley

    International Conference on Computer Vision (ICCV) (2007)

  •   

    Detail Preserving Shape Deformation in Image Editing

    Hui Fang, John C. Hart

    Proc. SIGGRAPH 2007, ACM, San Diego, no. 12

  •   

    Efficient Complete and Incomplete Path Openings and Closings

    Hugues Talbot, Ben Appleton

    Image and Vision Computing, vol. 25, no. 4 (2007), pp. 416-425

  •   

    GRADE-IV: Visualizing Graphics Library Operations in an Executing Program

    Hidehiko Abe, Takeo Igarashi

    SIGGRAPH 2007 Posters, ACM, no. 118

  •   

    Google Books: Making the public domain universally accessible

    Adam Langley, Dan Bloomberg

    Document Recognition and Retrieval XIV, SPIE (2007), 65000H1-65000H10

  •    

    Imagers as sensors: Correlating plant CO2 uptake with digital visible-light imagery

    Josh Hyman, Eric Graham, Mark Hansen, Deborah Estrin

    Data Management for Sensor Networks (2007)

  •   

    Known-Audio Detection Using Waveprint: Spectrogram Fingerprinting By Wavelet Hashing

    Michele Covell, Shumeet Baluja

    Proceedings of the 2007 International Conference on Acoustics, Speech, and Signal Processing

  •   

    Music Identification with Weighted Finite-State Transducers

    Eugene Weinstein, Pedro J. Moreno

    Proceedings of the International Conference in Acoustics, Speech and Signal Processing (ICASSP) (2007)

  •   

    Ordinal Regression Based Subpixel Shift Estimation for Video Super-Resolution

    Mithun Das Gupta, Shyamsundar Rajaram, Thomas S. Huang, Nemanja Petrovic

    EURASIP Journal on Advances in Signal Processing, vol. 85963 (2007)

  •    

    Practical Gammatone-Like Filters for Auditory Modeling

    Andreas G. Katsiamis, Emmanuel M. Drakakis, Richard F. Lyon

    EURASIP Journal on Audio, Speech, and Music Processing, vol. 2007 (2007), pp. 12

  •   

    Practical MythTV: Building a PVR and Media Center PC

    Michael Still, Stewart Smith

    Apress (2007), pp. 350

  •   

    Raising Global Awareness with Google Earth

    Rebecca Moore

    Imaging Notes, vol. 22, no. 2 (2007), pp. 24-29

  •   

    Robust music identification, detection, and analysis

    M. Mohri, Pedro J. Moreno, Eugene Weinstein

    Proceedings of the International Conference on Music Information Retrieval (ISMIR) (2007)

  •  

    Temporally Consistent Reconstruction from Multiple Video Streams using Enhanced Belief Propagation

    E. Scott Larsen, Philippos Mordohai, Marc Pollefeys, Henry Fuchs

    Eleventh IEEE International Conference on Computer Vision (2007)

  •   

    Advertisement Detection and Replacement using Acoustic and Visual Repetition

    Michele Covell, Shumeet Baluja, Michael Fink

    Proceedings of the 2006 International Workshop on Multimedia Signal Processing, IEEE

  •   

    Content Fingerprinting Using Wavelets

    Shumeet Baluja, Michele Covell

    Proceedings of the Conference of Visual Media Production, IET (2006)

  •   

    Detecting Ads in Video Streams using Acoustic and Visual Cues

    Michele Covell, Shumeet Baluja, Michael Fink

    Computer Magazine (2006), pp. 135-137

  •    

    Globally Minimal Surfaces by Continuous Maximal Flows

    Ben Appleton, Hugues Talbot

    IEEE Trans. Pattern Anal. Mach. Intell., vol. 28 (2006), pp. 106-118

  •   

    Large Scale Image-Based Adult-Content Filtering

    Henry A. Rowley, Yushi Jing, Shumeet Baluja

    1st International Conference on Computer Vision Theory, Sebutal, Portugal (2006)

  •   

    Query by Semantic Example

    Nikhil Rasiwasia, Nuno Vasconcelos, Pedro J. Moreno

    CIVR (2006), pp. 51-60

  •   

    Social- and Interactive-Television Applications Based on Real-Time Ambient-Audio Identification

    Michael Fink, Michele Covell, Shumeet Baluja

    European Interactive TV Conference (Euro-ITV) (2006)

  •   

    Time-Scale Modification for 3G-Telephony Video

    Michele Covell, Sumit Roy, Bo Shen

    Proceedings of the 2006 International Workshop on Multimedia Signal Processing, IEEE

  •  

    Boosting Sex Identification Performance

    Shumeet Baluja, Henry A. Rowley

    Proceedings of the Seventeenth Innovative Applications of Artificial Intelligence Conference, AAAI (2005), pp. 1508-1513

  •   

    Large Scale Performance Measurement of Content-Based Automated Image-Orientation Detection

    Shumeet Baluja, Henry A. Rowley

    International Conference on Image Processing, Genova, Italy (2005)

  •    

    The Definitive Guide to ImageMagick

    Michael Still

    Apress, Apress, Inc. 2560 Ninth St., Ste. 219 Berkeley, CA 94710 (2005), pp. 335

  •   

    Efficient Face Orientation Discrimination

    Shumeet Baluja, Mehran Sahami, Henry A. Rowley

    International Conference on Image Processing (ICIP-2004)