Machine Perception
179 Publications
-
Accelerating defocus blur magnification
Florian Kriener, Thomas Binder, Manuel Wille
Proceedings SPIE Vol. 8667 (Multimedia Content and Mobile Devices), SPIE (2013)
-
Discriminative Segment Annotation in Weakly Labeled Video
Kevin Tang, Rahul Sukthankar, Jay Yagnik, Li Fei-Fei
Proceedings of International Conference on Computer Vision and Pattern Recognition (CVPR 2013)
-
Fast, Accurate Detection of 100,000 Object Classes on a Single Machine
Thomas Dean, Mark Ruzon, Mark Segal, Jon Shlens, Sudheendra Vijayanarasimhan, Jay Yagnik
Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE Computer Society, Washington, DC, USA (2013)
-
Fast, Accurate Detection of 100,000 Object Classes on a Single Machine: Technical Supplement
Thomas Dean, Mark Ruzon, Mark Segal, Jon Shlens, Sudheendra Vijayanarasimhan, Jay Yagnik
Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE Computer Society, Washington, DC, USA (2013)
-
Learning Binary Codes for High Dimensional Data Using Bilinear Projections
Yunchao Gong, Sanjiv Kumar, Henry Rowley, Svetlana Lazebnik
IEEE Computer Vision and Pattern Recognition (2013) (to appear)
-
Reporting Neighbors in High-Dimensional Euclidean Space
Dror Aiger, Haim Kaplan, Micha Sharir
SODA (2013) (to appear)
-
Spatiotemporal Deformable Part Models for Action Detection
Yicong Tian, Rahul Sukthankar, Mubarak Shah
Proceedings of International Conference on Computer Vision and Pattern Recognition (CVPR 2013)
-
A QCQP Approach to Triangulation
Chris Aholt, Rekha Thomas, Sameer Agarwal
European Conference on Computer Vision, Springer Verlag (2012)
-
All Smiles: Automatic Photo Enhancement by Facial Expression Analysis
Rajvi Shah, Vivek Kwatra
Conference for Visual Media Production (CVMP 2012) [Best Paper]
-
Apparel silhouette attributes recognition
Wei Zhang, Emilio Antunez, Salih Gokturk, Baris Sumengen
Proceedings of the 2012 IEEE Workshop on the Applications of Computer Vision, IEEE Computer Society, Washington, DC, USA, pp. 489-496
-
Building high-level features using large scale unsupervised learning
Quoc Le, Marc'Aurelio Ranzato, Rajat Monga, Matthieu Devin, Kai Chen, Greg Corrado, Jeff Dean, Andrew Ng
International Conference on Machine Learning (2012)
-
Calibration-Free Rolling Shutter Removal
Matthias Grundmann, Vivek Kwatra, Daniel Castro, Irfan Essa
International Conference on Computational Photography [Best Paper], IEEE (2012)
-
Capturing Indoor Scenes with Smartphones
Aditya Sankar, Steve Seitz
Proc. UIST (2012) (to appear)
-
Coherent image selection using a fast approximation to the generalized traveling salesman problem
Meng Wang, Prakash Ishwar, Janusz Konrad, Cenk Gazen, Rohit Saboo
Proceedings of the 20th ACM international conference on Multimedia, ACM, New York, NY, USA (2012), pp. 981-984
-
D-Nets: Beyond Patch-Based Image Descriptors
Felix von Hundelshausen, Rahul Sukthankar
IEEE International Conference on Computer Vision and Pattern Recognition (CVPR'12) (2012)
-
Efficient Closed-Form Solution to Generalized Boundary Detection
Marius Leordeanu, Rahul Sukthankar, Cristian Sminchisescu
Proceedings of European Conference on Computer Vision (ECCV'12) (2012)
-
Efficient model based single and double thresholding for real time recognition
Dror Aiger, Silvio Guimarães
ACCV Workshop on Detection and Tracking in Challenging Environments (2012) (to appear)
-
Embedded Voxel Colouring with Adaptive Threshold Selection Using Globally Minimal Surfaces
Carlos Leung, Ben Appleton, Mitchell Buckley, Changming Sun
IJCV, vol. 99 (2012), pp. 215-231
-
General and Nested Wiberg Minimization
Dennis Strelow
Computer Vision and Pattern Recognition, IEEE (2012)
-
General and nested Wiberg minimization: L2 and maximum likelihood
Dennis Strelow
European Conference on Computer Vision, Springer (2012)
-
Improved Prediction of Nearly-Periodic Signals
Bastiaan Kleijn, Jan Skoglund
International Workshop on Acoustic Signal Enhancement 2012 (IWAENC2012)
-
Improving Book OCR by Adaptive Language and Image Models
Dar-Shyang Lee, Ray Smith
Proceedings of 2012 10th IAPR International Workshop on Document Analysis Systems, IEEE, pp. 115-119
-
Joint Image and Word Sense Discrimination For Image Retrieval
Aurelien Lucchi, Jason Weston
ECCV (2012)
-
Measuring Noise Correlation for Improved Video Denoising
Anil Kokaram, Damien Kelly, Hugh Denman, Andrew Crawford
IEEE International Conference on Image Processing, IEEE (2012)
-
Mobile Music Modeling, Analysis and Recognition
Pavel Golik, Boulos Harb, Ananya Misra, Michael Riley, Alex Rudnick, Eugene Weinstein
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2012)
-
Model Recommendation for Action Recognition
Pyry Matikainen, Rahul Sukthankar, Martial Hebert
IEEE International Conference on Computer Vision and Pattern Recognition (CVPR'12) (2012)
-
Modelling the Distortion Produced by Cochlear Compression
Roy D. Patterson, Timothy Ives, Thomas C. Walters, Richard F. Lyon
16th International Symposium on Hearing (2012)
-
Molli: Interactive Visualization for Exploratory Protein Analysis
Sara L. Su, Connor Gramazio, Megan Strait, Caitlin Crumm, Daniela Extrum-Fernandez, Matt Menke, Lenore Cowen
IEEE Computer Graphics & Applications, vol. 32 (2012), pp. 62-69
-
Multi-component Models for Object Detection
Chunhui Gu, Pablo Arbelaez, Yuanqing Lin, Kai Yu, Jitendra Malik
European Conference on Computer Vision, Springer (2012), vol. 4, pp. 445-458
-
On Using Nearly-Independent Feature Families for High Precision and Confidence
Omid Madani, Manfred Georg, David Ross
Fourth Asian Machine Learning Conference, JMLR workshop and conference proceedings (2012), pp. 269-284
-
Photo Tours
Avanish Kushal, Ben Self, Yasutaka Furukawa, David Gallup, Carlos Hernandez, Brian Curless, Steve Seitz
3DimPVT 2012 (to appear)
-
Real-Time Human Pose Tracking from Range Data
Varun Ganapathi, Christian Plagemann, Daphne Koller, Sebastian Thrun
Proceedings of the European Conference on Computer Vision (ECCV) (2012) (to appear)
-
Reconstructing the World's Museums
Jianxiong Xiao, Yasutaka Furukawa
European Conference on Computer Vision (2012) (to appear)
-
Refractive Height Fields from Single and Multiple Images
Qi Shan, Sameer Agarwal, Brian Curless
IEEE Conference on Computer Vision and Pattern Recognition, IEEE (2012)
-
Repetition Maximization based Texture Rectification
Dror Aiger, Niloy Mitra, Daniel Cohen-Or
EUROGRAPHICS 2012
-
Schematic Surface Reconstruction
Changchang Wu, Sameer Agarwal, Brian Curless, Steven M. Seitz
IEEE Conference on Computer Vision and Pattern Recognition, IEEE (2012)
-
Semantic Segmentation Using Regions and Parts
Pablo Arbelaez, Bharath Hariharan, Chunhui Gu, Saurabh Gupta, Lubomir Bourdev, Jitendra Malik
Computer Vision and Pattern Recognition, IEEE Computer Society, Washington, DC, USA (2012), pp. 3378-3385
-
Semi-Supervised Hashing for Large Scale Search
Jun Wang, Sanjiv Kumar, Shih-Fu Chang
IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) (2012)
-
Shadow Removal for Aerial Imagery by Information Theoretic Intrinsic Image Analysis
Vivek Kwatra, Mei Han, Shengyang Dai
International Conference on Computational Photography, IEEE (2012)
-
Street view goes indoors: Automatic pose estimation from uncalibrated unordered spherical panoramas
Mohamed Aly, Jean-Yves Bouguet
Proceedings of the 2012 IEEE Workshop on the Applications of Computer Vision, IEEE Computer Society, Washington, DC, USA, pp. 1-8
-
The intervalgram: An audio feature for large-scale melody recognition
Thomas C. Walters, David Ross, Richard F. Lyon
9th International Symposium on Computer Music Modeling and Retrieval (2012)
-
Unsupervised Learning for Graph Matching
Marius Leordeanu, Rahul Sukthankar, Martial Hebert
International Journal of Computer Vision, vol. 96 (2012), pp. 28-45
-
ViSQOL: The Virtual Speech Quality Objective Listener
Andrew Hines, Jan Skoglund, Anil Kokaram, Naomi Harte
International Workshop on Acoustic Signal Enhancement 2012 (IWAENC2012)
-
Video Description Length Guided Constant Quality Video Coding with Bitrate Constraint
Lei Yang, Debargha Mukherjee, Dapeng Wu
Multimedia and Expo Workshops (ICMEW), 2012 IEEE International Conference on, IEEE, pp. 366-371
-
Visibility Based Preconditioning for Bundle Adjustment
Avanish Kushal, Sameer Agarwal
IEEE Conference on Computer Vision and Pattern Recognition, IEEE (2012)
-
Weakly Supervised Learning of Object Segmentations from Web-Scale Video
Glenn Hartmann, Matthias Grundmann, Judy Hoffman, David Tsai, Vivek Kwatra, Omid Madani, Sudheendra Vijayanarasimhan, Irfan Essa, James Rehg, Rahul Sukthankar
ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part I, Springer-Verlag, Berlin, Heidelberg (2012), pp. 198-208
-
A Hierarchical Conditional Random Field Model for Labeling and Segmenting Images of Street Scenes
Qixing Huang, Mei Han, Bo Wu, Sergey Ioffe
International Conference on Computer Vision and Pattern Recognition (2011)
-
A Pole-Zero Filter Cascade Provides Good Fits to Human Masking Data and to Basilar Membrane and Neural Data
Mechanics of Hearing (2011)
-
Aesthetics and Emotions in Images
Dhiraj Joshi, Ritendra Datta, Elena Fedorovskaya, Quang-Tuan Luong, James Z. Wang, Jia Li, Jiebo Luo
IEEE Signal Processing Magazine, vol. 28, no. 5 (2011), pp. 94-115
-
Auditory Sparse Coding
Steven R. Ness, Thomas Walters, Richard F. Lyon
Music Data Mining, CRC Press/Chapman Hall (2011)
-
Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths
Matthias Grundmann, Vivek Kwatra, Irfan Essa
IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2011)
-
Automatic Language Identification in Music Videos with Low Level Audio and Visual Features
Vijay Chandrasekhar, Mehmet Emre Sargin, David A. Ross
Proceedings of the IEEE Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2011)
-
Boosting Video Classification Using Cross-Video Signals
Mehmet Emre Sargin, Hrishikesh Aradhye
Proceedings of the IEEE Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2011) (to appear)
-
Building Rome in a day
Sameer Agarwal, Yasutaka Furukawa, Noah Snavely, Ian Simon, Brian Curless, Steven M. Seitz, Rick Szeliski
Communications of the ACM, vol. 54 (2011), pp. 105-112
-
Cascades of two-pole–two-zero asymmetric resonators are good models of peripheral auditory function
Journal of the Acoustical Society of America, vol. 130 (2011), pp. 3893-3904
-
Crowdsourcing Event Detection in YouTube Videos
Thomas Steiner, Ruben Verborgh, Rik Van de Walle, Michael Hausenblas, Joaquim Gabarro
Detection, Representation, and Exploitation of Events in the Semantic Web (DeRiVE 2011), Bonn, Germany
-
Discrete Point Based Signatures and Applications to Document Matching
Nemanja Spasojevic, Guillaume Poncin, Dan Bloomberg
ICIAP 2011
-
Discriminative Tag Learning on YouTube Videos with Latent Sub-tags
Weilong Yang, George Toderici
Computer Vision and Pattern Recognition, IEEE (2011) (to appear)
-
Dynamic Stylized Shading Primitives
David Vanderhaeghe, Romain Vergne, Pascal Barla, William Baxter
Proc. Symposium on NonPhotorealistic Animation and Rendering (NPAR 2011), ACM
-
Exploring Photobios
Ira Kemelmacher-Shlizerman, Eli Shechtman, Rahul Garg, Steven Seitz
ACM Trans. on Graphics (Proc. SIGGRAPH), vol. 30(4) (2011) (to appear)
-
Feature Seeding for Action Recognition
Pyry Matikainen, Rahul Sukthankar, Martial Hebert
International Conference on Computer Vision (ICCV) (2011)
-
Geometric Overpass Extraction from Vector Road Data and DSMs
Joshua Schpok
Proceedings of the 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2011 (to appear)
-
Handling Label Noise in Video Classification via Multiple Instance Learning
Thomas Leung, Yang Song, John Zhang
ICCV'2011, IEEE (to appear)
-
Image Saliency: From Local to Global Context
Meng Wang, Janusz Konrad, Prakash Ishwar, Yushi Jing, Henry Rowley
Proc. Conference on Computer Vision and Pattern Recognition (CVPR) (2011)
-
Improving Video Classification via YouTube Video Co-Watch Data
John Zhang, Yang Song, Thomas Leung
ACM Workshop on Social and Behavioural Networked Media Access at ACM MM 2011, ACM (to appear)
-
Kernelized Structural SVM Learning for Supervised Object Segmentation
Luca Bertelli, Tianli Yu, Diem Vu, Burak Gokturk
Proceedings of IEEE Conference on Computer Vision and Pattern Recognition 2011
-
Large-Scale Image Annotation using Visual Synset
David Tsai, Yushi Jing, Henry Rowley, Yi Liu, Sergey Ioffe, James Rehg
Proc. International Conference on Computer Vision (ICCV) (2011)
-
Limits on the Application of Frequency-based Language Models to OCR
ICDAR, IEEE (2011), pp. 538-542
-
Multicore Bundle Adjustment
Changchang Wu, Sameer Agarwal, Brian Curless, Steven Seitz
Proc. IEEE Conf. on Computer Vision and Pattern Recognition (2011), pp. 3057-3064
-
Privacy protection and face recognition
Andrew Senior, Sharat Pankanti
Handbook of Face Recognition, Springer (2011), pp. 671-692
-
Reading Digits in Natural Images with Unsupervised Feature Learning
Yuval Netzer, Tao Wang, Adam Coates, Alessandro Bissacco, Bo Wu, Andrew Y. Ng
NIPS Workshop on Deep Learning and Unsupervised Feature Learning 2011
-
Sparse coding of auditory features for machine hearing in interference
Richard F. Lyon, Gal Chechik, Jay Ponte
Proc. ICASSP, IEEE (2011)
-
Survey and Evaluation of Audio Fingerprinting Schemes for Mobile Query-By-Example Applications
Vijay Chandrasekhar, Matt Sharifi, David Ross
12th International Society for Music Information Retrieval Conference (ISMIR) (2011)
-
Technical Overview of VP8, an open source video codec for the web
Jim Bankoski, Paul Wilkins, Yaowu Xu
2011 International Workshop on Acoustics and Video Coding and Communication, IEEE, Barcelona, Spain (to appear)
-
The Power of Comparative Reasoning
Jay Yagnik, Dennis Strelow, David Ross, Ruei-Sung Lin
International Conference on Computer Vision, IEEE (2011)
-
Using a Cascade of Asymmetric Resonators with Fast-Acting Compression as a Cochlear Model for Machine-Hearing Applications
Autumn Meeting of the Acoustical Society of Japan (2011), pp. 509-512
-
Visual and Semantic Similarity in ImageNet
Thomas Deselaers, Vittorio Ferrari
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2011), pp. 1777-1784
-
Where's Waldo: Matching People in Images of Crowds
Rahul Garg, Deva Ramanan, Steven M. Seitz, Noah Snavely
Proc. IEEE Conf. on Computer Vision and Pattern Recognition (2011), pp. 1793-1800
-
YouTubeEvent: On Large-Scale Video Event Classification
Bingbing Ni, Yang Song, Ming Zhao
The 3rd International Workshop on Video Event Categorization, Tagging and Retrieval for Real-World Applications at IEEE ICCV'2011 (to appear)
-
A Large-Scale Taxonomic Classification System for Web-based Videos
Yang Song, Ming Zhao, Reto Strobl, John Zhang, Jay Yagnik
The 11th European Conference on Computer Vision (ECCV 2010)
-
Baselines for Image Annotation
Ameesh Makadia, Vladimir Pavlovic, Sanjiv Kumar
International Journal of Computer Vision (IJCV) (2010)
-
Beyond “Near-Duplicates”: Learning Hash Codes for Efficient Similar-Image Retrieval
Shumeet Baluja, Michele Covell
20th International Conference on Pattern Recognition 2010
-
Comparison of Clustering Approaches for Summarizing Large Populations of Images
Yushi Jing, Michele Covell, Henry A. Rowley
Proceedings ICME VCIDS, IEEE, Singapore (2010)
-
Discontinuous Seam-Carving for Video Retargeting
Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan Essa
Computer Vision and Pattern Recognition (CVPR 2010)
-
Document Image Analysis (Chapter 18)
Dan Bloomberg, Luc Vincent
Mathematical morphology: theory and applications, ISTE-Wiley (2010), pp. 425-438
-
Efficient Hierarchical Graph-Based Video Segmentation
Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan Essa
Computer Vision and Pattern Recognition (CVPR 2010)
-
Example-based Image Compression
Jing-Yu Cui, Saurabh Mathur, Michele Covell, Vivek Kwatra, Mei Han
International Conference on Image Processing (ICIP 2010)
-
Fast Covariance Computation and Dimensionality Reduction for Sub-Window Features in Images
European Conference on Computer Vision (ECCV 2010)
-
Google Image Swirl, a large-scale content-based image browsing engine
Yushi Jing, Henry Rowley, Charles Rosenberg, Jingbin Wang, Michele Covell, Liu Yi, Marius Pasca
Demo at IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2010)
-
Google Street View: Capturing the World at Street Level
Dragomir Anguelov, Carole Dulong, Daniel Filip, Christian Frueh, Stéphane Lafon, Richard Lyon, Abhijit Ogale, Luc Vincent, Josh Weaver
Computer, vol. 43 (2010)
-
History and Future of Auditory Filter Models
Richard F. Lyon, Andreas G. Katsiamis, Emmanuel M. Drakakis
Proc. ISCAS, IEEE (2010), pp. 3809-3812
-
Improved Consistent Sampling, Weighted Minhash and L1 Sketching
Sergey Ioffe
ICDM (2010) (to appear)
-
Looking for Pieces of Needles in Millions of Haystacks: Finding Distorted Audio/Video Snippets
Michele Covell, Shumeet Baluja
International Workshop on Computer Vision (2010)
-
Machine Hearing: An Emerging Field
IEEE Signal Processing Magazine, vol. 27 (2010), pp. 131-139
-
SemWebVid - Making Video a First Class Semantic Web Citizen and a First Class Web Bourgeois - Semantic Web Challenge
Thomas Steiner, Michael Hausenblas
9th International Semantic Web Conference (ISWC 2010)
-
Semi-Supervised Hashing for Scalable Image Retrieval
Jun Wang, Sanjiv Kumar, Shih-Fu Chang
IEEE Conf on Computer Vision and Pattern Recognition (CVPR) (2010)
-
Sound Retrieval and Ranking Using Sparse Auditory Representations
Richard F Lyon, Martin Rehn, Samy Bengio, Thomas C. Walters, Gal Chechik
Neural Computation, vol. 22 (2010), pp. 2390-2416
-
Table Detection in Heterogeneous Documents
Faisal Shafait, Ray Smith
Document Analysis Systems 2010, ACM International Conference Proceedings series
-
Taxonomic Classification for Web-based Videos
Yang Song, Ming Zhao, Jay Yagnik, Xiaoyun Wu
IEEE Conf on Computer Vision and Pattern Recognition (CVPR), IEEE (2010)
-
YouTubeCat: Learning to Categorize Wild Web Videos
Zheshen Wang, Ming Zhao, Yang Song, Sanjiv Kumar, Baoxin Li
IEEE Conf on Computer Vision and Pattern Recognition (CVPR) (2010)
-
A Biomimetic, 4.5 µW, 120+dB, Log-domain Cochlea Channel with AGC
Andreas G. Katsiamis, Emmanuel M. Drakakis, Richard F. Lyon
IEEE JSSC (Journal of Solid-State Circuits), vol. 44 (2009), pp. 1006-1022
-
Adapting the Tesseract Open Source OCR Engine for Multilingual OCR
Ray Smith, Daria Antonova, Dar-Shyang Lee
MOCR '09: Proceedings of the International Workshop on Multilingual OCR (2009)
-
Adaptive, selective, automatic tonal enhancement of faces
Hrishikesh Aradhye, George D. Toderici, Jay Yagnik
ACM Multimedia, ACM, New York, NY, USA (2009), pp. 677-680
-
Audiovisual Celebrity Recognition in Unconstrained Web Videos
Mehmet Emre Sargin, Hrishikesh Aradhye, Pedro Moreno, Ming Zhao
Proceedings of the IEEE Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2009)
-
Automatic, Efficient, Temporally-Coherent Video Enhancement for Large Scale Applications
ACM Multimedia, ACM (2009), pp. 609-612
-
Combined Orientation and Script Detection using the Tesseract OCR Engine
Ranjith Unnikrishnan, Ray Smith
Workshop on Multilingual OCR (MOCR), Proc. 10th Intl. Conf. on Document Analysis and Recognition (ICDAR), (2009)
-
Computer Vision Interfaces for Interactive Art
Andrew Senior, Alejandro Jaimes
Human-Centric Interfaces for Ambient Intelligence, Elsevier (2009)
-
Efficient and Robust Music Identification with Weighted Finite-State Transducers
Mehryar Mohri, Pedro Moreno, Eugene Weinstein
IEEE Transactions on Audio, Speech, and Language Processing (2009) (to appear)
-
Flight patterns
Aaron Koblin
SIGGRAPH ASIA '09: ACM SIGGRAPH ASIA 2009 Art Gallery & Emerging Technologies: Adaptation, ACM, New York, NY, USA, p. 29
-
Google Newspaper Search – Image Processing and Analysis Pipeline
Krishnendu Chaudhury, Ankur Jain, Sriram Thirthala, Vivek Sahasranaman, Shobhit Saxena, Selvam Mahalingam
10th International Conference on Document Analysis and Recognition, ICDAR 2009, pp. 621-625
-
Hybrid Page Layout Analysis via Tab-Stop Detection
Proceedings of the 10th international conference on document analysis and recognition, IEEE (2009)
-
Image Reconstruction in the Gigavision Camera
Feng Yang, Luciano Sbaiz, Edoardo Charbon, Sabine Susstrunk, Martin Vetterli
ICCV workshop OMNIVIS 2009
-
LSH Banding for Large-Scale Retrieval with Memory and Recall Constraints
Michele Covell, Shumeet Baluja
International Conference on Acoustics, Speech, and Signal Processing, IEEE (2009)
-
Large-scale Privacy Protection in Google Street View
Andrea Frome, German Cheung, Ahmad Abdulkader, Marco Zennaro, Bo Wu, Alessandro Bissacco, Hartwig Adam, Hartmut Neven, Luc Vincent
IEEE International Conference on Computer Vision (2009)
-
Low Cost Correction of OCR Errors Using Learning in a Multi-Engine Environment
Ahmad Abdulkader, Matthew R. Casey
Proceedings of the 10th international conference on document analysis and recognition, IEEE (2009)
-
Media on the web, in post-production and broadcasting: the practitioner day of the ACM 2009 International Conference on Image and Video Retrieval
Sébastien Marcel, Roelof van Zwol, Ricardo Baeza-Yates, Oliver Heckmann, Jan Erik Solem, Johan Oomen, Hans van Gageldonk, Jean-Pierre Gehrig, Xavier Vives, Baris Sumengen
CIVR '09: Proceeding of the ACM International Conference on Image and Video Retrieval, ACM, New York, NY, USA (2009), pp. 1-5
-
Models for patch-based image restoration
Mithun Das Gupta, Shyamsundar Rajaram, Nemanja Petrovic, Thomas S. Huang
J. Image Video Process., vol. 2009 (2009), pp. 1-12
-
Predictive Models for Music
Jean-Francois Paiement, Yves Grandvalet, Samy Bengio
Connection Science, vol. 21 (2009), pp. 253-272
-
Privacy Protection in Video Surveillance
Springer (2009)
-
SD-VBS: The San Diego Vision Benchmark Suite
Sravanthi Kota Venkata, Ikkjin Ahn, Donghwan Jeon, Anshuman Gupta, Christopher Louie, Saturnino Garcia, Serge Belongie, Michael Bedford Taylor
IEEE Workload Characterization Symposium, vol. 0 (2009), pp. 55-64
-
Shape-based Object Recognition in Videos Using 3D Synthetic Object Models
Alexander Toshev, Ameesh Makadia, Kostas Daniilidis
Computer Vision and Pattern Recognition (2009)
-
Softcuts: A Soft Edge Smoothness Prior for Color Image Super Resolution
Shengyang Dai, Mei Han, Wei Xu, Ying Wu, Yihong Gong, Aggelos K. Katsaggelos
IEEE Transactions on Image Processing (T-IP), vol. 18 (2009), pp. 969-981
-
Sound Ranking Using Auditory Sparse-Code Representations
Martin Rehn, Richard F. Lyon, Samy Bengio, Thomas C. Walters, Gal Chechik
ICML 2009 Workshop on Sparse Method for Music Audio
-
State of the Art in Example-based Texture Synthesis
Li-Yi Wei, Sylvain Lefebvre, Vivek Kwatra, Greg Turk
Eurographics 2009, State of the Art Report, EG-STAR, Eurographics Association
-
Tour the World: building a web-scale landmark recognition engine
Yantao Zheng, Ming Zhao, Yang Song, Hartwig Adam, Ulrich Buddemeier, Alessandro Bissacco, Fernando Brucher, Tat-Seng Chua, Hartmut Neven
International Conference on Computer Vision and Pattern Recognition (CVPR) (2009)
-
Tree detection from aerial imagery
Lin Yang, Xiaqing Wu, Emil Praun, Xiaoxu Ma
Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Seattle, Washington (2009)
-
Visualizing Web Images via Google Image Swirl
Yushi Jing, Henry A. Rowley, Chuck Rosenberg, Jingbin Wang, Michele Covell
NIPS Workshop on Statistical Machine Learning for Visual Analytics (2009)
-
A New Baseline For Image Annotation
Ameesh Makadia, Vladimir Pavlovic, Sanjiv Kumar
European Conference on Computer Vision (ECCV) (2008)
-
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search
Christoph H. Lampert, Matthew B. Blaschko, Thomas Hofmann
IEEE Computer Vision and Pattern Recognition (CVPR), Anchorage, AK (2008)
-
Coordinated Multi-Device Presentations: Ambient-Audio Identification
Michael Fink, Michele Covell, Shumeet Baluja
Encyclopedia of Wireless and Mobile Communications, Taylor & Francis (2008), pp. 274-285
-
Estimating the Spectral Reflectance of Natural Imagery Using Color Image Features
Josh Hyman, Mark Hansen, Eric Graham, Deborah Estrin
Workshop on Applications, Systems, and Algorithms for Image Sensing (2008)
-
Face Tracking and Recognition with Visual Constraints in Real-World Videos
Minyoung Kim, Sanjiv Kumar, Vladimir Pavlovic, Henry A. Rowley
IEEE Computer Vision and Pattern Recognition (CVPR) (2008)
-
Fluid in Video: Augmenting Real Video with Simulated Fluids
Vivek Kwatra, Philippos Mordohai, Rahul Narain, Sashi Kumar Penta, Mark Carlson, Marc Pollefeys, Ming C. Lin
Comput. Graph. Forum (Proc. Eurographics), vol. 27 (2008), pp. 487-496
-
Large Scale Learning and Recognition of Faces in Web Videos
Ming Zhao, Jay Yagnik, Hartwig Adam, David Bau
FG2008
-
Large-Scale Manifold Learning
Ameet Talwalkar, Sanjiv Kumar, Henry A. Rowley
Computer Vision and Pattern Recognition (CVPR) (2008)
-
Linear Time Maximally Stable Extremal Regions
David Nistér, Henrik Stewénius
Proc. 10th Europ. Conf. Comput. Vision (2008), pp. 183-196
-
Markovian Mixture Face Recognition with discriminative face alignment
Ming Zhao
Automatic Face and Gesture Recognition, IEEE (2008)
-
Mass Personalization: Social and Interactive Applications using Sound-Track Identification
Michael Fink, Michele Covell, Shumeet Baluja
Journal of Multimedia Tools and Applications, vol. 36 (2008), pp. 115-132
-
PageRank for Product Image Search
Yushi Jing, Shumeet Baluja
WWW-2008
-
Permutation Grouping: Intelligent Hash Function Design for Audio & Image Retrieval
Shumeet Baluja, Michele Covell, Sergey Ioffe
International Conference on Acoustics, Speech and Signal Processing (ICASSP-2008)
-
Reducing Photon Mapping Bandwidth by Query Reordering
Joshua Steinhurst, Greg Coombe, Anselmo Lastra
IEEE Transactions on Visualization and Computer Graphics, vol. 14 (2008)
-
Solving the label resolution problem in supervised video content classification
MIR '08: Proceeding of the 1st ACM international conference on Multimedia information retrieval, ACM, New York, NY, USA (2008), pp. 276-282
-
Stereo Matching with Color-weighted Correlation, Hierarchical Belief Propagation and Occlusion Handling
Qingxiong Yang, Liang Wang, Ruigang Yang, Henrik Stewénius, David Nistér
IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) (2008)
-
Visual Synset: Towards a Higher-level Visual Representation
Yantao Zheng, Ming Zhao, Shi-Yong Neo, Tat-Seng Chua, Qi Tian
CVPR (2008)
-
VisualRank: Applying PageRank to Large-Scale Image Search
Yushi Jing, Shumeet Baluja
IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30 (2008), pp. 1877-1890
-
Waveprint: Efficient Wavelet-Based Audio Fingerprinting
Shumeet Baluja, Michele Covell
Pattern Recognition (2008)
-
Web-scale Image Annotation
Jiakai Liu, Rong Hu, Meihong Wang, Yi Wang, Edward Chang
Pacific-Rim Conference on Multimedia (2008) (to appear)
-
An Overview of the Tesseract OCR Engine
Proc. Ninth Int. Conference on Document Analysis and Recognition (ICDAR), IEEE Computer Society (2007), pp. 629-633
-
Audio Fingerprinting: Combining Computer Vision & Data Stream Processing
Shumeet Baluja, Michele Covell
Proceedings of the 2007 International Conference on Acoustics, Speech, and Signal Processing
-
Automated Image Orientation Detection: A Scalable Boosting Approach
Pattern Analysis and Applications (2007)
-
Automatic Alignment of Large-scale Aerial Rasters to Road-maps
James Xiaqing Wu, Rodrigo Carceroni, Hui Fang, Steve Zelinka, Andrew Kirmse
ACM GIS 2007, ACM
-
Boosting Sex Identification Performance
Shumeet Baluja, Henry A. Rowley
International Journal of Computer Vision, vol. 71 (2007), pp. 111-119
-
Canonical Image Selection from the Web
Yushi Jing, Shumeet Baluja, Henry A. Rowley
ACM International Conference on Image and Video Retrieval (2007)
-
Classification of Weakly-Labeled Data with Partial Equivalence Relations
International Conference on Computer Vision (ICCV) (2007)
-
Detail Preserving Shape Deformation in Image Editing
Hui Fang, John C. Hart
Proc. SIGGRAPH 2007, ACM, San Diego, no. 12
-
Efficient Complete and Incomplete Path Openings and Closings
Hugues Talbot, Ben Appleton
Image and Vision Computing, vol. 25, no. 4 (2007), pp. 416-425
-
GRADE-IV: Visualizing Graphics Library Operations in an Executing Program
Hidehiko Abe, Takeo Igarashi
SIGGRAPH 2007 Posters, ACM, no. 118
-
Google Books: Making the public domain universally accessible
Adam Langley, Dan Bloomberg
Document Recognition and Retrieval XIV, SPIE (2007), 65000H1-65000H10
-
Imagers as sensors: Correlating plant CO2 uptake with digital visible-light imagery
Josh Hyman, Eric Graham, Mark Hansen, Deborah Estrin
Data Management for Sensor Networks (2007)
-
Known-Audio Detection Using Waveprint: Spectrogram Fingerprinting By Wavelet Hashing
Michele Covell, Shumeet Baluja
Proceedings of the 2007 International Conference on Acoustics, Speech, and Signal Processing
-
Music Identification with Weighted Finite-State Transducers
Eugene Weinstein, Pedro J. Moreno
Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2007)
-
Ordinal Regression Based Subpixel Shift Estimation for Video Super-Resolution
Mithun Das Gupta, Shyamsundar Rajaram, Thomas S. Huang, Nemanja Petrovic
EURASIP Journal on Advances in Signal Processing, vol. 85963 (2007)
-
Practical Gammatone-Like Filters for Auditory Modeling
Andreas G. Katsiamis, Emmanuel M. Drakakis, Richard F. Lyon
EURASIP Journal on Audio, Speech, and Music Processing, vol. 2007 (2007), pp. 12
-
Practical MythTV: Building a PVR and Media Center PC
Michael Still, Stewart Smith
Apress (2007), pp. 350
-
Raising Global Awareness with Google Earth
Imaging Notes, vol. 22, no. 2 (2007), pp. 24-29
-
Robust music identification, detection, and analysis
M. Mohri, Pedro J. Moreno, Eugene Weinstein
Proceedings of the International Conference on Music Information Retrieval (ISMIR) (2007)
-
Temporally Consistent Reconstruction from Multiple Video Streams using Enhanced Belief Propagation
E. Scott Larsen, Philippos Mordohai, Marc Pollefeys, Henry Fuchs
Eleventh IEEE International Conference on Computer Vision (2007)
-
Advertisement Detection and Replacement using Acoustic and Visual Repetition
Michele Covell, Shumeet Baluja, Michael Fink
Proceedings of the 2006 International Workshop on Multimedia Signal Processing, IEEE
-
Content Fingerprinting Using Wavelets
Shumeet Baluja, Michele Covell
Proceedings of the Conference of Visual Media Production, IET (2006)
-
Detecting Ads in Video Streams using Acoustic and Visual Cues
Michele Covell, Shumeet Baluja, Michael Fink
Computer Magazine (2006), pp. 135-137
-
Globally Minimal Surfaces by Continuous Maximal Flows
Ben Appleton, Hugues Talbot
IEEE Trans. Pattern Anal. Mach. Intell., vol. 28 (2006), pp. 106-118
-
Large Scale Image-Based Adult-Content Filtering
Henry A. Rowley, Yushi Jing, Shumeet Baluja
1st International Conference on Computer Vision Theory, Setúbal, Portugal (2006)
-
Query by Semantic Example
Nikhil Rasiwasia, Nuno Vasconcelos, Pedro J. Moreno
CIVR (2006), pp. 51-60
-
Social- and Interactive-Television Applications Based on Real-Time Ambient-Audio Identification
Michael Fink, Michele Covell, Shumeet Baluja
European Interactive TV Conference (Euro-ITV) (2006)
-
Time-Scale Modification for 3G-Telephony Video
Michele Covell, Sumit Roy, Bo Shen
Proceedings of the 2006 International Workshop on Multimedia Signal Processing, IEEE
-
Boosting Sex Identification Performance
Shumeet Baluja, Henry A. Rowley
Proceedings of the Seventeenth Innovative Applications of Artificial Intelligence Conference, AAAI (2005), pp. 1508-1513
-
Large Scale Performance Measurement of Content-Based Automated Image-Orientation Detection
Shumeet Baluja, Henry A. Rowley
International Conference on Image Processing, Genova, Italy (2005)
-
The Definitive Guide to ImageMagick
Michael Still
Apress (2005), pp. 335
-
Efficient Face Orientation Discrimination
Shumeet Baluja, Mehran Sahami, Henry A. Rowley
International Conference on Image Processing (ICIP-2004)
