Marc Najork

Marc Najork is a Senior Staff Research Scientist, working in the Strategic Technologies team. Before joining Google, Marc was a Principal Researcher at Microsoft Research (2001-2014) and prior to that a Researcher at the DEC/Compaq Systems Researcher Center (1993-2001). Marc earned a Ph.D. in Computer Science from the University of Illinois. His service activities include Editor-in-Chief of the ACM Transactions on the Web (2011-2014), news board co-chair of the Communications of the ACM (2008-2014), conference chair of WSDM 2008, and program co-chair of WWW 2004.

Google Publications

Previous Publications

  •  

    Debugging a Crowdsourced Task with Low Inter-Rater Agreement

    Omar Alonso, Catherine C. Marshall, Marc Najork

    Joint Conference on Digital Libraries (2015)

  •  

    Social Search

    Marc Najork

    14th International Conference on Web Engineering (2014)

  •  

    A Human-Centered Framework for Ensuring Reliability on Crowdsourced Labeling Tasks

    Omar Alonso, Catherine C Marshall, Marc A Najork

    First AAAI Conference on Human Computation and Crowdsourcing, AAAI (2013)

  •  

    Are Some Tweets More Interesting Than Others?# HardQuestion

    Omar Alonso, Catherine C Marshall, Marc Najork

    7th Annual Symposium on Human-Computer Interaction and Information Retrieval, ACM (2013)

  •   

    Boot-Strapping Language Identifiers for Short Colloquial Postings

    Mois├ęs Goldszmidt, Marc Najork, Stelios Paparizos

    European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD), Springer (2013), pp. 95-111

  •   

    Robust query rewriting using anchor data

    Nick Craswell, Bodo Billerbeck, Dennis Fetterly, Marc Najork

    6th ACM Intl. Conference on Web Search and Data Mining (WSDM), ACM (2013), pp. 335-344

  •   

    Detecting quilted web pages at scale

    Marc Najork

    SIGIR (2012), pp. 385-394

  •  

    Editorial

    Helen Ashman, Arun Iyengar, Marc Najork

    TWEB, vol. 6 (2012), pp. 5

  •  

    How user behavior is related to social affinity

    Rina Panigrahy, Marc Najork, Yinglian Xie

    WSDM (2012), pp. 713-722

  •  

    Of hammers and nails: an empirical comparison of three paradigms for processing large graphs

    Marc Najork, Dennis Fetterly, Alan Halverson, Krishnaram Kenthapadi, Sreenivas Gollapudi

    WSDM (2012), pp. 103-112

  •  

    Microsoft Research at TREC 2011 Web Track

    Bodo Billerbeck, Nick Craswell, Dennis Fetterly, Marc Najork

    TREC (2011)

  •  

    The Power of Peers

    Nick Craswell, Dennis Fetterly, Marc Najork

    ECIR (2011), pp. 497-502

  •  

    A Sketch-Based Distance Oracle for Web-Scale Graphs

    Atish Das Sarma, Sreenivas Gollapudi, Marc Najork, Rina Panigrahy

    Web Search and Data Mining (WSDM) (2010)

  •  

    Microsoft Research at TREC 2010 Web Track

    Nick Craswell, Dennis Fetterly, Marc Najork

    TREC (2010)

  •  

    Querying the Web Graph - (Invited Talk)

    Marc Najork

    SPIRE (2010), pp. 1-12

  •  

    Web Crawling

    Christopher Olston, Marc Najork

    Foundations and Trends in Information Retrieval, vol. 4 (2010), pp. 175-246

  •  

    Less is more: sampling the neighborhood graph makes SALSA better and faster

    Marc Najork, Sreenivas Gollapudi, Rina Panigrahy

    WSDM (2009), pp. 242-251

  •  

    Microsoft Research at TREC 2009: Web and Relevance Feedback Track

    Nick Craswell, Dennis Fetterly, Marc Najork, Stephen Robertson, Emine Yilmaz

    TREC (2009)

  •  

    The scalable hyperlink store

    Marc Najork

    Hypertext (2009), pp. 89-98

  •  

    Web Crawler Architecture

    Marc Najork

    Encyclopedia of Database Systems (2009), pp. 3462-3465

  •  

    Web Search Relevance Ranking

    Hugo Zaragoza, Marc Najork

    Encyclopedia of Database Systems (2009), pp. 3497-3501

  •  

    Web Spam Detection

    Marc Najork

    Encyclopedia of Database Systems (2009), pp. 3520-3523

  •  

    Computing Information Retrieval Performance Measures Efficiently in the Presence of Tied Scores

    Frank McSherry, Marc Najork

    ECIR (2008), pp. 414-421

  •  

    Efficient and effective link analysis with precomputed salsa maps

    Marc Najork, Nick Craswell

    CIKM (2008), pp. 53-62

  •  

    Introduction to special section on adversarial issues in Web search

    Marc Najork, Brian D. Davison 0001

    TWEB, vol. 2 (2008)

  •  

    Comparing the effectiveness of hits and salsa

    Marc Najork

    CIKM (2007), pp. 157-164

  •  

    Hits on the web: how does it compare?

    Marc Najork, Hugo Zaragoza, Michael J. Taylor

    SIGIR (2007), pp. 471-478

  •  

    Using Bloom Filters to Speed Up HITS-Like Ranking Algorithms

    Sreenivas Gollapudi, Marc Najork, Rina Panigrahy

    WAW (2007), pp. 195-201

  •  

    Adversarial information retrieval on the web (AIRWeb 2006)

    Brian D. Davison 0001, Marc Najork, Tim Converse

    SIGIR Forum, vol. 40 (2006), pp. 27-30

  •  

    Detecting spam web pages through content analysis

    Alexandros Ntoulas, Marc Najork, Mark Manasse, Dennis Fetterly

    WWW (2006), pp. 83-92

  •  

    Detecting phrase-level duplication on the world wide web

    Dennis Fetterly, Mark Manasse, Marc Najork

    SIGIR (2005), pp. 170-177

  •   

    How search engines shape the web

    Byron Dom, Krishna Bharat, Andrei Z. Broder, Marc Najork, Jan O. Pedersen, Yoshinobu Tonomura

    WWW (Special interest tracks and posters) (2005), pp. 879

  •  

    A large-scale study of the evolution of Web pages

    Dennis Fetterly, Mark Manasse, Marc Najork, Janet L. Wiener

    Softw., Pract. Exper., vol. 34 (2004), pp. 213-237

  •  

    Boxwood: Abstractions as the Foundation for Storage Infrastructure

    John MacCormick, Nick Murphy, Marc Najork, Chandramohan A. Thekkath, Lidong Zhou

    OSDI (2004), pp. 105-120

  •  

    On The Evolution of Clusters of Near-Duplicate Web Pages

    Dennis Fetterly, Mark Manasse, Marc Najork

    J. Web Eng., vol. 2 (2004), pp. 228-246

  •  

    Spam, Damn Spam, and Statistics: Using Statistical Analysis to Locate Spam Web Pages

    Dennis Fetterly, Mark Manasse, Marc Najork

    WebDB (2004), pp. 1-6

  •  

    A large-scale study of the evolution of web pages

    Dennis Fetterly, Mark Manasse, Marc Najork, Janet L. Wiener

    WWW (2003), pp. 669-678

  •  

    Efficient URL caching for world wide web crawling

    Andrei Z. Broder, Marc Najork, Janet L. Wiener

    WWW (2003), pp. 679-689

  •  

    On the Evolution of Clusters of Near-Duplicate Web Pages

    Dennis Fetterly, Mark Manasse, Marc Najork

    LA-WEB (2003), pp. 37-45

  •  

    Breadth-first crawling yields high-quality pages

    Marc Najork, Janet L. Wiener

    WWW (2001), pp. 114-118

  •  

    Web-based Algorithm Animation

    Marc Najork

    DAC (2001), pp. 506-511

  •  

    Performance limitations of the Java core libraries

    Allan Heydon, Marc Najork

    Concurrency - Practice and Experience, vol. 12 (2000), pp. 363-373

  •  

    Mercator: A Scalable, Extensible Web Crawler

    Allan Heydon, Marc Najork

    World Wide Web, vol. 2 (1999), pp. 219-229

  •  

    Performance Limitations of the Java Core Libraries

    Allan Heydon, Marc Najork

    Java Grande (1999), pp. 35-41

  •  

    A Java-Based Implementation of Collaborative Active Textbooks

    Marc H. Brown, Marc Najork, Roope Raisamo

    VL (1997), pp. 376-383

  •  

    Collaborative Active Textbooks

    Marc H. Brown, Marc Najork

    J. Vis. Lang. Comput., vol. 8 (1997), pp. 453-486

  •  

    Distributed Applets

    Marc H. Brown, Marc Najork

    CHI Extended Abstracts (1997), pp. 204-205

  •  

    Collaborative Active Textbooks: A Web-Based Algorithm Animation System for an Electronic Classroom

    Marc H. Brown, Marc Najork

    VL (1996), pp. 266-275

  •  

    Distributed Active Objects

    Marc H. Brown, Marc Najork

    Computer Networks, vol. 28 (1996), pp. 1037-1052

  •  

    Programming in Three Dimensions

    Marc Najork

    J. Vis. Lang. Comput., vol. 7 (1996), pp. 219-242

  •  

    Obliq-3D: A High-Level, Fast-Turnaround 3D Animation System

    Marc Najork, Marc H. Brown

    IEEE Trans. Vis. Comput. Graph., vol. 1 (1995), pp. 175-193

  •  

    A Library for Visualizing Combinatorial Structures

    Marc Najork, Marc H. Brown

    IEEE Visualization (1994), pp. 164-171

  •  

    Algorithm Animation Using 3D Interactive Graphics

    Marc H. Brown, Marc Najork

    ACM Symposium on User Interface Software and Technology (1993), pp. 93-100

  •  

    Cube: Eine dreidimensionale visuelle Programmiersprache

    Marc Najork, Simon M. Kaplan

    GI Jahrestagung (1993), pp. 340-345

  •  

    Specifying Visual Languages with Conditional Set Rewrite Systems

    Marc Najork, Simon M. Kaplan

    VL (1993), pp. 12-18

  •  

    A Prototype Implementation of the Cube Language

    Marc Najork, Simon M. Kaplan

    VL (1992), pp. 270-272

  •  

    The CUBE Language

    Marc Najork, Simon M. Kaplan

    VL (1991), pp. 218-224

  •  

    Enhancing Show-and-Tell with a polymorphic type system and higher-order functions

    Marc Najork, Eric J. Golin

    VL (1990), pp. 215-220

  •  

    Roles and their role in posing recursive queries

    Sharon Kuck, Roland John, Arnd Lewe, Marc Najork

    Inf. Syst., vol. 15 (1990), pp. 173-186