Marc Najork

Marc Najork is a Senior Staff Research Scientist, working in the Strategic Technologies team. Before joining Google, Marc was a Principal Researcher at Microsoft Research (2001-2014) and, prior to that, a Researcher at the DEC/Compaq Systems Research Center (1993-2001). Marc earned a Ph.D. in Computer Science from the University of Illinois. His service activities include Editor-in-Chief of the ACM Transactions on the Web (2011-2014), news board co-chair of the Communications of the ACM (2008-2014), conference chair of WSDM 2008, and program co-chair of WWW 2004.

Google Publications

Previous Publications

  •  

    Debugging a Crowdsourced Task with Low Inter-Rater Agreement

    Omar Alonso, Catherine C. Marshall, Marc Najork

    Joint Conference on Digital Libraries (2015)

  •  

    Social Search

    Marc Najork

    14th International Conference on Web Engineering (2014)

  •  

    A Human-Centered Framework for Ensuring Reliability on Crowdsourced Labeling Tasks

    Omar Alonso, Catherine C. Marshall, Marc A. Najork

    First AAAI Conference on Human Computation and Crowdsourcing, AAAI (2013)

  •  

    Are Some Tweets More Interesting Than Others? #HardQuestion

    Omar Alonso, Catherine C. Marshall, Marc Najork

    7th Annual Symposium on Human-Computer Interaction and Information Retrieval, ACM (2013)

  •   

    Boot-Strapping Language Identifiers for Short Colloquial Postings

    Moisés Goldszmidt, Marc Najork, Stelios Paparizos

    European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD), Springer (2013), pp. 95-111

  •   

    Robust query rewriting using anchor data

    Nick Craswell, Bodo Billerbeck, Dennis Fetterly, Marc Najork

    6th ACM Intl. Conference on Web Search and Data Mining (WSDM), ACM (2013), pp. 335-344

  •   

    Detecting quilted web pages at scale

    Marc Najork

    SIGIR (2012), pp. 385-394

  •  

    Editorial

    Helen Ashman, Arun Iyengar, Marc Najork

    TWEB, vol. 6 (2012), pp. 5

  •  

    How user behavior is related to social affinity

    Rina Panigrahy, Marc Najork, Yinglian Xie

    WSDM (2012), pp. 713-722

  •  

    Of hammers and nails: an empirical comparison of three paradigms for processing large graphs

    Marc Najork, Dennis Fetterly, Alan Halverson, Krishnaram Kenthapadi, Sreenivas Gollapudi

    WSDM (2012), pp. 103-112

  •  

    Microsoft Research at TREC 2011 Web Track

    Bodo Billerbeck, Nick Craswell, Dennis Fetterly, Marc Najork

    TREC (2011)

  •  

    The Power of Peers

    Nick Craswell, Dennis Fetterly, Marc Najork

    ECIR (2011), pp. 497-502

  •  

    A Sketch-Based Distance Oracle for Web-Scale Graphs

    Atish Das Sarma, Sreenivas Gollapudi, Marc Najork, Rina Panigrahy

    Web Search and Data Mining (WSDM) (2010)

  •  

    Microsoft Research at TREC 2010 Web Track

    Nick Craswell, Dennis Fetterly, Marc Najork

    TREC (2010)

  •  

    Querying the Web Graph - (Invited Talk)

    Marc Najork

    SPIRE (2010), pp. 1-12

  •  

    Web Crawling

    Christopher Olston, Marc Najork

    Foundations and Trends in Information Retrieval, vol. 4 (2010), pp. 175-246

  •  

    Less is more: sampling the neighborhood graph makes SALSA better and faster

    Marc Najork, Sreenivas Gollapudi, Rina Panigrahy

    WSDM (2009), pp. 242-251

  •  

    Microsoft Research at TREC 2009: Web and Relevance Feedback Track

    Nick Craswell, Dennis Fetterly, Marc Najork, Stephen Robertson, Emine Yilmaz

    TREC (2009)

  •  

    The scalable hyperlink store

    Marc Najork

    Hypertext (2009), pp. 89-98

  •  

    Web Crawler Architecture

    Marc Najork

    Encyclopedia of Database Systems (2009), pp. 3462-3465

  •  

    Web Search Relevance Ranking

    Hugo Zaragoza, Marc Najork

    Encyclopedia of Database Systems (2009), pp. 3497-3501

  •  

    Web Spam Detection

    Marc Najork

    Encyclopedia of Database Systems (2009), pp. 3520-3523

  •  

    Computing Information Retrieval Performance Measures Efficiently in the Presence of Tied Scores

    Frank McSherry, Marc Najork

    ECIR (2008), pp. 414-421

  •  

    Efficient and effective link analysis with precomputed SALSA maps

    Marc Najork, Nick Craswell

    CIKM (2008), pp. 53-62

  •  

    Introduction to special section on adversarial issues in Web search

    Marc Najork, Brian D. Davison

    TWEB, vol. 2 (2008)

  •  

    Comparing the effectiveness of HITS and SALSA

    Marc Najork

    CIKM (2007), pp. 157-164

  •  

    HITS on the web: how does it compare?

    Marc Najork, Hugo Zaragoza, Michael J. Taylor

    SIGIR (2007), pp. 471-478

  •  

    Using Bloom Filters to Speed Up HITS-Like Ranking Algorithms

    Sreenivas Gollapudi, Marc Najork, Rina Panigrahy

    WAW (2007), pp. 195-201

  •  

    Adversarial information retrieval on the web (AIRWeb 2006)

    Brian D. Davison, Marc Najork, Tim Converse

    SIGIR Forum, vol. 40 (2006), pp. 27-30

  •  

    Detecting spam web pages through content analysis

    Alexandros Ntoulas, Marc Najork, Mark Manasse, Dennis Fetterly

    WWW (2006), pp. 83-92

  •  

    Detecting phrase-level duplication on the world wide web

    Dennis Fetterly, Mark Manasse, Marc Najork

    SIGIR (2005), pp. 170-177

  •   

    How search engines shape the web

    Byron Dom, Krishna Bharat, Andrei Z. Broder, Marc Najork, Jan O. Pedersen, Yoshinobu Tonomura

    WWW (Special interest tracks and posters) (2005), pp. 879

  •  

    A large-scale study of the evolution of Web pages

    Dennis Fetterly, Mark Manasse, Marc Najork, Janet L. Wiener

    Softw., Pract. Exper., vol. 34 (2004), pp. 213-237

  •  

    Boxwood: Abstractions as the Foundation for Storage Infrastructure

    John MacCormick, Nick Murphy, Marc Najork, Chandramohan A. Thekkath, Lidong Zhou

    OSDI (2004), pp. 105-120

  •  

    On The Evolution of Clusters of Near-Duplicate Web Pages

    Dennis Fetterly, Mark Manasse, Marc Najork

    J. Web Eng., vol. 2 (2004), pp. 228-246

  •  

    Spam, Damn Spam, and Statistics: Using Statistical Analysis to Locate Spam Web Pages

    Dennis Fetterly, Mark Manasse, Marc Najork

    WebDB (2004), pp. 1-6

  •  

    A large-scale study of the evolution of web pages

    Dennis Fetterly, Mark Manasse, Marc Najork, Janet L. Wiener

    WWW (2003), pp. 669-678

  •  

    Efficient URL caching for world wide web crawling

    Andrei Z. Broder, Marc Najork, Janet L. Wiener

    WWW (2003), pp. 679-689

  •  

    On the Evolution of Clusters of Near-Duplicate Web Pages

    Dennis Fetterly, Mark Manasse, Marc Najork

    LA-WEB (2003), pp. 37-45

  •  

    Breadth-first crawling yields high-quality pages

    Marc Najork, Janet L. Wiener

    WWW (2001), pp. 114-118

  •  

    Web-based Algorithm Animation

    Marc Najork

    DAC (2001), pp. 506-511

  •  

    Performance limitations of the Java core libraries

    Allan Heydon, Marc Najork

    Concurrency - Practice and Experience, vol. 12 (2000), pp. 363-373

  •  

    Mercator: A Scalable, Extensible Web Crawler

    Allan Heydon, Marc Najork

    World Wide Web, vol. 2 (1999), pp. 219-229

  •  

    Performance Limitations of the Java Core Libraries

    Allan Heydon, Marc Najork

    Java Grande (1999), pp. 35-41

  •  

    A Java-Based Implementation of Collaborative Active Textbooks

    Marc H. Brown, Marc Najork, Roope Raisamo

    VL (1997), pp. 376-383

  •  

    Collaborative Active Textbooks

    Marc H. Brown, Marc Najork

    J. Vis. Lang. Comput., vol. 8 (1997), pp. 453-486

  •  

    Distributed Applets

    Marc H. Brown, Marc Najork

    CHI Extended Abstracts (1997), pp. 204-205

  •  

    Collaborative Active Textbooks: A Web-Based Algorithm Animation System for an Electronic Classroom

    Marc H. Brown, Marc Najork

    VL (1996), pp. 266-275

  •  

    Distributed Active Objects

    Marc H. Brown, Marc Najork

    Computer Networks, vol. 28 (1996), pp. 1037-1052

  •  

    Programming in Three Dimensions

    Marc Najork

    J. Vis. Lang. Comput., vol. 7 (1996), pp. 219-242

  •  

    Obliq-3D: A High-Level, Fast-Turnaround 3D Animation System

    Marc Najork, Marc H. Brown

    IEEE Trans. Vis. Comput. Graph., vol. 1 (1995), pp. 175-193

  •  

    A Library for Visualizing Combinatorial Structures

    Marc Najork, Marc H. Brown

    IEEE Visualization (1994), pp. 164-171

  •  

    Algorithm Animation Using 3D Interactive Graphics

    Marc H. Brown, Marc Najork

    ACM Symposium on User Interface Software and Technology (1993), pp. 93-100

  •  

    Cube: Eine dreidimensionale visuelle Programmiersprache

    Marc Najork, Simon M. Kaplan

    GI Jahrestagung (1993), pp. 340-345

  •  

    Specifying Visual Languages with Conditional Set Rewrite Systems

    Marc Najork, Simon M. Kaplan

    VL (1993), pp. 12-18

  •  

    A Prototype Implementation of the Cube Language

    Marc Najork, Simon M. Kaplan

    VL (1992), pp. 270-272

  •  

    The CUBE Language

    Marc Najork, Simon M. Kaplan

    VL (1991), pp. 218-224

  •  

    Enhancing Show-and-Tell with a polymorphic type system and higher-order functions

    Marc Najork, Eric J. Golin

    VL (1990), pp. 215-220

  •  

    Roles and their role in posing recursive queries

    Sharon Kuck, Roland John, Arnd Lewe, Marc Najork

    Inf. Syst., vol. 15 (1990), pp. 173-186