Marc Najork

Marc Najork is a Distinguished Research Scientist at Google DeepMind. Previously, he was a Senior Director of Research Engineering at Google Research. Before joining Google, Marc was a Principal Researcher at Microsoft Research (2001-2014) and prior to that a Researcher at the DEC/Compaq Systems Research Center (1993-2001). Marc earned a Ph.D. in Computer Science from the University of Illinois. He is an ACM Fellow, an IEEE Fellow, and a member of the SIGIR Academy. His service activities include Editor-in-Chief of the ACM Transactions on the Web (2011-2014), news board co-chair of the Communications of the ACM (2008-2014), conference chair of WSDM 2008, and program co-chair of WWW 2004 and WWW 2021.
Authored Publications
    Job Type Extraction for Service Businesses
    Yaping Qi
    Hayk Zakaryan
    Yonghua Wu
    Companion Proceedings of the ACM Web Conference 2023
    Preview abstract Google My Business (GMB) is a platform that allows business owners to manage their business profiles, which will be displayed when a user issues a relevant query on Google Search or Maps. Many GMB businesses provide diverse services, from home cleaning and plumbing to legal services and education. However, the exact service content, which we call job types, is often missing in their profiles. This leaves the burden of finding such content to the users, either through the tedious work of scanning through business websites or time-consuming calls to the owners. In the present paper, we describe how we build a pipeline to automatically extract the job types from the websites of business owners and how we solve scalability issues for deployment. Rather than focusing on developing novel and sophisticated machine learning models, we share the various challenges we have faced and practical experiences of building such a pipeline, including the cold start problem of dataset collection with limited human annotation resources, scalability, reaching a launch bar of high precision, and building a general pipeline with reasonable coverage of any free-text web pages without relying on the Document Object Model (DOM) structure. Given these challenges, standard approaches for information extraction do not directly apply or are not scalable enough to be served. In this paper, we show how we address these challenges in different stages of the extraction pipeline, including: (1) utilizing structured content like tables and lists to tackle the cold start problem of dataset collection; (2) exploiting various context information to improve model performance without hurting scalability; and (3) formulating the extraction problem as a retrieval task to improve generalizability, efficiency, and coverage. The pipeline has been successfully deployed, and is scalable enough to be refreshed every few days to extract the latest online information. The extracted job types are serving millions of users of Google Search and Google Maps with at least three use cases: (1) job types of a place are directly displayed on mobile devices; (2) job types provide an explanation as to why a place shows up for a given query; and (3) job types are used as a signal to rank business places. According to a user survey, the displayed job types have greatly increased the probability of a user hiring a service provider. View details
    Creator Context for Tweet Recommendation
    Matt Colen
    Sergey Levi
    Vladimir Ofitserov
    Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track
    Preview abstract When discussing a tweet, people usually not only refer to the content it delivers, but also to the person behind the tweet. In other words, grounding the interpretation of the tweet in the context of its creator plays an important role in deciphering the true intent and the importance of the tweet. In this paper, we attempt to answer the question of how creator context should be used to advance tweet understanding. Specifically, we investigate the usefulness of different types of creator context, and examine different model structures for incorporating creator context in tweet modeling. We evaluate our tweet understanding models on a practical use case -- recommending relevant tweets to news articles. This use case already exists in popular news apps, and can also serve as a useful assistive tool for journalists. We discover that creator context is essential for tweet understanding, and can improve application metrics by a large margin. However, we also observe that not all creator contexts are equal. Creator context can be time sensitive and noisy. Careful creator context selection and deliberate model structure design play an important role in creator context effectiveness. View details
    Regression Compatible Listwise Objectives for Calibrated Ranking with Binary Relevance
    Pratyush Kar
    Bing-Rong Lin
    Proceedings of the 32nd ACM International Conference on Information and Knowledge Management (2023)
    Preview abstract As Learning-to-Rank (LTR) approaches primarily seek to improve ranking quality, their output scores are not scale-calibrated by design. This fundamentally limits LTR usage in score-sensitive applications. Though a simple multi-objective approach that combines a regression and a ranking objective can effectively learn scale-calibrated scores, we argue that the two objectives are not necessarily compatible, which makes the trade-off less ideal for either of them. In this paper, we propose a practical regression compatible ranking (RCR) approach that achieves a better trade-off, where the two ranking and regression components are proved to be mutually aligned. Although the same idea applies to ranking with both binary and graded relevance, we mainly focus on binary labels in this paper. We evaluate the proposed approach on several public LTR benchmarks and show that it consistently achieves either best or competitive result in terms of both regression and ranking metrics, and significantly improves the Pareto frontiers in the context of multi-objective optimization. Furthermore, we evaluated the proposed approach on YouTube Search and found that it not only improved the ranking quality of the production pCTR model, but also brought gains to the click prediction accuracy. The proposed approach has been successfully deployed in the YouTube production system. View details
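    The abstract above does not spell out the loss construction, so the following is only a minimal numpy sketch of the general idea it describes -- combining a pointwise regression objective (to keep scores calibrated against binary labels) with a listwise ranking objective over one query's candidate list. The equal weighting and the particular component losses are illustrative assumptions, not the paper's RCR formulation.

      import numpy as np

      def sigmoid(x):
          return 1.0 / (1.0 + np.exp(-x))

      def calibrated_ranking_loss(scores, labels, alpha=0.5):
          # Pointwise regression term: binary cross-entropy ties the score
          # scale to the binary relevance/click labels (calibration).
          p = sigmoid(scores)
          regression = -np.mean(labels * np.log(p) + (1 - labels) * np.log(1 - p))
          # Listwise ranking term: softmax cross-entropy optimizes the ordering.
          log_softmax = scores - np.log(np.sum(np.exp(scores)))
          ranking = -np.sum(labels * log_softmax) / max(np.sum(labels), 1.0)
          return alpha * regression + (1 - alpha) * ranking

      scores = np.array([0.2, -1.1, 1.5])   # model outputs for one query
      labels = np.array([0.0, 0.0, 1.0])    # binary relevance labels
      print(calibrated_ranking_loss(scores, labels))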
    Preview abstract Automatic headline generation enables users to comprehend ongoing news events promptly and has recently become an important task in web mining and natural language processing. With the growing need for news headline generation, we argue that the hallucination issue, namely the generated headlines not being supported by the original news stories, is a critical challenge for the deployment of this feature in web-scale systems. Meanwhile, due to the infrequency of hallucination cases and the careful reading required for raters to reach the correct consensus, it is difficult to acquire a large dataset for training a model to detect such hallucinations through human curation. In this work, we present a new framework named ExHalder to address this challenge for headline hallucination detection. ExHalder adapts the knowledge from public natural language inference datasets into the news domain and learns to generate natural language sentences to explain the hallucination detection results. To evaluate the model performance, we carefully collect a dataset with more than six thousand labeled "article, headline" pairs. Extensive experiments on this dataset and another six public ones demonstrate that ExHalder can identify hallucinated headlines accurately and justify its predictions with human-readable natural language explanations. View details
    Preview abstract Unbiased learning to rank (ULTR) studies the problem of mitigating various biases from implicit user feedback data such as clicks, and has been receiving considerable attention recently. A popular ULTR approach for real-world applications uses a two-tower architecture, where click modeling is factorized into a relevance tower with regular input features, and a bias tower with bias-relevant inputs such as the position of a document. A successful factorization will allow the relevance tower to be exempt from biases. In this work, we identify a critical issue that existing ULTR methods ignored - the bias tower can be confounded with the relevance tower via the underlying true relevance. In particular, the positions were determined by the logging policy, i.e., the previous production model, which would possess relevance information. We give both theoretical analysis and empirical results to show the negative effects on relevance tower due to such a correlation. We then propose two methods to mitigate the negative confounding effects by better disentangling relevance and bias. Offline empirical results on both controlled public datasets and a large-scale industry dataset show the effectiveness of the proposed approaches. We conduct a live experiment on a popular web store for four weeks, and find a significant improvement in user clicks over the baseline, which ignores the negative confounding effect. View details
    Preview abstract Historically, information retrieval systems have all followed the same paradigm: information seekers frame their needs in the form of a short query, the system selects a small set of relevant results from a corpus of available documents, rank-orders the results by decreasing relevance, possibly excerpts a responsive passage for each result, and returns a list of references and excerpts to the user. Retrieval systems typically did not attempt fusing information from multiple documents into an answer and displaying that answer directly. This was largely due to available technology: at the core of each retrieval system is an index that maps lexical tokens or semantic embeddings to document identifiers. Indices are designed for retrieving responsive documents; they do not support integrating these documents into a holistic answer. More recently, the coming-of-age of deep neural networks has dramatically improved the capabilities of large language models (LLMs). Trained on a large corpus of documents, these models not only memorize the vocabulary, morphology and syntax of human languages, but have shown to be able to memorize facts and relations [2]. Generative language models, when provided with a prompt, will extend the prompt with likely completions – an ability that can be used to extract answers to questions from the model. Two years ago, Metzler et al. argued that this ability of LLMs will allow us to rethink the search paradigm: to answer information needs directly rather that directing users to responsive primary sources [1]. Their vision was not without controversy; the following year Shaw and Bender argued that such a system is neither feasible nor desirable [3]. Nonetheless, the past year has seen the emergence of such systems, with offerings from established search engines and multiple new entrants to the industry. The keynote will summarize the short history of these generative information retrieval systems, and focus on the many open challenges in this emerging field: ensuring that answers are grounded, attributing answer passages to a primary source, providing nuanced answers to non-factoid-seeking questions, avoiding bias, and going beyond simple regurgitation of memorized facts. It will also touch on the changing nature of the content ecosystem. LLMs are starting to be used to generate web content. Should search engines treat such derived content equal to human-authored content? Is it possible to distinguish generated from original content? How should we view hybrid authorship where humans contribute ideas and LLMs shape these ideas into prose? And how will this parallel technical evolution of search engines and content ecosystems affect their respective business models? View details
    Preview abstract Comparative decisions, such as picking between two cars or deciding between two hiking trails, require the users to visit multiple webpages and contrast the choices along relevant aspects. Given the impressive capabilities of pre-trained large language models, we ask whether they can help automate such analysis. We refer to this task as extractive aspect-based contrastive summarization which involves constructing a structured summary that compares the choices along relevant aspects. In this paper, we propose a novel method called STRUM for this task that can generalize across domains without requiring any human-written summaries or fixed aspect list as supervision. Given a set of relevant input webpages, STRUM solves this problem using two pre-trained T5-based large language models: first one fine-tuned for aspect and value extraction, and second one fine-tuned for natural language inference. We showcase the abilities of our method across different domains, identify shortcomings, and discuss questions that we believe will be critical in this new line of research. View details
    DSI++: Updating Transformer Memory with New Documents
    Yi Tay
    Jinfeng Rao
    Emma Strubell
    Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
    Preview abstract Differentiable Search Indices (DSIs) encode a corpus of documents in model parameters and use the same model to answer user queries directly. Despite the strong performance of DSI models, deploying them in situations where the corpus changes over time is computationally expensive because reindexing the corpus requires re-training the model. In this work, we introduce DSI++, a continual learning challenge for DSI to incrementally index new documents while being able to answer queries related to both previously and newly indexed documents. Across different model scales and document identifier representations, we show that continual indexing of new documents leads to considerable forgetting of previously indexed documents. We also hypothesize and verify that the model experiences forgetting events during training, leading to unstable learning. To mitigate these issues, we investigate two approaches. The first focuses on modifying the training dynamics. Flatter minima implicitly alleviate forgetting, so we optimize for flatter loss basins and show that the model stably memorizes more documents (+12%). Next, we introduce a generative memory to sample pseudo-queries for documents and supplement them during continual indexing to prevent forgetting for the retrieval task. Extensive experiments on novel continual indexing benchmarks based on Natural Questions (NQ) and MS MARCO demonstrate that our proposed solution mitigates forgetting significantly. Concretely, it improves the average Hits@10 by +21.1% over competitive baselines for NQ and requires 6 times fewer model updates compared to re-training the DSI model for incrementally indexing five corpora in a sequence. View details
    Preview abstract Query-document relevance prediction is a critical problem in Information Retrieval systems. This problem has increasingly been tackled using (pretrained) transformer-based models which are finetuned using large collections of labeled data. However, in specialized domains such as e-commerce and healthcare, the viability of this approach is limited by the dearth of large in-domain data. To address this paucity, recent methods leverage these powerful models to generate high-quality task and domain-specific synthetic data. Prior work has largely explored synthetic data generation or query generation (QGen) for Question-Answering (QA) and binary (yes/no) relevance prediction, where for instance, the QGen models are given a document, and trained to generate a query relevant to that document. However, in many problems, we have a more fine-grained notion of relevance than a simple yes/no label. Thus, in this work, we conduct a detailed study into how QGen approaches can be leveraged for nuanced relevance prediction. We demonstrate that – contrary to claims from prior works – current QGen approaches fall short of the more conventional cross-domain transfer-learning approaches. Via empirical studies spanning three public e-commerce benchmarks, we identify new shortcomings of existing QGen approaches – including their inability to distinguish between different grades of relevance. To address this, we introduce label-conditioned QGen models which incorporate knowledge about the different grades of relevance. While our experiments demonstrate that these modifications help improve the performance of QGen techniques, we also find that QGen approaches struggle to capture the full nuance of the relevance label space and, as a result, the generated queries are not faithful to the desired relevance label. View details
    Generative Information Retrieval (abstract)
    Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (2023), pp. 1
    Preview abstract Historically, information retrieval systems have all followed the same paradigm: information seekers frame their needs in the form of a short query, the system selects a small set of relevant results from a corpus of available documents, rank-orders the results by decreasing relevance, possibly excerpts a responsive passage for each result, and returns a list of references and excerpts to the user. Retrieval systems typically did not attempt fusing information from multiple documents into an answer and displaying that answer directly. This was largely due to available technology: at the core of each retrieval system is an index that maps lexical tokens or semantic embeddings to document identifiers. Indices are designed for retrieving responsive documents; they do not support integrating these documents into a holistic answer. More recently, the coming-of-age of deep neural networks has dramatically improved the capabilities of large language models (LLMs). Trained on a large corpus of documents, these models not only memorize the vocabulary, morphology and syntax of human languages, but have shown to be able to memorize facts and relations [HowMuchKnowledge]. Generative language models, when provided with a prompt, will extend the prompt with likely completions -- an ability that can be used to extract answers to questions from the model. Two years ago, Metzler et al. argued that this ability of LLMs will allow us to rethink the search paradigm: to answer information needs directly rather than directing users to responsive primary sources [RethinkingSearch]. Their vision was not without controversy; the following year Shaw and Bender argued that such a system is neither feasible nor desirable [SituatingSearch]. Nonetheless, the past year has seen the emergence of such systems, with offerings from established search engines and multiple new entrants to the industry. The keynote will briefly summarize the short history of these generative information retrieval systems, and focus on the many open challenges in this emerging field: ensuring that answers are grounded, attributing answer passages to a primary source, providing nuanced answers to non-factoid-seeking questions, avoiding bias, and going beyond simple regurgitation of memorized facts. It will also touch on the changing nature of the content ecosystem. LLMs are starting to be used to generate web content. Should search engines treat such derived content equal to human-authored content? Is it possible to distinguish generated from original content? How should we view hybrid authorship where humans contribute ideas and LLMs shape these ideas into prose? And how will this parallel technical evolution of search engines and content ecosystems affect their respective business models? View details
    End-to-End Query Term Weighting
    Karan Samel
    Swaraj Khadanga
    Wensong Xu
    Xingyu Wang
    Kashyap Kolipaka
    Proceedings of the 29th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '23) (2023)
    Preview abstract Bag-of-words based lexical retrieval systems are still the most commonly used methods for real-world search applications. Recently deep learning methods have shown promising results to improve this retrieval performance but are expensive to run in an online fashion, non-trivial to integrate into existing production systems, and might not generalize well in out-of-domain retrieval scenarios. Instead, we build on top of lexical retrievers by proposing a Term Weighting BERT (TW-BERT) model. TW-BERT learns to predict the weight for individual n-gram (e.g., uni-grams and bi-grams) query input terms. These inferred weights and terms can be used directly by a retrieval system to perform a query search. To optimize these term weights, TW-BERT incorporates the scoring function used by the search engine, such as BM25, to score query-document pairs. Given sample query-document pairs we can compute a ranking loss over these matching scores, optimizing the learned query term weights in an end-to-end fashion. Aligning TW-BERT with search engine scorers minimizes the changes needed to integrate it into existing production applications, whereas existing deep learning based search methods would require further infrastructure optimization and hardware requirements. The learned weights can be easily utilized by standard lexical retrievers and by other retrieval techniques such as query expansion. We show that TW-BERT improves retrieval performance over strong term weighting baselines within MSMARCO and in out-of-domain retrieval on TREC datasets. View details
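    As a rough illustration of how such learned term weights plug into a lexical scorer, the sketch below scores a document with a BM25-style function in which each query term contributes in proportion to a weight that a TW-BERT-style model would predict. The weights, corpus statistics, and simplified scoring here are hypothetical; the paper integrates with the production scoring function directly.

      import math
      from collections import Counter

      def weighted_bm25(query_weights, doc_tokens, doc_len, avg_doc_len,
                        df, n_docs, k1=1.2, b=0.75):
          # Standard BM25 term saturation and IDF, but each query term is
          # scaled by its learned weight instead of an implicit weight of 1.
          tf = Counter(doc_tokens)
          score = 0.0
          for term, weight in query_weights.items():
              if tf[term] == 0:
                  continue
              idf = math.log(1 + (n_docs - df.get(term, 0) + 0.5) / (df.get(term, 0) + 0.5))
              sat = tf[term] * (k1 + 1) / (tf[term] + k1 * (1 - b + b * doc_len / avg_doc_len))
              score += weight * idf * sat
          return score

      # Hypothetical predicted weights for the query "running shoes women".
      query_weights = {"running": 0.9, "shoes": 1.4, "women": 0.6}
      doc = "women running shoes on sale lightweight shoes".split()
      print(weighted_bm25(query_weights, doc, len(doc), avg_doc_len=8.0,
                          df={"running": 120, "shoes": 300, "women": 500},
                          n_docs=10000))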
    Search and Discovery in Personal Email Collections (Tutorial Proposal)
    Proceedings of the 15th ACM International Conference on Web Search and Data Mining (2022), 1617–1619
    Preview abstract Email has been an essential communication medium for many years. As a result, the information accumulated in our mailboxes has become valuable for all of our personal and professional activities. For years, researchers have developed interfaces, models, and algorithms to facilitate email search, discovery, and organization. This tutorial brings together these diverse research directions and provides both a historical background, as well as a high-level overview of the recent advances in the field. In particular, we lay out all of the components needed in the design of email search engines, including user interfaces, indexing, document and query understanding, retrieval, ranking, evaluation, and data privacy. The tutorial also goes beyond search, presenting recent work on intelligent task assistance in email and a number of interesting future directions. View details
    Scale Calibration of Deep Ranking Models
    28TH ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) (2022), pp. 4300-4309
    Preview abstract Learning-to-Rank (LTR) systems are ubiquitous in web applications nowadays. The existing literature mainly focuses on improving ranking performance by trying to generate the optimal order of candidate items. However, virtually all advanced ranking functions are not scale calibrated. For example, rankers have the freedom to add a constant to all item scores without changing their relative order. This property has resulted in several limitations in deploying advanced ranking methods in practice. On the one hand, it limits the use of effective ranking functions in important applications. For example, in ads ranking, predicted Click-Through Rate (pCTR) is used for ranking and is required to be calibrated for the downstream ads auction. This is a major reason that existing ads ranking methods use scale calibrated pointwise loss functions that may sacrifice ranking performance. On the other hand, popular ranking losses are translation-invariant. We rigorously show that, both theoretically and empirically, this property leads to training instability that may cause severe practical issues. In this paper, we study how to perform scale calibration of deep ranking models to address the above concerns. We design three different formulations to calibrate ranking models through calibrated ranking losses. Unlike existing post-processing methods, our calibration is performed during training, which can resolve the training instability issue without any additional processing. We conduct experiments on the standard LTR benchmark datasets and one of the largest sponsored search ads dataset from Google. Our results show that our proposed calibrated ranking losses can achieve nearly optimal results in terms of both ranking quality and score scale calibration. View details
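    The translation-invariance issue mentioned above is easy to see numerically. The snippet below uses the listwise softmax cross-entropy as a representative ranking loss (assumed here for illustration; the paper studies several losses) and shows that shifting every score by a constant leaves the loss unchanged, which is exactly why ranking quality alone does not pin down a calibrated score scale.

      import numpy as np

      def softmax_xent_ranking_loss(scores, labels):
          log_softmax = scores - np.log(np.sum(np.exp(scores)))
          return -np.sum(labels * log_softmax)

      scores = np.array([1.0, 0.2, -0.7])
      labels = np.array([1.0, 0.0, 0.0])

      # Adding a constant to all scores changes neither the ordering nor
      # the loss value -- the score scale is unconstrained.
      print(softmax_xent_ranking_loss(scores, labels))
      print(softmax_xent_ranking_loss(scores + 100.0, labels))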
    Preview abstract Pre-trained language model (e.g., BERT) based deep retrieval models have achieved superior performance over lexical retrieval models (e.g., BM25) in many passage retrieval tasks. However, limited work has been done to generalize a deep retrieval model to other tasks and domains. In this work, we carefully select five datasets, including two in-domain datasets and three out-of-domain datasets with different levels of domain shift, and study the generalization of a deep model in a zero-shot setting. Our findings show that the performance of a deep retrieval model deteriorates significantly when the target domain is very different from the source domain that the model was trained on. On the contrary, lexical models are more robust across domains. We thus propose a simple yet effective framework to integrate lexical and deep retrieval models. Our experiments demonstrate that these two models are complementary, even when the deep model is weaker in the out-of-domain setting. The combined model obtains an average of 20.4% relative gain over the deep retrieval model, and an average of 9.54% over the lexical model on three out-of-domain datasets. View details
    Preview abstract Multiclass classification (MCC) is a fundamental machine learning problem of classifying each instance into one of a predefined set of classes. Given an instance, an MCC model computes a score for each class, all of which are used to sort the classes. The performance of a model is usually measured by Top-K Accuracy/Error (e.g., K=1 or 5). In this paper, we do not aim to propose new neural network architectures as most recent works do, but to show that it is promising to boost MCC performance with a novel formulation through the lens of ranking. In particular, by viewing MCC as an instance-class ranking problem, we first argue that ranking metrics, such as Normalized Discounted Cumulative Gain, can be more informative than the existing Top-K metrics. We further demonstrate that the dominant neural MCC recipe can be transformed into a neural ranking pipeline. Based on such a generalization, we show that it is intuitive to leverage techniques from the rich information retrieval literature to improve MCC performance out of the box. Extensive empirical results on both text and image classification tasks with diverse datasets and backbone neural models show the value of our proposed framework. View details
    Preview abstract We explore a novel perspective of knowledge distillation (KD) for learning to rank (LTR), and introduce Self-Distilled neural Rankers (SDR), where student rankers are parameterized identically to their teachers. Unlike the existing ranking distillation work which pursues a good trade-off between performance and efficiency, SDR is able to significantly improve ranking performance of students over the teacher rankers without increasing model capacity. The key success factors of SDR, which differs from common distillation techniques for classification are: (1) an appropriate teacher score transformation function, and (2) a novel listwise distillation framework. Both techniques are specifically designed for ranking problems and are rarely studied in the existing knowledge distillation literature. Building upon the state-of-the-art neural ranking structure, SDR is able to push the limits of neural ranking performance above a recent rigorous benchmark study and significantly outperforms traditionally strong gradient boosted decision tree based models on 7 out of 9 key metrics, the first time in the literature. In addition to the strong empirical results, we give theoretical explanations on why listwise distillation is effective for neural rankers, and provide ablation studies to verify the necessity of the key factors in the SDR framework. View details
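    The abstract names a teacher score transformation and a listwise distillation framework without giving formulas. The sketch below shows the generic listwise idea -- matching the student's per-query distribution over documents to the teacher's -- with the transformation reduced to a simple temperature; this is an assumption for illustration, not the SDR recipe itself.

      import numpy as np

      def softmax(x):
          x = x - np.max(x)
          e = np.exp(x)
          return e / np.sum(e)

      def listwise_distillation_loss(student_scores, teacher_scores, temperature=2.0):
          # Cross-entropy between teacher and student listwise distributions
          # over the documents retrieved for one query.
          teacher_dist = softmax(np.asarray(teacher_scores) / temperature)
          student_log_dist = np.log(softmax(np.asarray(student_scores)))
          return -np.sum(teacher_dist * student_log_dist)

      teacher = [3.1, 0.4, 1.2]   # scores from the identically parameterized teacher
      student = [2.0, 0.1, 0.9]
      print(listwise_distillation_loss(student, teacher))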
    Rax: Composable Learning-to-Rank using JAX
    Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2022), 3051–3060
    Preview abstract Rax is a library for composable Learning-to-Rank (LTR) written entirely in JAX. The goal of Rax is to facilitate easy prototyping of LTR systems by leveraging the flexibility and simplicity of JAX. Rax provides a diverse set of popular ranking metrics and losses that integrate well with the rest of the JAX ecosystem. Furthermore, Rax implements a system of ranking-specific function transformations which allows fine-grained customization of ranking losses and metrics. Most notably Rax provides approx_t12n: a function transformation (t12n) that can transform any of our ranking metrics into an approximate and differentiable form that can be optimized. This provides a systematic way to directly optimize neural ranking models for ranking metrics that are not easily optimizable in other libraries. We empirically demonstrate the effectiveness of Rax by benchmarking neural models implemented using Flax and trained using Rax on two popular LTR benchmarks: WEB30K and Istella. Furthermore, we show that integrating ranking losses with T5, a large language model, can improve overall ranking performance on the MS MARCO passage ranking task. We are sharing the Rax library with the open source community as part of the larger JAX ecosystem at https://github.com/google/rax. View details
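    A minimal usage example, adapted from the library's public repository (https://github.com/google/rax); exact function signatures may differ across releases, so treat this as a sketch rather than reference documentation.

      import jax.numpy as jnp
      import rax

      scores = jnp.asarray([2.2, 1.3, 5.4])   # model scores for one query
      labels = jnp.asarray([1.0, 0.0, 0.0])   # graded relevance labels

      loss = rax.softmax_loss(scores, labels)    # a standard listwise loss
      metric = rax.ndcg_metric(scores, labels)   # a standard ranking metric

      # approx_t12n transforms the (non-differentiable) metric into an
      # approximate, differentiable loss that can be optimized directly.
      approx_ndcg_loss_fn = rax.approx_t12n(rax.ndcg_metric)
      approx_loss = approx_ndcg_loss_fn(scores, labels)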
    Revisiting two tower models for unbiased learning to rank
    Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2022), 2410–2414
    Preview abstract The two-tower architecture (with one tower to factorize out position-related bias) has now become a common technique in neural network ranking models for Unbiased Learning To Rank (ULTR). In these models, a neural network tower taking in all position-related features is designed to model the biases, which are equivalent to the propensity scores used to define the unbiased ranking metrics. It works based on the assumptions that the user interaction (click) is conditioned on the user's observation of a ranked item, and that only the observation probability depends on the position. So if we factorize out the observation probability, we can then rank the items without bias by their click rate conditioned on observation. The assumption appears sensible, and the additive two-tower models based on it have been widely implemented in ULTR. However, two-tower models may not always work, and sometimes work even worse than the biased models, as users may not always follow the same pattern. In this work, we stick to the plausible assumption about the user interaction, but we also consider the spectrum of different user behaviors. In this case, the position-related observation probability may not be explicitly factorizable. We also study generic methods to treat this complexity and show that these methods can outperform the simple additive debiasing models in offline experiments. View details
    On Optimizing Top-K Metrics for Neural Ranking Models
    Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2022), 2303–2307
    Preview abstract Top-K metrics such as NDCG@K are frequently used to evaluate ranking performance. The traditional tree-based models such as LambdaMART, which are based on Gradient Boosted Decision Trees (GBDT), are designed to optimize NDCG@K using the LambdaRank losses. Recently, there is a good amount of research interest on neural ranking models for learning-to-rank tasks. These models are fundamentally different from the decision tree models and behave differently with respect to different loss functions. For example, the most popular ranking losses used in neural models are the Softmax loss and the GumbelApproxNDCG loss. These losses do not connect to top-K metrics such as NDCG@K naturally. It remains a question on how to effectively optimize NDCG@K for neural ranking models. In this paper, we follow the LambdaLoss framework and design novel and theoretically sound losses for NDCG@K metrics, while the original LambdaLoss paper can only do so using an unsound heuristic. We study the new losses on the LETOR benchmark datasets and show that the new losses work better than other losses for neural ranking models. View details
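    For reference, the target metric can be computed as follows; this uses the common exponential-gain, log-discount formulation of NDCG@K, sketched in plain numpy.

      import numpy as np

      def dcg_at_k(relevances, k):
          rel = np.asarray(relevances, dtype=float)[:k]
          discounts = np.log2(np.arange(2, rel.size + 2))
          return np.sum((2.0 ** rel - 1.0) / discounts)

      def ndcg_at_k(scores, relevances, k):
          # Sort documents by predicted score, compute DCG over the top K,
          # and normalize by the DCG of the ideal (label-sorted) ordering.
          order = np.argsort(scores)[::-1]
          ranked_rel = np.asarray(relevances, dtype=float)[order]
          ideal = dcg_at_k(np.sort(relevances)[::-1], k)
          return dcg_at_k(ranked_rel, k) / ideal if ideal > 0 else 0.0

      print(ndcg_at_k(scores=[0.3, 2.1, -0.5, 1.0], relevances=[2, 0, 1, 3], k=2))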
    Preview abstract Extracting structured information from templatic documents is an important problem with the potential to automate many real-world business workflows such as payment, procurement, and payroll. The core challenge is that such documents can be laid out in virtually infinitely different ways. A good solution to this problem is one that generalizes well not only to known templates such as invoices from a known vendor, but also to unseen ones. We developed a system called Glean to tackle this problem. Given a target schema for a document type and some labeled documents of that type, Glean uses machine learning to automatically extract structured information from other documents of that type. In this paper, we describe the overall architecture of Glean, and discuss three key data management challenges : 1) managing the quality of ground truth data, 2) generating training data for the machine learning model using labeled documents, and 3) building tools that help a developer rapidly build and improve a model for a given document type. Through empirical studies on a real-world dataset, we show that these data management techniques allow us to train a model that is over 5 F1 points better than the exact same model architecture without the techniques we describe. We argue that for such information-extraction problems, designing abstractions that carefully manage the training data is at least as important as choosing a good model architecture. View details
    Ensemble Distillation for BERT-Based Ranking Models
    Shuguang Han
    Proceedings of the 2021 ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR ’21)
    Preview abstract Over the past two years, large pretrained language models such as BERT have been applied to text ranking problems and showed superior performance on multiple public benchmark data sets. Prior work demonstrated that an ensemble of multiple BERT-based ranking models can not only boost the performance, but also reduce the performance variance. However, an ensemble of models is more costly because it needs computing resource and/or inference time proportional to the number of models. In this paper, we study how to retain the performance of an ensemble of models at the inference cost of a single model by distilling the ensemble into a single BERT-based student ranking model. Specifically, we study different designs of teacher labels, various distillation strategies, as well as multiple distillation losses tailored for ranking problems. We conduct experiments on the MS MARCO passage ranking and the TREC-COVID data set. Our results show that even with these simple distillation techniques, the distilled model can effectively retain the performance gain of the ensemble of multiple models. More interestingly, the performances of distilled models are also more stable than models fine-tuned on original labeled data. The results reveal a promising direction to capitalize on the gains achieved by an ensemble of BERT-based ranking models. View details
    Preview abstract The content on the web is in a constant state of flux. New entities, issues, and ideas continuously emerge, while the semantics of the existing conversation topics gradually shift. In recent years, pretrained language models like BERT greatly improved the state-of-the-art for a large spectrum of content understanding tasks. Therefore, in this paper, we aim to study how these language models can be adapted to better handle continuously evolving web content. In our study, we first analyze the evolution of 2013 – 2019 Twitter data, and unequivocally confirm that a BERT model trained on past tweets would heavily deteriorate when directly applied to data from later years. Then, we investigate two possible sources of the deterioration: the semantic shift of existing tokens and the sub-optimal or failed understanding of new tokens. To this end, we both explore two different vocabulary composition methods, as well as propose three sampling methods which help in efficient incremental training for BERT-like models. Compared to a new model trained from scratch offline, our incremental training (a) reduces the training costs, (b) achieves better performance on evolving content, and (c) is suitable for online deployment. The superiority of our methods is validated using two downstream tasks. We demonstrate significant improvements when incrementally evolving the model from a particular base year, on the task of Country Hashtag Prediction, as well as on the OffensEval 2019 task. View details
    Preview abstract When experiencing an information need, users want to engage with a domain expert, but often turn to an information retrieval system, such as a search engine, instead. Classical information retrieval systems do not answer information needs directly, but instead provide references to (hopefully authoritative) answers. Successful question answering systems offer a limited corpus created on-demand by human experts, which is neither timely nor scalable. Pre-trained language models, by contrast, are capable of directly generating prose that may be responsive to an information need, but at present they are dilettantes rather than domain experts -- they do not have a true understanding of the world, they are prone to hallucinating, and crucially they are incapable of justifying their utterances by referring to supporting documents in the corpus they were trained over. This paper examines how ideas from classical information retrieval and pre-trained language models can be synthesized and evolved into systems that truly deliver on the promise of domain expert advice. View details
    WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning
    Jiecao Chen
    Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '21) (2021)
    Preview abstract The milestone improvements brought about by deep representation learning and pre-training techniques have led to large performance gains across downstream NLP, IR and Vision tasks. Multimodal modeling techniques aim to leverage high-quality visio-linguistic datasets for learning complementary information (across image and text modalities). In this paper, we introduce the Wikipedia-based Image Text (WIT) Dataset to better facilitate multimodal, multilingual learning. WIT is composed of 11 million+ unique images with over 37 million entity rich text descriptions associated with these images in Wikipedia from over 100 languages. Its size enables WIT to be used as a pretraining dataset for multimodal models, as we show when applied to downstream tasks such as image-text retrieval. WIT has four main and unique advantages. First, WIT is the largest multimodal dataset (at the time of writing). Second, it is massively multilingual (first of its kind) with coverage over 100+ languages (each of which has at least 10K examples) and provides cross-lingual texts for many images. Third, it represents a more diverse set of concepts and real world entities relative to what previous datasets cover. Lastly, as we demonstrate empirically, WIT provides a very challenging real-world test set that empirically highlights the need for learning improvements in tasks such as Retrieval and Captioning. View details
    Preview abstract Despite the success of neural models in many major machine learning problems and recently published neural learning to rank (LTR) papers in top venues, the effectiveness of neural models on traditional LTR problems is still not widely acknowledged. We first validate the concern by showing that most recent neural LTR models are, by a large margin, inferior to the best publicly available tree-based implementation, which is sometimes ignored in recent neural LTR papers. We then investigate why existing neural LTR suffers by identifying several of its weaknesses. To that end, we propose a new neural LTR framework that mitigates these weaknesses, by borrowing ideas from several research fields. Our models are able to perform comparatively with the strong tree-based baseline, while outperforming recently published neural learning to rank methods by a large margin. Our results also serve as a benchmark for neural learning to rank models. View details
    Preview abstract We describe how we built three recommendation products from scratch at the Google Chrome Web Store, namely context-based recommendations, related extension recommendations, and personalized recommendations. Unlike most existing papers that focus on novel algorithms, this paper focuses on sharing practical experiences building large scale recommender systems under various real-world constraints, such as privacy constraints, data sparsity issues, highly skewed data distribution, and product design choices, such as user interface. We show how these constraints make standard approaches difficult to succeed in practice. We share success stories that turn very negative live metrics into very positive ones, by introducing 1) how we use interpretable neural models to bootstrap the systems, which helps identify pipeline issues and paves the way for more advanced models; 2) a new item-item based recommendation algorithm that works under highly skewed data distributions; and 3) how two products can help bootstrap the third one, which significantly reduces development cycles and bypasses various real-world difficulties. All the explorations in this work are verified in live traffic on millions of users. We believe the findings in this work can help practitioners to bootstrap and build large-scale recommender systems. View details
    Preview abstract Automating information extraction from form-like documents at scale is a pressing need due to its potential impact on automating business workflows across many industries like financial services, insurance, and healthcare. The key challenge is that form-like documents in these business workflows can be laid out in virtually infinitely many ways; hence, a good solution to this problem should generalize to documents with unseen layouts and languages. A solution to this problem requires a holistic understanding of both the textual segments and the visual cues within a document, which is non-trivial. While the natural language processing and computer vision communities are starting to tackle this problem, there has not been much focus on (1) data-efficiency, and (2) ability to generalize across different document types and languages. In this paper, we show that when we have only a small number of labeled documents for training (~50), a straightforward transfer learning approach from a considerably structurally-different larger labeled corpus yields up to a 27 F1 point improvement over simply training on the small corpus in the target domain. We improve on this with a simple multi-domain transfer learning approach, that is currently in production use, and show that this yields up to a further 8 F1 point improvement. We make the case that data efficiency is critical to enable information extraction systems to scale to handle hundreds of different document-types, and learning good representations is critical to accomplishing this. View details
    Natural Language Understanding with Privacy-Preserving BERT
    Proceedings of the 30th ACM International Conference on Information and Knowledge Management, ACM (2021)
    Preview abstract Privacy preservation remains a key challenge in data mining and Natural Language Understanding (NLU). Previous research shows that the input text or even text embeddings can leak private information. This concern motivates our research on effective privacy preservation approaches for pretrained Language Models (LMs). We investigate the privacy and utility implications of applying dχ-privacy, a variant of Local Differential Privacy, to BERT fine-tuning in NLU applications. More importantly, we further propose privacy-adaptive LM pretraining methods and show that our approach can boost the utility of BERT dramatically while retaining the same level of privacy protection. We also quantify the level of privacy preservation and provide guidance on privacy configuration. Our experiments and findings lay the groundwork for future explorations of privacy-preserving NLU with pretrained LMs. View details
    Improving Cloud Storage Search with User Activity
    Proceedings of the 14th International Conference on Web Search and Data Mining (WSDM '21), ACM (2021)
    Preview abstract Cloud-based file storage platforms such as Google Drive are widely used as a means for storing, editing and sharing personal and organizational documents. In this paper, we improve search ranking quality for cloud storage platforms by utilizing user activity logs. Different from search logs, activity logs capture general document usage activity beyond search, such as opening, editing and sharing documents. We propose to automatically learn text embeddings that are effective for search ranking from activity logs. We develop a novel co-access signal, i.e., whether two documents were accessed by a user around the same time, to train deep semantic matching models that are useful for improving the search ranking quality. We confirm that activity-trained semantic matching models can improve ranking by conducting extensive offline experimentation using Google Drive search and activity logs. To the best of our knowledge, this is the first work to examine the benefits of leveraging document usage activity at large scale for cloud storage search; as such it can shed light on using such activity in scenarios where direct collection of search-specific interactions (e.g., query and click logs) may be expensive or infeasible. View details
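    The co-access signal lends itself to a very simple extraction step. The sketch below counts, for every document pair, how many users accessed both documents within a time window; the log schema, field names, and window size are hypothetical, chosen only to illustrate the idea.

      from collections import defaultdict
      from itertools import combinations

      # Hypothetical activity log entries: (user_id, doc_id, timestamp_seconds).
      activity_log = [
          ("u1", "docA", 1000), ("u1", "docB", 1200), ("u1", "docC", 9000),
          ("u2", "docA", 1100), ("u2", "docB", 1150),
      ]

      def co_access_pairs(log, window=3600):
          # A pair (d1, d2) is co-accessed by a user if that user touched
          # both documents within `window` seconds; counts across users can
          # serve as weak relatedness labels for training a matching model.
          by_user = defaultdict(list)
          for user, doc, ts in log:
              by_user[user].append((ts, doc))
          counts = defaultdict(int)
          for events in by_user.values():
              events.sort()
              for (t1, d1), (t2, d2) in combinations(events, 2):
                  if d1 != d2 and abs(t2 - t1) <= window:
                      counts[tuple(sorted((d1, d2)))] += 1
          return dict(counts)

      print(co_access_pairs(activity_log))   # {('docA', 'docB'): 2}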
    Scalable Hierarchical Agglomerative Clustering
    Nick Monath
    Avinava Dubey
    Guru Prashanth Guruganesh
    Andrew McCallum
    Gokhan Mergen
    Mert Terzihan
    Bryon Tjanaka
    Yuchen Wu
    Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2021), 1245–1255
    Preview abstract The applicability of agglomerative clustering, for inferring both hierarchical and flat clustering, is limited by its scalability. Existing scalable hierarchical clustering methods sacrifice quality for speed and often lead to over-merging of clusters. In this paper, we present a scalable, agglomerative method for hierarchical clustering that does not sacrifice quality and scales to billions of data points. We perform a detailed theoretical analysis, showing that under mild separability conditions our algorithm can not only recover the optimal flat partition but also provide a two-approximation to non-parametric DP-Means objective [32]. This introduces a novel application of hierarchical clustering as an approximation algorithm for the non-parametric clustering objective. We additionally relate our algorithm to the classic hierarchical agglomerative clustering method. We perform extensive empirical experiments in both hierarchical and flat clustering settings and show that our proposed approach achieves state-of-the-art results on publicly available clustering benchmarks. Finally, we demonstrate our method’s scalability by applying it to a dataset of 30 billion queries. Human evaluation of the discovered clusters show that our method finds better quality of clusters than the current state-of-the-art. View details
    Preview abstract How to leverage cross-document interactions to improve ranking performance is an important topic in information retrieval research. The recent developments in deep learning show strength in modeling complex relationships across sequences and sets. It thus motivates us to study how to leverage cross-document interactions for learning-to-rank in the deep learning framework. In this paper, we formally define the permutation equivariance requirement for a scoring function that captures cross-document interactions. We then propose a self-attention based document interaction network that extends any univariate scoring function with contextual features capturing cross-document interactions. We show that it satisfies the permutation equivariance requirement, and can generate scores for document sets of varying sizes. Our proposed methods can automatically learn to capture document interactions without any auxiliary information, and can scale across large document sets. We conduct experiments on four ranking datasets: the public benchmarks WEB30K and Istella, as well as Gmail search and Google Drive Quick Access datasets. Experimental results show that our proposed methods lead to significant quality improvements over state-of-the-art neural ranking models, and are competitive with state-of-the-art gradient boosted decision tree (GBDT) based models on the WEB30K dataset. View details
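    The permutation-equivariance property is the key requirement here, and a single shared self-attention layer satisfies it: permuting the input documents permutes the output scores identically. The toy numpy check below illustrates only that property, not the paper's full document interaction network.

      import numpy as np

      rng = np.random.default_rng(0)
      d = 8
      Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))
      w_out = rng.normal(size=d)

      def softmax(x, axis=-1):
          x = x - x.max(axis=axis, keepdims=True)
          e = np.exp(x)
          return e / e.sum(axis=axis, keepdims=True)

      def score_documents(X):
          # X: (n_docs, d) feature vectors for one query's candidate list.
          # Self-attention mixes in cross-document context; a shared linear
          # head then produces one score per document.
          Q, K, V = X @ Wq, X @ Wk, X @ Wv
          attn = softmax(Q @ K.T / np.sqrt(d))
          contextual = X + attn @ V
          return contextual @ w_out

      X = rng.normal(size=(5, d))
      perm = rng.permutation(5)
      # Permuting the inputs permutes the scores the same way (equivariance).
      print(np.allclose(score_documents(X)[perm], score_documents(X[perm])))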
    Preview abstract Consider a sequential active learning problem where, at each round, an agent selects a batch of unlabeled data points, queries their labels and updates a binary classifier. While there exists a rich body of work on active learning in this general form, in this paper, we focus on problems with two distinguishing characteristics: severe class imbalance (skew) and small amounts of training data. Both of these problems occur with surprising frequency in many web applications. For instance, detecting offensive or sensitive content in online communities (pornography, violence, and hate-speech) is receiving enormous attention from industry as well as research communities. Such problems have both the characteristics we describe -- the vast majority of content is not offensive, so the number of positive examples for such content is orders of magnitude smaller than the number of negative examples. Further, there is usually only a small amount of initial training data available when building machine-learned models to solve such problems. To address both these issues, we propose a hybrid active learning algorithm (HAL) that balances exploiting the knowledge available through the currently labeled training examples with exploring the large amount of unlabeled data available. Through simulation results, we show that HAL makes significantly better choices for what points to label when compared to strong baselines like margin-sampling. Classifiers trained on the examples selected for labeling by HAL easily outperform the baselines on target metrics (like recall at a high precision threshold and area under the precision-recall curve) given the same budget for labeling examples. We believe HAL offers a simple, intuitive, and computationally tractable way to structure active learning that can significantly amplify the impact (or alternately, reduce the cost) of human labeling for a wide range of web applications. View details
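    The abstract describes HAL as balancing exploitation (uncertainty around the current decision boundary) with exploration of the unlabeled pool, but does not give the mixing rule. The sketch below is one plausible instantiation -- part of each batch chosen by margin sampling, part uniformly at random -- offered purely as an illustration of the exploit/explore split, not as the paper's algorithm.

      import numpy as np

      def select_batch(unlabeled_probs, batch_size, explore_frac=0.5, seed=0):
          # Exploit: points whose predicted P(positive) is closest to 0.5
          # (margin sampling). Explore: uniform random picks from the rest.
          rng = np.random.default_rng(seed)
          probs = np.asarray(unlabeled_probs)
          margins = np.abs(probs - 0.5)
          n_exploit = int(batch_size * (1 - explore_frac))
          exploit = np.argsort(margins)[:n_exploit]
          rest = np.setdiff1d(np.arange(len(probs)), exploit)
          explore = rng.choice(rest, size=batch_size - n_exploit, replace=False)
          return np.concatenate([exploit, explore])

      probs = np.array([0.01, 0.48, 0.97, 0.55, 0.03, 0.02, 0.51, 0.04])
      print(select_batch(probs, batch_size=4))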
    Feature Transformation for Neural Ranking Models
    Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2020), pp. 1649-1652
    Preview abstract Although neural network models enjoy tremendous advantages in handling image and text data, tree-based models still remain competitive for learning-to-rank tasks with numerical data. A major strength of tree-based ranking models is the insensitivity to different feature scales, while neural ranking models may suffer from features with varying scales or skewed distributions. Feature transformation or normalization is a simple technique which preprocesses input features to mitigate their potential adverse impact on neural models. However, due to lack of studies, it is unclear to what extent feature transformation can benefit neural ranking models. In this paper, we aim to answer this question by providing empirical evidence for learning-to-rank tasks. First, we present a list of commonly used feature transformation techniques and perform a comparative study on multiple learning-to-rank data sets. Then we propose a mixture feature transformation mechanism which can automatically derive a mixture of basic feature transformation functions to achieve the optimal performance. Our experiments show that applying feature transformation can substantially improve the performance of neural ranking models compared to directly using the raw features. In addition, the proposed mixture transformation method can further improve the performance of the ranking model without any additional human effort. View details
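    Two of the basic transformations such a study typically compares can be sketched in a few lines; the mixture mechanism proposed in the paper (a learned combination of basic transforms) is not reproduced here.

      import numpy as np

      def symmetric_log1p(x):
          # Compresses heavy-tailed numerical ranking features.
          return np.sign(x) * np.log1p(np.abs(x))

      def standardize(x, mean, std):
          return (x - mean) / (std + 1e-12)

      raw = np.array([0.0, 3.0, 250.0, 12000.0])   # e.g. raw count features
      print(symmetric_log1p(raw))
      print(standardize(raw, raw.mean(), raw.std()))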
    Preview abstract Search engines often follow a two-phase paradigm where in the first step an initial set of documents is retrieved (the retrieval step) and in the second step the documents are ranked so as to obtain the final result list (the re-ranking step). The focus of this paper is on improving the retrieval step (measured mainly by recall) using deep neural network-based approaches. While deep neural networks were shown to improve the performance of the re-ranking step, there is little literature about using deep neural networks to improve the retrieval step. Previous works on deep neural networks for IR usually apply a simple lexical retrieval model for the retrieval step (e.g., BM25) and emphasize the re-ranking step. In this paper, we propose and study a hybrid retrieval approach, which leverages both semantic (deep neural network based) and lexical (keyword matching based, like BM25) matching techniques. The main idea is to perform semantic and lexical retrieval in parallel, and then to combine the result lists to generate the initial result set for re-ranking. An empirical evaluation, using a public TREC collection, shows that result lists generated by the semantic retrieval model often contain a substantial number of relevant documents not covered by the lexical-based lists. Further analysis of these relevant documents shows that they often also exhibit different characteristics than the lexical-based documents, attesting to the complementary nature of the two approaches. Finally, the experiments show that by combining the two result lists, the recall of the result list can increase significantly, the retrieval step can be greatly improved, and these improvements are highly robust. View details
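    The abstract leaves the list-combination step unspecified, so the sketch below uses reciprocal rank fusion, a common and deliberately simple choice, to merge a lexical result list with a semantic one into a single candidate set for re-ranking; the fusion method and example ids are assumptions, not the paper's combination scheme.

      from collections import defaultdict

      def reciprocal_rank_fusion(result_lists, k=60, top_n=10):
          # Each list contributes 1 / (k + rank) to a document's fused score.
          fused = defaultdict(float)
          for ranking in result_lists:
              for rank, doc_id in enumerate(ranking, start=1):
                  fused[doc_id] += 1.0 / (k + rank)
          return sorted(fused, key=fused.get, reverse=True)[:top_n]

      lexical = ["d3", "d1", "d7", "d2"]    # e.g. BM25 results
      semantic = ["d9", "d3", "d2", "d5"]   # e.g. dual-encoder results
      print(reciprocal_rank_fusion([lexical, semantic]))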
    A Stochastic Treatment of Learning to Rank Scoring Functions
    Sebastian Bruch
    Shuguang Han
    Proceedings of the 13th ACM International Conference on Web Search and Data Mining (WSDM 2020), pp. 61-69
    Preview abstract Learning to Rank, a central problem in information retrieval, is a class of machine learning algorithms that formulate ranking as an optimization task. The objective is to learn a function that produces an ordering of a set of objects in such a way that the utility of the entire ordered list is maximized. Learning-to-rank methods do so by constructing a function that computes a score for each object in the set. A ranked list is then compiled by sorting objects according to their scores. While such a deterministic mapping of scores to permutations makes sense during inference where stability of ranked lists is required, we argue that its greedy nature during training leads to less robust models. This is particularly problematic when the loss function under optimization---in agreement with ranking metrics---only penalizes incorrect rankings and does not take into account the distribution of raw scores. In this work, we present a stochastic framework where, instead of a deterministic derivation of permutations from raw scores, permutations are sampled from a distribution defined by raw scores. Our proposed sampling method is differentiable and works well with gradient descent optimizers. We analytically study our proposed method and demonstrate when and why it leads to model robustness. We also show empirically, through experiments on publicly available learning-to-rank datasets, that the application of our proposed method to a class of ranking loss functions leads to significant model quality improvements. View details
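    One standard way to sample permutations from a distribution defined by raw scores is to perturb each score with Gumbel noise and sort; whether this matches the paper's exact sampler is an assumption, but it illustrates the stochastic treatment described above in a few lines.

      import numpy as np

      def sample_permutation(scores, rng):
          # Gumbel perturbation + argsort draws a ranking from the
          # Plackett-Luce distribution induced by the scores.
          gumbel = -np.log(-np.log(rng.uniform(size=scores.shape)))
          return np.argsort(-(scores + gumbel))

      rng = np.random.default_rng(0)
      scores = np.array([2.0, 0.5, 1.0])
      samples = [tuple(int(i) for i in sample_permutation(scores, rng)) for _ in range(5)]
      print(samples)   # mostly (0, 2, 1); other orders appear with lower probability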
    Representation Learning for Information Extraction from Form-like Documents
    Bodhisattwa Majumder
    Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), pp. 6495-6504
    Preview abstract We propose a novel approach using representation learning for tackling the problem of extracting structured information from form-like document images. We propose an extraction system that uses knowledge of the types of the target fields to generate extraction candidates, and a neural network architecture that learns a dense representation of each candidate based on neighboring words in the document. These learned representations are not only useful in solving the extraction task for unseen document templates from two different domains, but are also interpretable, as we show using loss cases. View details
    Preview abstract This paper describes a machine learning algorithm for document (re)ranking, in which queries and documents are first encoded using BERT [1], and, on top of that, a learning-to-rank (LTR) model constructed with TF-Ranking (TFR) [2] is applied to further optimize the ranking performance. This approach is shown to be effective on the public MS MARCO benchmark [3]. Our submissions achieve the best performance for the passage re-ranking task as of March 30, 2020 [4], and the second best performance for the passage full-ranking task as of April 10, 2020 [5], demonstrating the effectiveness of combining ranking losses with BERT representations for document ranking. View details
    Adversarial Bandits Policy for Crawling Commercial Web Content
    Shuguang Han
    Przemek Gajda
    Sergey Novikov
    Alexandrin Popescul
    Proceedings of the Web Conference 2020 (WWW 2020), pp. 407-417
    Preview abstract The rapid growth of commercial web content has driven the development of shopping search services to help users find product offers. Due to the dynamic nature of commercial content, an effective recrawl policy is a key component of a shopping search service; it ensures that users have access to up-to-date product details. Most existing strategies either rely on simple heuristics or overlook resource budgets. To address this, Azar et al. [5] recently proposed an optimization strategy, LambdaCrawl, aiming to maximize content freshness within a given resource budget. In this paper, we demonstrate that the effectiveness of LambdaCrawl is governed in large part by how well the future content change rate can be estimated. By adopting state-of-the-art deep learning models for change rate prediction, we obtain a substantial increase in content freshness over the common LambdaCrawl implementation with change rate estimated from past history. Moreover, we demonstrate that while LambdaCrawl is a significant advancement upon existing recrawl strategies, it can be further improved upon by a unified multi-strategy recrawl policy. To this end, we adopt the $K$-armed adversarial bandits algorithm that can provably optimize the overall freshness by combining multiple strategies. Empirical results over a large-scale production dataset confirm its superiority to LambdaCrawl, especially under tight resource budgets. View details
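    The abstract refers to a $K$-armed adversarial bandits algorithm for combining recrawl strategies; EXP3 is the textbook instance of that family, so the sketch below shows an EXP3-style update over hypothetical strategies. It is an illustration under that assumption, not the production system.

    import numpy as np

    def exp3_step(weights, rewards_fn, rng, gamma=0.1):
        # One EXP3 round: sample a recrawl strategy, observe its reward (e.g.,
        # freshness gained per unit of crawl budget, scaled to [0, 1]), and
        # update its weight multiplicatively with an importance-weighted estimate.
        k = len(weights)
        probs = (1 - gamma) * weights / weights.sum() + gamma / k
        arm = rng.choice(k, p=probs)
        reward = rewards_fn(arm)
        estimate = reward / probs[arm]
        weights[arm] *= np.exp(gamma * estimate / k)
        return weights, arm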
    Learning to Cluster Documents into Workspaces Using Large Scale Activity Logs
    Proceedings of the 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD ’20), ACM (2020), 2416–2424
    Preview abstract Google Drive is widely used for managing personal and work-related documents in the cloud. To help users organize their documents in Google Drive, we developed a new feature, called workspace, that lets users create a set of working files for ongoing easy access. A workspace is a cluster of documents, but unlike a typical document cluster, it contains documents that are not only topically coherent, but are also useful in the user's ongoing tasks. To alleviate the burden of creating workspaces manually, we automatically cluster documents into suggested workspaces. We go beyond the textual similarity-based unsupervised clustering paradigm and instead directly learn from users' activity for document clustering. More specifically, we extract co-access signals (i.e., whether a user accessed two documents around the same time) to measure document relatedness. We then use a neural document similarity model that incorporates text, metadata, as well as co-access features. Since human labels are often difficult or expensive to collect, we extract weak labels based on co-access data at large scale for model training. Our offline and online experiments based on Google Drive show that (a) co-access features are very effective for document clustering; (b) our weakly supervised clustering achieves comparable or even better performance than models trained with human labels; and (c) the weakly supervised method leads to workspace suggestions that users accept more often in the production system than those of baseline approaches. View details
    Migrating a Privacy-Safe Information Extraction System to a Software 2.0 Design
    Nguyen Ha Vo
    Proceedings of the 10th Annual Conference on Innovative Data Systems Research (2020)
    Preview abstract This paper presents a case study of migrating a privacy-safe information extraction system for Gmail from a traditional rule-based architecture to a machine-learned Software 2.0 architecture. The key idea is to use the extractions from the existing rule-based system as training data to learn ML models that in turn replace all the machinery for the rule-based system. The resulting system a) delivers better precision and recall, b) is significantly smaller in terms of lines of code, c) has been easier to maintain and improve, and d) has opened up the possibility of leveraging ML advances to build a cross-language extraction system even though our original training data was only in English. We describe challenges encountered during this migration around generation and management of training data, evaluation of models, and report on many traditional ``Software 1.0'' components we built to address them. View details
    Preview abstract Pre-trained models like BERT have dominated NLP / IR applications such as single sentence classification, text pair classification, and question answering. However, deploying these models in real systems is highly non-trivial due to their exorbitant computational costs. A common remedy to this is knowledge distillation, leading to faster inference. However, as we show here, existing works are not optimized for dealing with pairs (or tuples) of texts. Consequently, they are either not scalable or demonstrate subpar performance. In this work, we propose DiPair, a novel framework for distilling fast and accurate models on text pair tasks. Coupled with an end-to-end training strategy, DiPair is both highly scalable and offers improved quality-speed tradeoffs. Empirical studies conducted on both academic and real-world e-commerce benchmarks demonstrate the efficacy of the proposed approach, with speedups of over 350x and minimal quality drop relative to the cross-attention teacher BERT model. View details
    Preview abstract Many natural language processing and information retrieval problems can be formalized as the task of semantic matching. Existing work in this area has been largely focused on matching between short texts (e.g., question answering), or between a short and a long text (e.g., ad-hoc retrieval). Semantic matching between long-form documents, which has many important applications like news recommendation, related article recommendation and document clustering, is relatively less explored and needs more research effort. In recent years, self-attention based models like Transformers and BERT have achieved state-of-the-art performance in the task of text matching. These models, however, are still limited to short text like a few sentences or one paragraph due to the quadratic computational complexity of self-attention with respect to input text length. In this paper, we address the issue by proposing the Siamese Multi-depth Transformer-based Hierarchical (SMITH) Encoder for long-form document matching. Our model contains several innovations to adapt self-attention models for longer text input. We propose a transformer based hierarchical encoder to capture the document structure information. In order to better capture sentence level semantic relations within a document, we pre-train the model with a novel masked sentence block language modeling task in addition to the masked word language modeling task used by BERT. Our experimental results on several benchmark datasets for long-form document matching show that our proposed SMITH model outperforms the previous state-of-the-art models including hierarchical attention, the multi-depth attention-based hierarchical recurrent neural network, and BERT. Compared to BERT-based baselines, our model is able to increase the maximum input text length from 512 to 2048. We will open-source a Wikipedia-based benchmark dataset, code and a pre-trained checkpoint to accelerate future research on long-form document matching. View details
    Multi-view Embedding-based Synonyms for Personal Search
    Hongbo Deng
    Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '19) (2019), pp. 575-584
    Preview abstract Synonym expansion is a technique that adds related words to search queries, which may lead to more relevant documents being retrieved, thus improving recall. There is extensive prior work on synonym expansion for web search; however, very few studies have tackled its application to email search. Synonym expansion for private corpora like emails poses several unique research challenges. First, the emails are not shared across users, which precludes us from directly employing query-document bipartite graphs, which are standard in web search synonym expansion. Second, user search queries are of a personal nature, and may not generalize well across users. Third, the size of the underlying corpora from which the synonyms may be mined is relatively small (i.e., a user's private email inbox) compared to the size of the web corpus. Therefore, in this paper, we propose a solution tailored to the challenges of synonym expansion for email search. We formulate it as a multi-view learning problem, and propose a novel embedding-based model that joins information from multiple sources to obtain the optimal synonym candidates. To demonstrate the effectiveness of the proposed technique, we evaluate our model using both explicit human ratings as well as a live experiment using the Gmail Search service, one of the world's largest email search engines. View details
    An Analysis of the Softmax Cross Entropy Loss for Learning-to-Rank with Binary Relevance
    Sebastian Bruch
    Proceedings of the 2019 ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR 2019), pp. 75-78
    Preview abstract One of the challenges of learning-to-rank for information retrieval is that ranking metrics are not smooth and as such cannot be optimized directly with gradient descent optimization methods. This gap has given rise to a large body of research that reformulates the problem to fit into existing machine learning frameworks or defines a surrogate, ranking-appropriate loss function. One such loss is ListNet's, which measures the cross entropy between a distribution over documents obtained from scores and another from ground-truth labels. This loss was designed to capture permutation probabilities and as such is considered to be only loosely related to ranking metrics. In this work, however, we show that the above statement is not entirely accurate. In fact, we establish an analytical connection between softmax cross entropy and two popular ranking metrics in a learning-to-rank setup with binary relevance labels. In particular, we show that ListNet's loss bounds Mean Reciprocal Rank as well as Normalized Discounted Cumulative Gain. Our analysis sheds light on the behavior of that loss function and explains its superior performance on binary labeled data over data with graded relevance. View details
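    For reference, ListNet's softmax cross-entropy loss for a list with scores $s_i$ and binary relevance labels $y_i \in \{0,1\}$ is commonly written as below (the notation is ours); the bounds on MRR and NDCG are the paper's contribution and are not reproduced here.

    $$\ell(\mathbf{y}, \mathbf{s}) \;=\; -\sum_{i \,:\, y_i = 1} \frac{1}{\sum_j y_j}\,\log \frac{e^{s_i}}{\sum_j e^{s_j}}$$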
    Preview abstract Most consumer email in the world is machine-generated communication from a business to a human. Understanding the underlying templates that are used to instantiate these emails is a key step to enabling a variety of intelligent experiences. In this paper, we present the first description of the template-induction problem in an online setting for a planet-scale email system. While previous work has addressed the problem of discovering these templates using an offline batch job (perhaps architected as a MapReduce), discovering these templates online has several advantages. In this paper, we present the design of an online template induction system and describe the design choices we had to make. The resulting system handles online template induction over a stream of several billion emails a day. With the new system, new incoming email can be identified as belonging to a known template within minutes of discovering the template, compared to several days' worth of delay with the previous batch approach. Further, the online system has a resource consumption footprint that is 10x smaller than the batch approach. We also report on the surprising lesson we learned that conventional stream processing systems did not present a good framework on which to build this system. We hope that the lessons from this system help designers of future stream processing systems accommodate a broader range of applications like online template induction. View details
    TF-Ranking: Scalable TensorFlow Library for Learning-to-Rank
    Sebastian Bruch
    Rohan Anil
    Stephan Wolf
    Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) (2019), pp. 2970-2978
    Preview abstract Learning-to-Rank deals with maximizing the utility of a list of examples presented to the user, with items of higher relevance being prioritized. It has several practical applications such as large-scale search, recommender systems, document summarization and question answering. While there is widespread support for classification and regression based learning, support for learning-to-rank in deep learning has been limited. We propose TensorFlow Ranking, the first open source library for solving large-scale ranking problems in a deep learning framework. It is highly configurable and provides easy-to-use APIs to support different scoring mechanisms, loss functions and evaluation metrics in the learning-to-rank setting. Our library is developed on top of TensorFlow and can thus fully leverage the advantages of this platform. For example, it is highly scalable, both in training and in inference, and can be used to learn ranking models over massive amounts of user activity data, which can include heterogeneous dense and sparse features. We empirically demonstrate the effectiveness of our library in learning ranking functions for large-scale search and recommendation applications in Gmail and Google Drive. We also show that ranking models built using our library scale well for distributed training, without significant impact on metrics. The proposed library is available to the open source community, with the hope that it facilitates further academic research and industrial applications in the field of learning-to-rank. View details
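    As a small illustration of the kind of evaluation metric such a library exposes (this is not TF-Ranking's actual API, just a plain NumPy sketch), the snippet below computes NDCG@k from graded relevance labels ordered by model score.

    import numpy as np

    def dcg(labels, k):
        # Discounted cumulative gain of the first k labels, using the 2^rel - 1 gain.
        labels = np.asarray(labels, dtype=float)[:k]
        gains = 2.0 ** labels - 1.0
        discounts = 1.0 / np.log2(np.arange(2, labels.size + 2))
        return float((gains * discounts).sum())

    def ndcg(scores, labels, k=10):
        # Rank labels by predicted score, then normalize by the ideal ordering.
        order = np.argsort(-np.asarray(scores))
        ranked = np.asarray(labels)[order]
        ideal = np.sort(labels)[::-1]
        denom = dcg(ideal, k)
        return dcg(ranked, k) / denom if denom > 0 else 0.0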
    Preview abstract Existing unbiased learning-to-rank models use counterfactual inference, notably Inverse Propensity Scoring (IPS), to learn a ranking function from biased click data. They handle the click incompleteness bias, but usually assume that the clicks are noise-free, i.e., a clicked document is always assumed to be relevant. In this paper, we relax this unrealistic assumption and study click noise explicitly in the unbiased learning-to-rank setting. Specifically, we model the noise as the position-dependent trust bias and propose a noise-aware Position-Based Model, named TrustPBM, to better capture user click behavior. We propose an Expectation-Maximization algorithm to estimate both examination and trust bias from click data in TrustPBM. Furthermore, we show that it is difficult to use a pure IPS method to incorporate click noise and thus propose a novel method that combines a Bayes rule application with IPS for unbiased learning-to-rank. We evaluate our proposed methods on three personal search data sets and demonstrate that our proposed model can significantly outperform the existing unbiased learning-to-rank methods. View details
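    One way to write a noise-aware position-based click model of the kind described here is shown below, where $\theta_k$ is the examination probability at position $k$, $\gamma_{q,d}$ the relevance probability, and $\epsilon_k^{+}, \epsilon_k^{-}$ position-dependent trust-bias parameters; the exact parameterization used in the paper may differ.

    $$P(C=1 \mid q, d, k) \;=\; \theta_k\left(\epsilon_k^{+}\,\gamma_{q,d} \;+\; \epsilon_k^{-}\,(1-\gamma_{q,d})\right)$$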
    Uncovering Hidden Structure in Sequence Data via Threading Recurrent Models
    Daniel Silva
    Yuchen Wu
    Shibani Sanan
    Surojit Chatterjee
    Proceedings of the 12th ACM International Conference on Web Search and Data Mining (2019), pp. 186-194
    Preview abstract Long Short-Term Memory (LSTM) is one of the most powerful sequence models for user browsing history [17, 22] or natural language text [19]. Despite the strong performance, it has not gained popularity for user-facing applications, mainly owing to a large number of parameters and lack of interpretability. Recently Zaheer et al. [25] introduced Latent LSTM Allocation (LLA) to address these problems by incorporating topic models with LSTM, where the topic model maps observed words in each sequence to topics that evolve using an LSTM model. In our experiments, we found the resulting model, although powerful and interpretable, to show shortcomings when applied to sequence data that exhibit multiple modes of behavior with abrupt dynamic changes. To address this problem we introduce thLLA: a threading LLA model. thLLA has the ability to break each sequence into a set of segments and then model the dynamics in each segment using an LSTM mixture. In that way, thLLA can model abrupt changes in sequence dynamics and provides a better fit for sequence data while still being interpretable and requiring fewer parameters. In addition, thLLA uncovers hidden themes in the data via its dynamic mixture components. However, such generalization and interpretability come at the cost of a complex dependence structure, for which inference would be extremely non-trivial. To remedy this, we present an efficient sampler based on a particle MCMC method for inference that can draw from the joint posterior directly. Experimental results confirm the superiority of thLLA and the stability of the new inference algorithm on a variety of domains. View details
    Estimating Position Bias without Intrusive Interventions
    Aman Agarwal
    Ivan Zaitsev
    Thorsten Joachims
    Proceedings of the 12th ACM International Conference on Web Search and Data Mining (2019), pp. 474-482
    Preview abstract Presentation bias is one of the key challenges when learning from implicit feedback in search engines, as it confounds the relevance signal. While it was recently shown how counterfactual learning-to-rank (LTR) approaches can provably overcome presentation bias when observation propensities are known, it remains to show how to effectively estimate these propensities. In this paper, we propose the first method for producing consistent propensity estimates without manual relevance judgments, disruptive interventions, or restrictive relevance modeling assumptions. First, we show how to harvest a specific type of intervention data from historic feedback logs of multiple different ranking functions, and show that this data is sufficient for consistent propensity estimation in the position-based model. Second, we propose a new extremum estimator that makes effective use of this data. In an empirical evaluation, we find that the new estimator provides superior propensity estimates in two real-world systems -- Arxiv Full-text Search and Google Drive Search. Beyond these two points, we find that the method is robust to a wide range of settings in simulation studies. View details
    Learning Groupwise Scoring Functions Using Deep Neural Networks
    Qingyao Ai
    Proceedings of the First International Workshop On Deep Matching In Practical Applications (2019)
    Preview abstract While in a classification or a regression setting a label or a value is assigned to each individual document, in a ranking setting we determine the relevance ordering of the entire input document list. This difference leads to the notion of relative relevance between documents in ranking. The majority of the existing learning-to-rank algorithms model such relativity at the loss level using pairwise or listwise loss functions. However, they are restricted to pointwise scoring functions, i.e., the relevance score of a document is computed based on the document itself, regardless of the other documents in the list. In this paper, we overcome this limitation by proposing generalized groupwise scoring functions (GSFs), in which the relevance score of a document is determined jointly by groups of documents in the list. We learn GSFs with a deep neural network architecture, and demonstrate that several representative learning-to-rank algorithms can be modeled as special cases in our framework. We conduct evaluation using the public MSLR-WEB30K dataset, and our experiments show that GSFs lead to significant performance improvements both in a standalone deep learning architecture and when combined with a state-of-the-art tree-based learning-to-rank algorithm. View details
    Learning Groupwise Multivariate Scoring Functions Using Deep Neural Networks
    Qingyao Ai
    Sebastian Bruch
    Proceedings of the 5th ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR) (2019), pp. 85-92
    Preview abstract While in a classification or a regression setting a label or a value is assigned to each individual document, in a ranking setting we determine the relevance ordering of the entire input document list. This difference leads to the notion of relative relevance between documents in ranking. The majority of the existing learning-to-rank algorithms model such relativity at the loss level using pairwise or listwise loss functions. However, they are restricted to univariate scoring functions, i.e., the relevance score of a document is computed based on the document itself, regardless of other documents in the list. To overcome this limitation, we propose a new framework for multivariate scoring functions, in which the relevance score of a document is determined jointly by multiple documents in the list. We refer to this framework as GSFs---groupwise scoring functions. We learn GSFs with a deep neural network architecture, and demonstrate that several representative learning-to-rank algorithms can be modeled as special cases in our framework. We conduct evaluation using click logs from one of the largest commercial email search engines, as well as a public benchmark dataset. In both cases, GSFs lead to significant performance improvements, especially in the presence of sparse textual features. View details
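    A minimal sketch of a groupwise (multivariate) scoring function, under the assumption of a plain feed-forward architecture: a group of m documents is scored jointly, and each document's final score averages its scores over the groups it appears in. All names are illustrative, not the paper's implementation.

    import numpy as np

    def group_score(group_features, w1, b1, w2, b2):
        # Jointly score a group of m documents: concatenate their feature vectors,
        # pass them through a small MLP, and emit m scores at once.
        x = np.concatenate(group_features)
        h = np.maximum(x @ w1 + b1, 0.0)     # ReLU hidden layer
        return h @ w2 + b2                   # one score per document in the group

    def gsf_scores(doc_features, groups, params):
        # Average each document's score across all sampled groups containing it.
        totals = np.zeros(len(doc_features))
        counts = np.zeros(len(doc_features))
        for group in groups:                 # e.g., sliding windows of size m over the list
            scores = group_score([doc_features[i] for i in group], *params)
            for slot, doc in enumerate(group):
                totals[doc] += scores[slot]
                counts[doc] += 1
        return totals / np.maximum(counts, 1)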
    Revisiting Approximate Metric Optimization in the Age of Deep Neural Networks
    Sebastian Bruch
    Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '19) (2019), pp. 1241-1244
    Preview abstract Learning-to-Rank is a branch of supervised machine learning that seeks to produce an ordering of a list of items such that the utility of the ranked list is maximized. Unlike most machine learning techniques, however, the objective cannot be directly optimized using gradient descent methods as it is either discontinuous or flat everywhere. As such, learning-to-rank methods often optimize a loss function that either is loosely related to or upper-bounds a ranking utility instead. A notable exception is the approximation framework originally proposed by Qin et al. that facilitates a more direct approach to ranking metric optimization. We revisit that framework almost a decade later in light of recent advances in neural networks and demonstrate its superiority empirically. Through this study, we hope to show that the ideas from that work are more relevant than ever and can lay the foundation of learning-to-rank research in the age of deep neural networks. View details
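    The approximation framework referenced here replaces the hard rank with a smooth surrogate. In the usual formulation (the exact notation is an assumption on our part), the rank of item $i$ with score $s_i$ and temperature $T$ is approximated by a sum of sigmoids, which is then substituted into the metric, e.g., NDCG:

    $$\widehat{\operatorname{rank}}(i) \;=\; 1 + \sum_{j \neq i} \frac{1}{1 + e^{-(s_j - s_i)/T}}$$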
    Preview abstract Semantic text matching is one of the most important research problems in many domains, including, but not limited to, information retrieval, question answering, and recommendation. Among the different types of semantic text matching, long-document-to-long-document text matching has many applications, but has rarely been studied. Most existing approaches for semantic text matching have limited success in this setting, due to their inability to capture and distill the main ideas and topics from long-form text. In this paper, we propose a novel Siamese multi-depth attention-based hierarchical recurrent neural network (SMASH RNN) that learns the long-form semantics, and enables long-form document based semantic text matching. In addition to word information, SMASH RNN uses the document structure to improve the representation of long-form documents. Specifically, SMASH RNN synthesizes information from different document structure levels, including paragraphs, sentences, and words. An attention-based hierarchical RNN derives a representation for each document structure level. Then, the representations learned from the different levels are aggregated to learn a more comprehensive semantic representation of the entire document. For semantic text matching, a Siamese structure couples the representations of a pair of documents, and infers a probabilistic score as their similarity. We conduct an extensive empirical evaluation of SMASH RNN with three practical applications, including email attachment suggestion, related article recommendation, and citation recommendation. Experimental results on public data sets demonstrate that SMASH RNN significantly outperforms competitive baseline methods across various classification and ranking scenarios in the context of semantic matching of long-form documents. View details
    RiSER: Learning Better Representations for Richly Structured Emails
    Furkan Kocayusufoğlu
    Nguyen Ha Vo
    Proceedings of the 2019 World Wide Web Conference, pp. 886-895
    Preview abstract Recent studies show that an overwhelming majority of emails are machine-generated and sent by businesses to consumers. Many large email services are interested in extracting structured data from such emails to enable intelligent assistants. This allows experiences like being able to answer questions such as ``What is the address of my hotel in New York?'' or ``When does my flight leave?''. A high-quality email classifier is a critical piece in such a system. In this paper, we argue that the rich formatting used in business-to-consumer emails contains valuable information that can be used to learn better representations. Most existing methods focus only on textual content and ignore the rich HTML structure of emails. We introduce RiSER (Richly Structured Email Representation) -- an approach for incorporating both the structure and content of emails. RiSER projects the email into a vector representation by jointly encoding the HTML structure and the words in the email. We then use this representation to train a classifier. To our knowledge, this is the first description of a neural technique for combining formatting information along with the content to learn improved representations for richly formatted emails. Experimenting with a large corpus of emails received by users of Gmail, we show that RiSER outperforms strong attention-based LSTM baselines. We expect that these benefits will extend to other corpora with richly formatted documents. We also demonstrate with examples where leveraging HTML structure leads to better predictions. View details
    Predictive Crawling for Commercial Web Content
    Shuguang Han
    Przemek Gajda
    Sergey Novikov
    Robin Dua
    Alexandrin Popescul
    Proceedings of the 2019 World Wide Web Conference, pp. 627-637
    Preview abstract Web crawlers spend significant resources to maintain the freshness of their crawled data. This paper describes the optimization of resources to ensure that product prices shown in ads in the context of a shopping sponsored search service are synchronized with current merchant prices. We are able to use the predictability of price changes to build a machine-learned system leading to considerable resource savings for both the merchants and the crawler. We describe our solution to technical challenges due to partial observability of price history, feedback loops arising from applying machine-learned models, and offers in a cold-start state. Empirical evaluation over large-scale product crawl data demonstrates the effectiveness of our model and confirms its robustness towards unseen data. We argue that our approach is applicable in more general data pull settings. View details
    Learning with Sparse and Biased Feedback for Personal Search
    Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI) (2018), pp. 5219-5223
    Preview abstract Personal search, including email, on-device, and personal media search, has recently attracted considerable attention from the information retrieval community. In this paper, we provide an overview of the challenges and opportunities of learning with implicit user feedback (e.g., click data) in personal search. Implicit user feedback provides a convenient source of supervision for ranking models in personal search. This feedback, however, has two major drawbacks: it is highly sparse and biased due to the personal nature of queries and documents. We demonstrate how these drawbacks can be overcome, and empirically demonstrate the benefits of learning with implicit feedback in the context of a large-scale email search engine. View details
    Learning Effective Embeddings for Machine Generated Emails with Applications to Email Category Prediction
    Yu Sun
    Luis Garcia Pueyo
    Proceedings of the IEEE International Conference on Big Data (2018), pp. 1846-1855
    Preview abstract Machine-generated business-to-consumer (B2C) emails such as receipts, newsletters, and promotions constitute a large portion of users' inboxes today. These emails reflect the users' interests and are often sequentially correlated, e.g., users interested in relocating may receive a sequence of messages on housing, moving, job availability, etc. We aim to infer (and eventually serve) the users' future interests by predicting the categories of their future emails. There are many good methods, such as recurrent neural networks, that can be applied for such predictions, but in all cases the key to better performance is an effective representation of emails and users. To this end, we propose a general framework for embedding learning for emails and users, using as input only the sequence of B2C templates users receive and open. (A template is a B2C email stripped of all transient information related to specific users.) These learned embeddings allow us to identify both sequentially correlated emails and users with similar sequential interests. We can also use the learned embeddings either as input features or as embedding initializers for email category prediction. Extensive experiments with millions of fully anonymized B2C emails demonstrate that the learned embeddings can significantly improve the prediction accuracy for future email categories. We hope that this effective yet simple embedding learning framework will inspire new machine intelligence applications that will improve the users' email experience. View details
    Preview abstract Extracting structured data from emails can enable several assistive experiences, such as reminding the user when a bill payment is due, answering queries about the departure time of a booked flight, or proactively surfacing an emailed discount coupon while the user is at that store. This paper presents Juicer, a system for extracting information from email that is serving over a billion Gmail users daily. We describe how the design of the system was informed by three key principles: scaling to a planet-wide email service, isolating the complexity to provide a simple experience for the developer, and safeguarding the privacy of users (our team and the developers we support are not allowed to view any single email). We describe the design tradeoffs made in building this system, the challenges faced and the approaches used to tackle them. We present case studies of three extraction tasks implemented on this platform—bill reminders, commercial offers, and hotel reservations—to illustrate the effectiveness of the platform despite challenges unique to each task. Finally, we outline several areas of ongoing research in large-scale machine-learned information extraction from email. View details
    Training On-Device Ranking Models from Cross-User Interactions in a Privacy-Preserving Fashion
    Proc. of the First Biennial Conference on Design of Experimental Search & Information Retrieval Systems (DESIRES) (2018), pp. 108
    Preview abstract (See the attached PDF -- a one-page abstract for the upcoming DESIRES 2018 workshop) View details
    Position Bias Estimation for Unbiased Learning to Rank in Personal Search
    Proceedings of the 11th ACM International Conference on Web Search and Data Mining (WSDM), ACM (2018), pp. 610-618
    Preview abstract A well-known challenge in learning from click data is its inherent bias and most notably position bias. Traditional click models aim to extract the (query, document) relevance and the estimated bias is usually discarded after relevance is extracted. In contrast, the most recent work on unbiased learning-to-rank can effectively leverage the bias and thus focuses on estimating bias rather than relevance. Existing approaches use search result randomization over a small percentage of production traffic to estimate the position bias. This is not desired because result randomization can negatively impact users' search experience. In this paper, we compare different schemes for result randomization (i.e., RandTopN and RandPair) and show their negative effect in personal search. Then we study how to infer such bias from regular click data without relying on randomization. We propose a regression-based Expectation-Maximization (EM) algorithm that is based on a position bias click model and that can handle highly sparse clicks in personal search. We evaluate our EM algorithm and the extracted bias in the learning-to-rank setting. Our results show that it is promising to extract position bias from regular clicks without result randomization. The extracted bias can improve the learning-to-rank algorithms significantly. In addition, we compare the pointwise and pairwise learning-to-rank models. Our results show that pairwise models are more effective in leveraging the estimated bias. View details
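    The position-based click model underlying this approach factorizes a click into examination and relevance, where $\theta_k$ is the probability of examining position $k$ and $\gamma_{q,d}$ is the probability that document $d$ is relevant to query $q$; the regression-based EM algorithm alternates between estimating these two sets of parameters (the concrete regression step is the paper's contribution and is not reproduced here):

    $$P(C = 1 \mid q, d, k) \;=\; P(E = 1 \mid k)\,P(R = 1 \mid q, d) \;=\; \theta_k\,\gamma_{q,d}$$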
    Preview abstract Ranking functions are used to return ranked lists of items for users to interact with. How to evaluate ranking functions using historical user interaction logs, also known as off-policy evaluation, is an important but challenging problem. The commonly used Inverse Propensity Scores (IPS) approaches work better in the single-item case, but suffer from extremely low data efficiency in the ranked-list case. In this paper, we study how to improve the data efficiency of IPS approaches in the offline comparison setting. We propose two approaches, Trunc-match and Rand-interleaving, for offline comparison using uniformly randomized data. We show that these methods can improve the data efficiency and also the comparison sensitivity based on one of the largest email search engines. View details
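    For context, the standard single-item IPS estimator of a policy's value from logged interactions $(x_i, a_i, r_i)$ with logging propensities $p_i$ is shown below (the notation is ours, not the paper's); the paper's Trunc-match and Rand-interleaving variants address how such estimates can be formed efficiently for whole ranked lists.

    $$\hat{V}_{\mathrm{IPS}}(\pi) \;=\; \frac{1}{n}\sum_{i=1}^{n} \frac{\mathbb{1}\!\left[\pi(x_i) = a_i\right]}{p_i}\, r_i$$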
    The LambdaLoss Framework for Ranking Metric Optimization
    Proceedings of The 27th ACM International Conference on Information and Knowledge Management (CIKM '18), ACM (2018), pp. 1313-1322
    Preview abstract How to optimize ranking metrics such as Normalized Discounted Cumulative Gain (NDCG) is an important but challenging problem, because ranking metrics are either flat or discontinuous everywhere, which makes them hard to optimize directly. Among existing approaches, LambdaRank is a novel algorithm that incorporates ranking metrics into its learning procedure. Though empirically effective, it still lacks theoretical justification. For example, the underlying loss that LambdaRank optimizes for has remained unknown until now. Due to this, there is no principled way to advance the LambdaRank algorithm further. In this paper, we present LambdaLoss, a probabilistic framework for ranking metric optimization. We show that LambdaRank is a special configuration with a well-defined loss in the LambdaLoss framework, and thus provide theoretical justification for it. More importantly, the LambdaLoss framework allows us to define metric-driven loss functions that have a clear connection to different ranking metrics. We show a few cases in this paper and evaluate them on three publicly available data sets. Experimental results show that our metric-driven loss functions can significantly improve the state-of-the-art learning-to-rank algorithms. View details
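    For reference, LambdaRank's gradient for a pair of documents $(i, j)$ with scores $s_i, s_j$ is commonly written as below (notation follows the usual presentation, not necessarily the paper's); the paper's contribution is to exhibit a well-defined loss whose gradients take this form and to generalize it to other metric-driven losses.

    $$\lambda_{ij} \;=\; \frac{-\sigma}{1 + e^{\sigma (s_i - s_j)}}\,\left|\Delta \mathrm{NDCG}_{ij}\right|$$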
    Preview abstract TensorFlow Ranking is the first open source library for solving large-scale ranking problems in a deep learning framework. It is highly configurable and provides easy-to-use APIs to support different scoring mechanisms, loss functions and evaluation metrics in the learning-to-rank setting. Our library is developed on top of TensorFlow and can thus fully leverage the advantages of this platform. For example, it is highly scalable, both in training and in inference, and can be used to learn ranking models over massive amounts of user activity data. We empirically demonstrate the effectiveness of our library in learning ranking functions for large-scale search and recommendation applications in Gmail and Google Drive. View details
    Semantic Location in Email Query Suggestion
    John Foley
    Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (2018), pp. 977-980
    Preview abstract Mobile devices are pervasive, which means that users have access to web content and their personal documents at all locations, not just their home or office. Existing work has studied how locations can influence information needs, focusing on web queries. We explore whether or not location information can be helpful to users who are searching their own personal documents. We wish to study whether a user's location can predict their queries over their own personal data, so we focus on the task of query suggestion. While we find that using location directly can be helpful, it does not generalize well to novel locations. To improve this situation, we explore using semantic location: that is, rather than memorizing location-query associations, we generalize our location information to the names of the closest points of interest. By using short, semantic descriptions of locations, we find that we can more robustly improve query completion and observe that users are already using locations to extend their own queries in this domain. We present a simple but effective model that can use location to predict queries for a user even before they type anything into a search box, and which learns effectively even when not all queries have location information. View details
    Preview abstract A vast majority of the emails received by people today are machine-generated by businesses communicating with consumers. While some emails originate as a result of a transaction (e.g., hotel or restaurant reservation confirmations, online purchase receipts, shipping notifications, etc.), a large fraction are commercial emails promoting an offer (a special sale, free shipping, available for a limited time, etc.). The sheer number of these promotional emails makes it difficult for users to read all these emails and decide which ones are actually interesting and actionable. In this paper, we tackle the problem of extracting information from commercial emails promoting an offer to the user. This information enables an email platform to build several new experiences that can unlock the value in these emails without the user having to navigate and read all of them. For instance, we can highlight offers that are expiring soon, or display a notification when there's an unexpired offer from a merchant if your phone recognizes that you are at that merchant's store. A key challenge in extracting information from such commercial emails is that they are often image-rich and contain very little text. Training a machine learning (ML) model on a rendered image-rich email and applying it to each incoming email can be prohibitively expensive. In this paper, we describe a cost-effective approach for extracting signals from both the text and image content of commercial emails in the context of a free email platform that serves over a billion users around the world. The key insight is to leverage the template structure of emails, and use off-the-shelf OCR techniques to obtain the text from images to augment the existing text features offline. Compared to a text-only approach, we show that we are able to identify 9.12% more email templates corresponding to ~5% more emails being identified as offers. Interestingly, our analysis shows that this 5% improvement in coverage is across the board, irrespective of whether the emails were sent by large merchants or small local merchants, allowing us to deliver an improved experience for everyone. View details
    Quick Access: Building a Smart Experience for Google Drive
    Alexandrin Popescul
    Julian Gibbons
    Alan Green
    Michael James Smith
    Cayden Meyer
    Reuben Kan
    Proc. of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2017), pp. 1643-1651
    Preview abstract Google Drive is a cloud storage and collaboration service used by hundreds of millions of users around the world. Quick Access is a new feature in Google Drive that surfaces the relevant documents to the user on the home page. We describe the development of a machine-learned service behind this feature. Our metrics show that this feature cuts the time it takes for users to locate their documents in half. The development of this product feature is an illustration of a number of more general challenges and constraints associated with machine learning product deployment such as dealing with private corpora and protecting user privacy, working with data services that are not designed with machine-learning in mind and may be owned and operated by different teams with different constraints, and evolving product definitions which inform the metric being optimized. We believe that the lessons learned from this experience will be useful to practitioners tackling a wide range of applied machine-learning problems. View details
    Learning from User Interactions in Personal Search via Attribute Parameterization
    Proceedings of the 10th ACM International Conference on Web Search and Data Mining (WSDM), ACM (2017), pp. 791-800
    Preview abstract User interaction data (e.g., click data) has proven to be a powerful signal for learning-to-rank models in web search. However, such models require observing multiple interactions across many users for the same query-document pair to achieve statistically meaningful gains. Therefore, utilizing user interaction data for improving search over personal, rather than public, content is a challenging problem. First, the documents (e.g., emails or private files) are not shared across users. Second, user search queries are of a personal nature (e.g., [alice's address]) and may not generalize well across users. In this paper, we propose a solution to these challenges, by projecting user queries and documents into a multi-dimensional space of fine-grained and semantically coherent attributes. We then introduce a novel parameterization technique to overcome sparsity in the multi-dimensional attribute space. Attribute parameterization enables effective usage of cross-user interactions for improving personal search quality -- which is a first such published result, to the best of our knowledge. Experiments with a dataset derived from interactions of users of one of the world's largest personal search engines demonstrate the effectiveness of the proposed attribute parameterization technique. View details
    Email Category Prediction
    Aston Zhang
    Luis Garcia Pueyo
    Companion Proc. of the 26th International World Wide Web Conference (2017), pp. 495-503
    Preview abstract According to recent estimates, about 90% of the emails received by consumers are machine-generated. Such messages include shopping receipts, promotional campaigns, newsletters, booking confirmations, etc. Most such messages are created by populating a fixed template with a small amount of personalized information, such as name, salutation, reservation numbers, dates, etc. Web mail providers (Gmail, Hotmail, Yahoo) are leveraging the structured nature of such emails to extract salient information and use it to improve the user experience: e.g. by automatically entering reservation data into a user's calendar, or by sending alerts about upcoming shipments. To facilitate these extraction tasks it is helpful to classify templates according to their category, e.g. restaurant reservations or bill reminders, since each category triggers a particular user experience. Recent research has focused on discovering the causal thread of templates, e.g. inferring that a shopping order is usually followed by a shipping confirmation, an airline booking is followed by a confirmation and then by a “ready to check in” message, etc. Gamzu et al. took this idea one step further by implementing a method to predict the template category of future emails for a given user based on previously received templates. The motivation is that predicting future emails has a wide range of potential applications, including better user experiences (e.g. warning users of items ordered but not shipped), targeted advertising (e.g. users that recently made a flight reservation may be interested in hotel reservations), and spam classification (a message that is part of a legitimate causal thread is unlikely to be spam). The gist of the Gamzu et al. approach is modeling the problem as a Markov chain, where the nodes are templates or temporal events (e.g. the first day of the month). This paper expands on their work by investigating the use of neural networks for predicting the category of emails that will arrive during a fixed-sized time window in the future. We consider two types of neural networks: multi-layer perceptrons (MLP), a type of feedforward neural network; and long short-term memory (LSTM), a type of recurrent neural network. For each type of neural network, we explore the effects of varying their configuration (e.g. number of layers or number of neurons) and hyper-parameters (e.g. drop-out ratio). We find that neural networks vastly outperform the Markov chain approach in prediction accuracy, and that LSTMs perform slightly better than MLPs. We offer some qualitative interpretation of our findings and identify some promising future directions. View details
    Learning to Rank with Selection Bias in Personal Search
    Proc. of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM (2016), pp. 115-124
    Preview abstract Click-through data has proven to be a critical resource for improving search ranking quality. Though a large amount of click data can be easily collected by search engines, various biases make it difficult to fully leverage this type of data. In the past, many click models have been proposed and successfully used to estimate the relevance for individual query-document pairs in the context of web search. These click models typically require a large quantity of clicks for each individual pair and this makes them difficult to apply in systems where click data is highly sparse due to personalized corpora and information needs, e.g., personal search. In this paper, we study the problem of how to leverage sparse click data in personal search and introduce a novel selection bias problem and address it in the learning-to-rank framework. This paper proposes a few bias estimation methods, including a novel query-dependent one that captures queries with similar results and can successfully deal with sparse data. We empirically demonstrate that learning-to-rank that accounts for query-dependent selection bias yields significant improvements in search effectiveness through online experiments with one of the world's largest personal search engines. View details
    Using Machine Learning to Improve the Email Experience
    Proc. of the 25th ACM International Conference on Information and Knowledge Management, ACM (2016), pp. 891
    Preview abstract Email is an essential communication medium for billions of people, with most users relying on web-based email services. Two recent trends are changing the email experience: smartphones have become the primary tool for accessing online services including email, and machine learning has come of age. Smartphones have a number of compelling properties (they are location-aware, usually with us, and allow us to record and share photos and videos), but they also have a few limitations, notably limited screen size and small and tedious virtual keyboards. Over the past few years, Google researchers and engineers have leveraged machine learning to ameliorate these weaknesses, and in the process created novel experiences. In this talk, I will give three examples of machine learning improving the email experience. The first example describes how we are improving email search. Displaying the most relevant results as the query is being typed is particularly useful on smartphones due to the aforementioned limitations. Combining hand-crafted and machine-learned rankers is powerful, but training learned rankers requires a relevance-labeled training set. User privacy prohibits us from employing raters to produce relevance labels. Instead, we leverage implicit feedback (namely clicks) provided by the users themselves. Using click logs as training data in a learning-to-rank setting is intriguing, since there is a vast and continuous supply of fresh training data. However, the click stream is biased towards queries that receive more clicks -- e.g. queries for which we already return the best result in the top-ranked position. I will summarize our work~\cite{Pointer} on neutralizing that bias. The second example describes how we extract key information from appointment and reservation emails and surface it at the appropriate time as a reminder on the user's smartphone. Our basic approach~\cite{Juicer} is to learn the templates that were used to generate these emails, use these templates to extract key information such as times and dates, store the extracted records in a personal information store, and surface them at the right time (taking contextual information such as estimated transit time into account). The third example describes Smart Reply~\cite{SmartReply}, a system that offers a set of three short responses to those incoming emails for which a short response is appropriate, allowing users to respond quickly with just a few taps, without typing or involving voice-to-text transcription. The basic approach is to learn a model of likely short responses to original emails from the corpus, and then to apply the model whenever a new message arrives. Other considerations include offering a set of responses that are all appropriate and yet diverse, and triggering only when sufficiently confident that each response is of high quality and appropriate. View details
    Debugging a Crowdsourced Task with Low Inter-Rater Agreement
    Omar Alonso
    Catherine C. Marshall
    Joint Conference on Digital Libraries (2015)
    Social Search
    14th International Conference on Web Engineering (2014)
    A Human-Centered Framework for Ensuring Reliability on Crowdsourced Labeling Tasks
    Omar Alonso
    Catherine C Marshall
    First AAAI Conference on Human Computation and Crowdsourcing, AAAI (2013)
    Robust query rewriting using anchor data
    Nick Craswell
    Bodo Billerbeck
    6th ACM Intl. Conference on Web Search and Data Mining (WSDM), ACM (2013), pp. 335-344
    Are Some Tweets More Interesting Than Others? #HardQuestion
    Omar Alonso
    Catherine C Marshall
    7th Annual Symposium on Human-Computer Interaction and Information Retrieval, ACM (2013)
    Boot-Strapping Language Identifiers for Short Colloquial Postings
    Moisés Goldszmidt
    Stelios Paparizos
    European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD), Springer (2013), pp. 95-111
    Editorial
    Helen Ashman
    Arun Iyengar
    TWEB, vol. 6 (2012), pp. 5
    How user behavior is related to social affinity
    Rina Panigrahy
    Yinglian Xie
    WSDM (2012), pp. 713-722
    Detecting quilted web pages at scale
    SIGIR (2012), pp. 385-394
    Of hammers and nails: an empirical comparison of three paradigms for processing large graphs
    Alan Halverson
    Krishnaram Kenthapadi
    Sreenivas Gollapudi
    WSDM (2012), pp. 103-112
    The Power of Peers
    Nick Craswell
    ECIR (2011), pp. 497-502
    Microsoft Research at TREC 2011 Web Track
    Bodo Billerbeck
    Nick Craswell
    TREC (2011)
    Querying the Web Graph - (Invited Talk)
    SPIRE (2010), pp. 1-12
    A Sketch-Based Distance Oracle for Web-Scale Graphs
    Atish Das Sarma
    Sreenivas Gollapudi
    Rina Panigrahy
    Web Search and Data Mining (WSDM) (2010)
    Microsoft Research at TREC 2010 Web Track
    Nick Craswell
    TREC (2010)
    Web Crawling
    Christopher Olston
    Foundations and Trends in Information Retrieval, vol. 4 (2010), pp. 175-246
    Web Spam Detection
    Encyclopedia of Database Systems (2009), pp. 3520-3523
    Web Crawler Architecture
    Encyclopedia of Database Systems (2009), pp. 3462-3465
    Microsoft Research at TREC 2009: Web and Relevance Feedback Track
    Nick Craswell
    Stephen Robertson
    Emine Yilmaz
    TREC (2009)
    Web Search Relevance Ranking
    Hugo Zaragoza
    Encyclopedia of Database Systems (2009), pp. 3497-3501
    Less is more: sampling the neighborhood graph makes SALSA better and faster
    Sreenivas Gollapudi
    Rina Panigrahy
    WSDM (2009), pp. 242-251
    The scalable hyperlink store
    Hypertext (2009), pp. 89-98
    Efficient and effective link analysis with precomputed salsa maps
    Nick Craswell
    CIKM (2008), pp. 53-62
    Introduction to special section on adversarial issues in Web search
    Brian D. Davison
    TWEB, vol. 2 (2008)
    Computing Information Retrieval Performance Measures Efficiently in the Presence of Tied Scores
    Frank McSherry
    ECIR (2008), pp. 414-421
    Hits on the web: how does it compare?
    Hugo Zaragoza
    Michael J. Taylor
    SIGIR (2007), pp. 471-478
    Using Bloom Filters to Speed Up HITS-Like Ranking Algorithms
    Sreenivas Gollapudi
    Rina Panigrahy
    WAW (2007), pp. 195-201
    Comparing the effectiveness of hits and salsa
    CIKM (2007), pp. 157-164
    Adversarial information retrieval on the web (AIRWeb 2006)
    Brian D. Davison
    Tim Converse
    SIGIR Forum, vol. 40 (2006), pp. 27-30
    Detecting spam web pages through content analysis
    Alexandros Ntoulas
    Mark Manasse
    WWW (2006), pp. 83-92
    How search engines shape the web
    Byron Dom
    Krishna Bharat
    Jan O. Pedersen
    Yoshinobu Tonomura
    WWW (Special interest tracks and posters) (2005), pp. 879
    Detecting phrase-level duplication on the world wide web
    Mark Manasse
    SIGIR (2005), pp. 170-177
    Boxwood: Abstractions as the Foundation for Storage Infrastructure
    John MacCormick
    Nick Murphy
    Chandramohan A. Thekkath
    Lidong Zhou
    OSDI (2004), pp. 105-120
    On The Evolution of Clusters of Near-Duplicate Web Pages
    Mark Manasse
    J. Web Eng., vol. 2 (2004), pp. 228-246
    Spam, Damn Spam, and Statistics: Using Statistical Analysis to Locate Spam Web Pages
    Mark Manasse
    WebDB (2004), pp. 1-6
    A large-scale study of the evolution of Web pages
    Mark Manasse
    Janet L. Wiener
    Softw., Pract. Exper., vol. 34 (2004), pp. 213-237
    A large-scale study of the evolution of web pages
    Mark Manasse
    Janet L. Wiener
    WWW (2003), pp. 669-678
    Efficient URL caching for world wide web crawling
    Janet L. Wiener
    WWW (2003), pp. 679-689
    On the Evolution of Clusters of Near-Duplicate Web Pages
    Mark Manasse
    LA-WEB (2003), pp. 37-45
    Breadth-first crawling yields high-quality pages
    Janet L. Wiener
    WWW (2001), pp. 114-118
    Web-based Algorithm Animation
    DAC (2001), pp. 506-511
    Performance limitations of the Java core libraries
    Allan Heydon
    Concurrency - Practice and Experience, vol. 12 (2000), pp. 363-373
    Performance Limitations of the Java Core Libraries
    Allan Heydon
    Java Grande (1999), pp. 35-41
    Mercator: A Scalable, Extensible Web Crawler
    Allan Heydon
    World Wide Web, vol. 2 (1999), pp. 219-229
    Distributed Applets
    Marc H. Brown
    CHI Extended Abstracts (1997), pp. 204-205
    Collaborative Active Textbooks
    Marc H. Brown
    J. Vis. Lang. Comput., vol. 8 (1997), pp. 453-486
    A Java-Based Implementation of Collaborative Active Textbooks
    Marc H. Brown
    Roope Raisamo
    VL (1997), pp. 376-383
    Programming in Three Dimensions
    J. Vis. Lang. Comput., vol. 7 (1996), pp. 219-242
    Distributed Active Objects
    Marc H. Brown
    Computer Networks, vol. 28 (1996), pp. 1037-1052
    Collaborative Active Textbooks: A Web-Based Algorithm Animation System for an Electronic Classroom
    Marc H. Brown
    VL (1996), pp. 266-275
    Obliq-3D: A High-Level, Fast-Turnaround 3D Animation System
    Marc H. Brown
    IEEE Trans. Vis. Comput. Graph., vol. 1 (1995), pp. 175-193
    A Library for Visualizing Combinatorial Structures
    Marc H. Brown
    IEEE Visualization (1994), pp. 164-171
    Cube: Eine dreidimensionale visuelle Programmiersprache (Cube: A Three-Dimensional Visual Programming Language)
    Simon M. Kaplan
    GI Jahrestagung (1993), pp. 340-345
    Algorithm Animation Using 3D Interactive Graphics
    Marc H. Brown
    ACM Symposium on User Interface Software and Technology (1993), pp. 93-100
    Specifying Visual Languages with Conditional Set Rewrite Systems
    Simon M. Kaplan
    VL (1993), pp. 12-18
    A Prototype Implementation of the Cube Language
    Simon M. Kaplan
    VL (1992), pp. 270-272
    The CUBE Language
    Simon M. Kaplan
    VL (1991), pp. 218-224
    Enhancing Show-and-Tell with a polymorphic type system and higher-order functions
    Eric J. Golin
    VL (1990), pp. 215-220
    Roles and their role in posing recursive queries
    Sharon Kuck
    Roland John
    Arnd Lewe
    Inf. Syst., vol. 15 (1990), pp. 173-186