<?xml version="1.0" encoding="UTF-8" ?>
<feed xmlns="http://www.w3.org/2005/Atom">
<id>http://research.google.com/pubs/atom.xml</id>
<title type="text">Recent Google Publications (Atom)</title>
<updated>2009-06-28T23:18:40-08:00</updated>
<link href="http://research.google.com/pubs/atom.xml" rel="self"/>
<link type="text/html" rel="alternate" href="http://research.google.com/pubs/papers.html"/>
<entry>
<title><![CDATA[Sampling Techniques for the Nystrom Method]]></title>
<updated>2009-06-24T11:46:24-08:00</updated>
<id>urn:googlelabs:35390</id>
<summary>Sampling Techniques for the Nystrom Method, Sanjiv Kumar, Mehryar Mohri, Ameet Talwalkar</summary>
<link rel="alternate" href="http://www.cs.cmu.edu/~skumar/nys_sampling_aistats.pdf"/>
<category term="Machine Learning" label="Machine Learning"/>
<author><name>Sanjiv Kumar, Mehryar Mohri, Ameet Talwalkar</name></author>
</entry>
<entry>
<title><![CDATA[On Sampling-Based Approximate Spectral Decomposition]]></title>
<updated>2009-06-24T11:47:16-08:00</updated>
<id>urn:googlelabs:35389</id>
<summary>On Sampling-Based Approximate Spectral Decomposition, Sanjiv Kumar, Mehryar Mohri, Ameet Talkwalkar</summary>
<link rel="alternate" href="http://www.cs.cmu.edu/~skumar/nys_col_ICML.pdf"/>
<category term="Machine Learning" label="Machine Learning"/>
<author><name>Sanjiv Kumar, Mehryar Mohri, Ameet Talkwalkar</name></author>
</entry>
<entry>
<title><![CDATA[How opinions are received by online communities: A case study on Amazon.com helpfulness votes]]></title>
<updated>2009-06-24T12:02:15-08:00</updated>
<id>urn:googlelabs:35388</id>
<summary>There are many on-line settings in which users publicly express opinions. A number of these offer mechanisms for other users to evaluate these opinions; a canonical example is Amazon.com, where reviews come with annotations like ``26 of 32 people found the following review helpful.&#39;&#39; Opinion evaluation appears in many off-line settings as well, including market research and political campaigns. Reasoning about the evaluation of an opinion is fundamentally different from reasoning about the opinion itself: rather than asking, ``What did Y think of X?&#39;&#39;, we are asking, ``What did Z think of Y&#39;s opinion of X?&#39;&#39; Here we develop a framework for analyzing and modeling opinion evaluation, using a large-scale collection of Amazon book reviews as a dataset. We find that the perceived helpfulness of a review depends not just on its content but also but also in subtle ways on how the expressed evaluation relates to other evaluations of the same product. As part of our approach, we develop novel methods that take advantage of the phenomenon of review ``plagiarism&#39;&#39; to control for the effects of text in opinion evaluation, and we provide a simple and natural mathematical model consistent with our findings. Our analysis also allows us to distinguish among the predictions of competing theories from sociology and social psychology, and to discover unexpected differences in the collective opinion-evaluation behavior of user populations from different countries.</summary>
<link rel="alternate" href="http://www.google.com/search?lr=&amp;ie=UTF-8&amp;oe=UTF-8&amp;q=How+opinions+are+received+by+online+communities%3A+A+case+study+on+Amazon.com+helpfulness+votes+Danescu-Niculescu-Mizil+Kossinets+Kleinberg+Lee" type="text/html" title="Search for publication"/>
<category term="Information Retrieval" label="Information Retrieval"/>
<author><name>Cristian Danescu-Niculescu-Mizil, Gueorgi Kossinets, Jon Kleinberg, Lillian Lee</name></author>
</entry>
<entry>
<title><![CDATA[Web-Scale N-gram Models for Lexical Disambiguation]]></title>
<updated>2009-06-24T12:03:13-08:00</updated>
<id>urn:googlelabs:35387</id>
<summary>Web-Scale N-gram Models for Lexical Disambiguation, Shane Bergsma, Dekang Lin, Randy Goebel</summary>
<link rel="alternate" href="http://www.google.com/search?lr=&amp;ie=UTF-8&amp;oe=UTF-8&amp;q=Web-Scale+N-gram+Models+for+Lexical+Disambiguation+Bergsma+Lin+Goebel" type="text/html" title="Search for publication"/>
<category term="Natural Language Processing" label="Natural Language Processing"/>
<author><name>Shane Bergsma, Dekang Lin, Randy Goebel</name></author>
</entry>
<entry>
<title><![CDATA[Glen, Glenda or Glendale: Unsupervised and Semi-supervised Learning of English Noun Gender]]></title>
<updated>2009-06-24T11:32:48-08:00</updated>
<id>urn:googlelabs:35386</id>
<summary>Glen, Glenda or Glendale: Unsupervised and Semi-supervised Learning of English Noun Gender, Shane Bergsma, Dekang Lin, Randy Goebel</summary>
<link rel="alternate" href="http://www.google.com/search?lr=&amp;ie=UTF-8&amp;oe=UTF-8&amp;q=Glen%2C+Glenda+or+Glendale%3A+Unsupervised+and+Semi-supervised+Learning+of+English+Noun+Gender+Bergsma+Lin+Goebel" type="text/html" title="Search for publication"/>
<category term="Natural Language Processing" label="Natural Language Processing"/>
<author><name>Shane Bergsma, Dekang Lin, Randy Goebel</name></author>
</entry>
<entry>
<title><![CDATA[Online Learning with Global Cost Functions]]></title>
<updated>2009-06-24T12:04:28-08:00</updated>
<id>urn:googlelabs:35385</id>
<summary>We consider an online learning  setting where at each time
step the decision maker has to choose how to distribute the future loss
between k alternatives, and then observes the loss of each alternative.
Motivated by load balancing and job scheduling,
we consider a global cost function (over the losses incurred by each
alternative),
rather than a summation of the instantaneous losses as done
traditionally in online learning. Such global cost functions
include the makespan (the maximum over the alternatives) and
the L&lt;sub&gt;d&lt;/sub&gt; norm (over the alternatives).
Based on approachability theory, we design an algorithm that guarantees vanishing
 regret for this setting,
where the regret is measured with respect to the best static decision
that selects the same distribution over alternatives at every
time step.

For  the special case of makespan cost we devise a simple and efficient algorithm.
In contrast, we show that for concave global cost functions, such as
L&lt;sub&gt;d&lt;/sub&gt; norms for d&amp;lt;1,
the worst-case average regret does not vanish.</summary>
<link rel="alternate" href="http://www.cs.mcgill.ca/~colt2009/papers/005.pdf#page=1"/>
<category term="Algorithms" label="Algorithms"/>
<author><name>Eyal Even-Dar, Robert Kleinberg, Shie Mannor, Yishay Mansour</name></author>
</entry>
<entry>
<title><![CDATA[On the Convergence of Regret Minimization Dynamics in Concave Games]]></title>
<updated>2009-06-24T11:59:43-08:00</updated>
<id>urn:googlelabs:35384</id>
<summary>We consider standard regret minimization setting where at each time
step the decision maker has to choose a distribution over
&lt;em&gt;k&lt;/em&gt; alternatives, and then observes the loss of each alternative. The
setting is very similar to the classical online job scheduling setting
with three major differences:
&lt;ol&gt;
&lt;li&gt;Information model:
in the regret minimization setting losses are only observed after the
actions (assigning the job to a machine) is performed and not observed
before the action selection, as assumed in the classical online job
scheduling setting,
&lt;li&gt;The comparison class:
in regret minimization the comparison class is the best static
algorithm (i.e., distribution over alternatives) and not the optimal
offline solution.
&lt;li&gt;Performance measure: In regret minimization we measure the
additive difference to the optimal solution in the comparison class, in
contrast to the ratio used in online job scheduling
setting.
&lt;/ol&gt;
Motivated by load balancing and job scheduling,
we consider a global cost function (over the losses incur by each
alternative/machine),
rather than simply a summation of the instantaneous losses as done
traditionally in regret minimization. Such global cost functions
include the makespan (the maximum over the alternatives/machines) and
the L&lt;sub&gt;d&lt;/sub&gt; norm (over the alternatives/machines).

The major contribution of this work is to design a novel regret minimization
algorithm based on calibration that guarantees a vanishing average
regret,
where the regret is measured with respect to the best static decision
maker, who selects the same distribution over alternatives at every
time step.
Our results hold for a wide class of global cost functions. which
include the makespan and the L&lt;sub&gt;d&lt;/sub&gt; norms, for d&amp;gt;1.
In contrast, we show that for concave global cost functions, such as
L&lt;sub&gt;d&lt;/sub&gt; norms for d&amp;lt;1,
the worst-case average regret does not vanish.

In addition to the general calibration based algorithm, we provide
simple and efficient algorithms for special interesting cases.</summary>
<link rel="alternate" href="http://portal.acm.org/citation.cfm?doid=1536414.1536486"/>
<category term="Algorithms" label="Algorithms"/>
<author><name>Eyal Even-Dar, Yishay Mansour, Uri Nadav</name></author>
</entry>
<entry>
<title><![CDATA[Quantum Annealing for Variational Bayes Inference]]></title>
<updated>2009-06-24T12:05:36-08:00</updated>
<id>urn:googlelabs:35383</id>
<summary>Quantum Annealing for Variational Bayes Inference, Issei Sato, Kenichi Kurihara, Shu Tanaka, Seiji Miyashita, Hiroshi Nakagawa</summary>
<link rel="alternate" href="http://www.google.com/search?lr=&amp;ie=UTF-8&amp;oe=UTF-8&amp;q=Quantum+Annealing+for+Variational+Bayes+Inference+Sato+Kurihara+Tanaka+Miyashita+Nakagawa" type="text/html" title="Search for publication"/>
<category term="Machine Learning" label="Machine Learning"/>
<author><name>Issei Sato, Kenichi Kurihara, Shu Tanaka, Seiji Miyashita, Hiroshi Nakagawa</name></author>
</entry>
<entry>
<title><![CDATA[Quantum Annealing for Clustering]]></title>
<updated>2009-06-24T12:05:54-08:00</updated>
<id>urn:googlelabs:35382</id>
<summary>Quantum Annealing for Clustering, Kenichi Kurihara, Shu Tanaka, Seiji Miyashita</summary>
<link rel="alternate" href="http://www.google.com/search?lr=&amp;ie=UTF-8&amp;oe=UTF-8&amp;q=Quantum+Annealing+for+Clustering+Kurihara+Tanaka+Miyashita" type="text/html" title="Search for publication"/>
<category term="Machine Learning" label="Machine Learning"/>
<author><name>Kenichi Kurihara, Shu Tanaka, Seiji Miyashita</name></author>
</entry>
<entry>
<title><![CDATA[Sound Ranking Using Auditory Sparse-Code Representations]]></title>
<updated>2009-06-24T12:08:03-08:00</updated>
<id>urn:googlelabs:35269</id>
<summary>The task of ranking sounds from text queries is a
good test application for machine-hearing techniques, and particularly
for comparison and evaluation of alternative sound representations in
a large-scale setting.  We have adapted a machine-vision system,
``passive-aggressive model for image retrieval&#39;&#39;
(PAMIR), which
efficiently learns, using a ranking-based cost function, a linear
mapping from a very large sparse feature space to a large
query-term space.
Using this system allows us to focus on comparison of different
auditory front ends and different ways of extracting sparse features
from high-dimensional auditory images.  In addition to two main
auditory-image models, we also include and compare a family of more
conventional MFCC front ends.  The experimental results show a
significant advantage for the auditory models over vector-quantized MFCCs.
The two auditory models tested use the adaptive pole-zero filter
cascade (PZFC) auditory filterbank and sparse-code feature extraction
from stabilized auditory images via multiple vector quantizers. The
models differ in their implementation of the strobed temporal
integration used to generate the stabilized image. Using ranking
precision-at-top-k performance measures, the best results are about
70&amp;#37; top-1 precision and 35&amp;#37; average precision, using a test corpus
of thousands of sound files and a query vocabulary of hundreds of
words.</summary>
<link rel="alternate" href="http://www.google.com/search?lr=&amp;ie=UTF-8&amp;oe=UTF-8&amp;q=Sound+Ranking+Using+Auditory+Sparse-Code+Representations+Rehn+Lyon+Bengio+Walters+Chechik" type="text/html" title="Search for publication"/>
<category term="Audio and Speech Processing" label="Audio and Speech Processing"/>
<author><name>Martin Rehn, Richard F. Lyon, Samy Bengio, Thomas C. Walters, Gal Chechik</name></author>
</entry>
</feed>
