Can you break Google's secret?


L

Looking

The most successful search engine's secret algorithm can be calculated from
a sandbox/blackbox detection. Assuming we already know the secret, can you
write the code with it?

Case 1, especially for news.google.com, Google compares two documents in
stripped text format, and see how much of them in common. If they share 90%
similar things, then they are talking about the same thing.

here is the algorithm A, break down each document into key phrases and
keywords, ignore the most common ones, then compare the array of keywords
from both documents.
algorithm B is simpler, compare two documents as whole. Currently I am
trying to study how to complete the algorithm and codes for B.

For B, here is a test case, can you write a perl code for that? Just
comparing two strings, rather than long documents. Sudocodes are welcomed
too. Here is the sample input string.



Yahoo spends $160m to take on iTunes and co
VNUNet.com, UK - 6 hours ago
Yahoo is buying music software and services company Musicmatch for
$160m in cash as it prepares to make a splash in the digital music arena.
....

Yahoo to buy Musicmatch for $160 million
CNET News.com, United States - 18 hours ago
Yahoo said Tuesday that it is acquiring digital music specialist
Musicmatch for $160 million in cash. The Sunnyvale, Calif.-based ...

Yahoo Pays $160 Million for Online Music Store
NPR (audio) - Sep 15, 2004
Description: Yahoo joins the online music market by agreeing to buy
Musicmatch for $160 million. Analysts say Yahoo needs the deal ...

Yahoo! Tunes In to Musicmatch
Motley Fool - Sep 15, 2004
By Dave Marino-Nachison September 15, 2004. Close watchers of the
online music business no doubt noted yesterday's announcement of ...

Yahoo Plans to Purchase Online Music Seller
NPR (audio) - Sep 15, 2004
Description: Yahoo has agreed to buy the online music provider
Musicmatch for $160 million. Yahoo hopes to cash in on the expanding ...

Yahoo to enter music fray
San Francisco Chronicle, CA - Sep 15, 2004
Yahoo Inc. became the latest technology titan to jump into the
Internet music market Tuesday with a $160 million deal to buy pioneering ...

YAHOO: $160 million deal to add Musicmatch
Chicago Tribune (subscription), IL - Sep 15, 2004
SAN FRANCISCO -- Yahoo Inc. is buying online jukebox provider
Musicmatch Inc. for $160 million in a deal intended to broaden the ...

Yahoo buying Musicmatch to expand in digital music
Chicago Sun Times, IL - Sep 15, 2004
BY MICHAEL LIEDTKE. SAN FRANCISCO -- Yahoo is buying online jukebox
provider Musicmatch for $160 million in a deal designed to broaden ...

Yahoo boosts music portal
San Jose Mercury News (subscription), CA - Sep 15, 2004
By Dawn C. Chmielewski. Yahoo, seeking to add e-commerce to its
Internet music portal, will pay $160 million in cash to acquire Musicmatch
....

Yahoo moves in on music scene
News24 (subscription), South Africa - Sep 15, 2004
San Francisco - Yahoo is buying online jukebox provider Musicmatch Inc
for $160m in a deal designed to broaden the internet giant's appeal with the
growing ...

Yahoo! buys SD pioneer Musicmatch
San Diego Union Tribune, CA - Sep 15, 2004
By Kathryn Balint. Musicmatch, the San Diego online music pioneer that
was started in the homes of its three founders, was sold for ...

Yahoo banks on digital music
Miami Herald (subscription), FL - Sep 15, 2004
Yahoo's $160 million purchase of Musicmatch will give the Internet
company an online music store and a software program that manages digital
music. ...

Yahoo to Buy Online Jukebox Musicmatch
Los Angeles Times (subscription), CA - Sep 15, 2004
Yahoo Inc. agreed to buy online jukebox provider Musicmatch Inc. of
San Diego for $160 million in a deal designed to broaden the ...

Yahoo! to acquire Musicmatch for USD 160 mln
Telecom Paper (subscription), Netherlands - Sep 15, 2004
Yahoo! and Musicmatch, a provider of personalized music software and
services, have signed a definitive agreement under which Yahoo! ...

Portal eager to get in market for downloads
Houston Chronicle, TX - Sep 14, 2004
By MICHAEL LIEDTKE. SAN FRANCISCO - Yahoo is buying online jukebox
provider Musicmatch for $160 million in a deal designed to broaden ...

Yahoo! and Musicmatch: A Hot Duet?
BusinessWeek - Sep 14, 2004
The $160 million deal signals that Semel & Co. will mount the biggest
challenge yet to Apple's dominance of online music. Since grabbing ...

Yahoo! finds its MusicMatch: $160M purchase puts song in its heart
Boston Herald, United States - Sep 14, 2004
By Greg Gatlin. Yahoo! Inc. put its headphones on yesterday and danced
into the middle of a crowded music download arena with a $160 ...

Yahoo to Buy Online Music Seller for $160 Million
New York Times, NY - Sep 14, 2004
By SAUL HANSELL. na move to expand its role in the online music
business, Yahoo announced yesterday that it would buy Musicmatch, a ...

Yahoo Inc. makes a Musicmatch
Minneapolis Star Tribune (subscription), MN - Sep 14, 2004
In yet another sign that online music is a business with big
potential, Internet portal Yahoo Inc. has agreed to acquire online music
seller Musicmatch Inc. ...

Yahoo enters the online music mix
Red Herring - Sep 14, 2004
The battle for online music audiences gained yet another heavyweight
competitor on Tuesday, as Internet giant Yahoo announced its acquisition of
MusicMatch, a ...

Yahoo! expands in online music
Guardian, UK - Sep 14, 2004
Yahoo! yesterday became the latest big name to make its presence felt
in the burgeoning market for online music. The internet portal ...

Yahoo buys online music outfit for US$160m
Straits Times, Singapore - Sep 14, 2004
SAN FRANCISCO - Yahoo is buying online jukebox provider Musicmatch for
US$160 million (S$271.55 million) in a deal designed to broaden the Internet
giant's ...

Yahoo to buy Musicmatch
Stuff.co.nz, New Zealand - Sep 14, 2004
NEW YORK: Internet company Yahoo says it will buy privately held music
software company Musicmatch for about $US160 million in a bid to bolster its
online ...

Tech Brief: Yahoo puts money into jukebox
International Herald Tribune, France - Sep 14, 2004
Yahoo is buying the online jukebox provider Musicmatch for $160
million in a deal designed to broaden the Internet giant's appeal to the
growing audience of ...

Internet stocks nudge techs higher
CNN/Money - Sep 14, 2004
Yahoo! jumps into the music download biz while Ask Jeeves, Amazon
soar; Oracle beats after the bell. NEW YORK (CNN/Money) - Bucking ...

Yahoo to Buy Musicmatch Web Service for $160 Million (Update6)
Bloomberg - Sep 14, 2004
Sept. 14 (Bloomberg) -- Yahoo! Inc., owner of the world's most-visited
Internet site, will buy Musicmatch Inc. for $160 million ...

Yahoo to Buy Musicmatch for $160 Million
ABC News, United States - Sep 14, 2004
NEW YORK (Reuters) - Yahoo Inc. said on Tuesday it will buy music
software company Musicmatch Inc. for about $160 million in cash ...

Yahoo to Buy Musicmatch for $160 Million
ABC News, United States - Sep 14, 2004
Sept. 14, 2004 ¡ª NEW YORK (Reuters) - Yahoo Inc. said on Tuesday it
will buy music software company Musicmatch Inc. for about ...

Yahoo's long and winding music road
ZDNet.com - Sep 14, 2004
This fourth priority's main focus has been improving or obtaining CRM
and ERP software for the past year and a half. With Tuesday's ...

Yahoo Plans To Acquire Musicmatch
InternetWeek.com - Sep 14, 2004
By Antone Gonsalves, TechWeb News. Yahoo Inc. on Tuesday said it has
agreed to acquire Musicmatch Inc. for about $160 million in ...
 
Ad

Advertisements

T

Tassilo v. Parseval

Also sprach Looking:
The most successful search engine's secret algorithm can be calculated from
a sandbox/blackbox detection. Assuming we already know the secret, can you
write the code with it?

Case 1, especially for news.google.com, Google compares two documents in
stripped text format, and see how much of them in common. If they share 90%
similar things, then they are talking about the same thing.
here is the algorithm A, break down each document into key phrases and
keywords, ignore the most common ones, then compare the array of keywords
from both documents.
algorithm B is simpler, compare two documents as whole. Currently I am
trying to study how to complete the algorithm and codes for B.

Easy:

print "Similar" if $text1 eq $text2;

Is that what you meant with 'compare two documents as whole'?
For B, here is a test case, can you write a perl code for that? Just
comparing two strings, rather than long documents. Sudocodes are welcomed
^^^^^^^^^
What's that? Code for the superuser?
too. Here is the sample input string.

[...]

This group isn't meant as a place to discharge your quizzes of the week.
Have you tried anything so far yourself?

Tassilo
 
Ad

Advertisements

G

Gregory Toomey

Looking said:
The most successful search engine's secret algorithm can be calculated
from a sandbox/blackbox detection.

Your whole post is nonsense & has little to do with Perl.

gtoomey
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top