Bing, their response to allegations of copying their Google results. I don't think Bing were their response as clear as they could.
Today I'm going to write a post, I think that would've written by Bing. If you want to clear – I'm absolutely no relationship with any of these companies (using gmail, adsense and analytics), I'm only insert it here from my view of the situation. After that, I'll get back to blogging about things cool site and stop ruining My chances of ever working for Google ...
At the beginning, you can search the text on the page to determine whether this page relevant to the query. This works well, in particular in the event that you consider the length of the page in relation to all the other documents in the collection and the relative density of keywords on the page. This method of retrieving the information is in scientific circles, known as the TFxIDF.
The trouble with the TFxIDF on the basis of search is that it never designed to deal with people trying to intentionally manipulate search results. By selecting carefully the specific keywords in the document, you can quite easily manipulate this "bag of words" access to the search.
In 1998 Larry Page and Sergey Brin is to demonstrate the power of using the number of inbound links on a Web page has as the signal to determine the quality of this page. This is called "pagerank" in their most important book "The Anatomy of a large-scale Hypertextual Web search engine", and was created on the basis of their search engine Google.
A great side effect of this kind of connection graph based approach is that it shall adopt the measures how relevant a page is outside the control of the author of this page in the order is determined by how many other people link, which is a much harder game (even impossible).
In the years after the advent of the Google search site taken each other some form of PageRank and dramatically improve the quality of search results at a flat rate.
Today can build enough search engine indexing lots of pages, only the calculation of the chart, the connection between them and the combination of measures with data on the TFxIDF incoming links.
But to actually search engine is the head and shoulders above the competition, you can start using the other signal. For example, Google recently announced that they were starting to look at the speed of the page as a signal of quality. Bing with thousands of signals; incoming connections, the number of tweets, sentiment analysis, link the data sharing and so on.
Bing is the motivation was: "we want to see good quality links to Bing. How people express the quality of the link? By clicking on them! ". So for people who have subscribed to the toolbar, Bing, Microsoft began to collect information about the links you have clicked. This profile of anonymised and sent encrypted, so nobody MS could spy on their users. So that the association between the pages has been created – not only that the page refers to 5 other pages, but also that these 5 pages are more frequently clicked the Association. This gives Bing the relationship between the link displayed on the page and on the page the link points to the Bing was not this. for each website, whether on a blog or a search engine, shopping site. Bing gathered information as links people were clicking on and used, as one of the signals to determine the page quality – the theory is that the quality of the pages get more clicks. Other Web users also about this great is that as PageRank, takes over control of the evaluation of the search by author-it has been decided in order, so theoretically means less spam and more relevant, high quality, search engine.
When Google their experiment, they created a special page on their site, which contains the words that exist nowhere else on the Web, then you have installed Bing tools and clicked on the link of "synthetic" pages. Bing toolbar sent MS data, much like a piece for all pages and their system incorporated other signal as well as normal "data.
But what meant was that when Google then went Bing and were looking for these words, all the other signal if you have any input data only if the Bing from their "signal, and so it was, what is used in the system. Not surprisingly, the Bing returned on the same pages as well as Google, because all the data that existed in the world this query! Bing didn't collect data because of Google, Bing collected it Bing toolbar users.
Microsoft wrote a neat little tool that allows you to collect from any site you've visited and Google crier announced, because he also worked on their websites. If they had spoken to Bing first without making all these allegations, I think that the whole situation could have been the target. But even so – now Bing is known that the "data can be taken advantage of such as this, Bing will be looking into the deeply how best to stop this happening.
So this is what you have seen I liked means Bing in its reply to Google. The fact that they were not as clear-cut in their refusal as would suggest that the picture is not completely correct, the event or that Bing had other things, but wanted to communicate in the message.
OK, I think the end to this problem is now – still tired from the wrong information in the sequence of notes everywhere on this topic.
Update: Oh boy. Matt Cutts recently wrote a fantastic blog some more evidence. It is necessary to swallow my pride here and say: it really looks like Bing, they have been specifically aimed at Google.
Matt refers to a research paper from Microsoft, which contains a damning evidence:
We have a "reverse" parameters from the URL of the meeting [formulation query] and see how each search engine encodes the query and the fact that the user received the URL by clicking on a proposal from the spelling query
The book is available here: Learning Phrase-Based spelling error models of the ratio of the Data.
My apologies Matt and Google. I think that you originally submitted would not suffice to support the conclusion of the searched and therefore my blog post. But this Revelation we strongly warns about Bing intentionally and specifically aimed at the pages of Google, which is very clearly copying from Google. Shame you, I am Bing, how could you have that it was OK to do?
Related posts:
This entry was posted on 3. February 2011, 10: 49 PM and is filed under web. You can do all of the responses to this entry through the RSS 2.0. You can leave a response, or trackback from your own page.