Thread: SEO Future?
11-20-2007, 08:18 AM
Eugene04
Registered User
 
Join Date: Nov 2007
Location: Belarus
Posts: 3
This method aims to ban spammy sites: "A computer implemented method for identifying spam documents in an information retrieval system, the method comprising: maintaining a list of phrases, each phrase associated with a list of related phrases; determining a number of related phrases expected to be present in a document for any phrase on the list of phrases; determining for a document, and for at least one phrase in the document, an actual number of related phrases present in the document; and identifying the document as a spam document by comparing the actual number of related phrases present in the document with the expected number of related phrases."
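To make the quoted claim a bit more concrete, here is a rough Python sketch of the comparison it describes. Everything specific in it is my own assumption, not something from the patent text: the phrase list `RELATED`, the expected counts in `EXPECTED`, the `max_ratio` threshold, and the idea that a page is flagged when it has far more related phrases than expected. The claim only says the two numbers are compared.

```python
# Hypothetical phrase list: each phrase maps to its related phrases.
RELATED = {
    "australian shepherd": {
        "blue merle",
        "red merle",
        "agility training",
        "herding dog",
    },
}

# Hypothetical expected number of related phrases per phrase.
EXPECTED = {"australian shepherd": 2}


def count_related(text: str, phrase: str) -> int:
    """Count how many related phrases of `phrase` occur in `text`."""
    lowered = text.lower()
    return sum(1 for related in RELATED[phrase] if related in lowered)


def is_spam(text: str, max_ratio: float = 1.5) -> bool:
    """Flag a document whose actual related-phrase count exceeds the
    expected count by more than `max_ratio` (an assumed threshold)."""
    lowered = text.lower()
    for phrase in RELATED:
        if phrase not in lowered:
            continue  # only score phrases that appear in the document
        actual = count_related(text, phrase)
        if actual > EXPECTED[phrase] * max_ratio:
            return True
    return False


normal_page = "Our Australian Shepherd is a blue merle who loves the park."
stuffed_page = (
    "Australian Shepherd blue merle red merle agility training "
    "herding dog Australian Shepherd blue merle"
)
print(is_spam(normal_page))   # one related phrase, below the threshold
print(is_spam(stuffed_page))  # four related phrases, above the threshold
```

A real system would presumably learn the expected counts from a large corpus instead of hard-coding them, and would match phrases on tokens, not plain substrings.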
And as for ranking: "The scoring algorithm for pre-ranking the documents may be the same underlying relevance scoring algorithm used in the search system 120 to generate a relevance score. In one embodiment, the IR score is based on the page rank algorithm, as described in U.S. Pat. No. 6,285,999. Alternatively or additionally, statistics for a number of IR-relevant attributes of the document, such as the number of inlinks, outlinks, document length, may also be stored, and used alone or in combination in order to rank the documents."
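The "alone or in combination" part of that quote could look something like the sketch below. This is only an illustration under my own assumptions: the `DocStats` fields mirror the attributes named in the quote, but the weights, the log scaling and the function names are invented here, and the patent does not say how the attributes are actually combined.

```python
import math
from dataclasses import dataclass


@dataclass
class DocStats:
    """Per-document statistics named in the quote."""
    doc_id: str
    inlinks: int
    outlinks: int
    length: int  # document length, e.g. in words


def pre_rank_score(doc: DocStats,
                   w_in: float = 1.0,
                   w_out: float = 0.1,
                   w_len: float = 0.01) -> float:
    """Combine the attributes into one score.

    The weights and the log1p damping are assumptions for illustration.
    """
    return (w_in * math.log1p(doc.inlinks)
            + w_out * math.log1p(doc.outlinks)
            + w_len * math.log1p(doc.length))


def pre_rank(docs: list[DocStats]) -> list[DocStats]:
    """Order documents from highest to lowest combined score."""
    return sorted(docs, key=pre_rank_score, reverse=True)


docs = [
    DocStats("b", inlinks=2, outlinks=50, length=5000),
    DocStats("a", inlinks=100, outlinks=5, length=1000),
]
print([d.doc_id for d in pre_rank(docs)])
```

With these made-up weights the inlink count dominates, so the heavily linked page comes first even though the other one is longer and has more outlinks.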