Organizing Web Search Results Using Clustering By Compression - CentraleSupélec Accéder directement au contenu
Communication Dans Un Congrès Année : 2007

Organizing Web Search Results Using Clustering By Compression

Frédéric Boulanger
Dorin Carstoiu
  • Fonction : Auteur
Alexandra Cernian
  • Fonction : Auteur
Stefan Bodea
  • Fonction : Auteur

Résumé

Current Web search engines return long lists of ranked documents that users are forced to sift through to find relevant documents. This paper introduces an interactive presentation method of the search results, based on the notion of clustering by compression. Compression algorithms allow defining a similarity measure based on the degree of common information. Clustering methods allow clustering similar data without any previous knowledge. For this work, we have developed a Java application which retrieves the first 50 results returned by the Google search engine in response to a query, applies some text processing techniques to the Web documents, and thirdly applies the clustering by compression algorithm. The result is a binary tree enhancing the visualization of the formed clusters.
Fichier non déposé

Dates et versions

hal-00232719 , version 1 (01-02-2008)

Identifiants

  • HAL Id : hal-00232719 , version 1

Citer

Frédéric Boulanger, Géraldine Polaillon, Dorin Carstoiu, Alexandra Cernian, Stefan Bodea. Organizing Web Search Results Using Clustering By Compression. IADIS INTERNATIONAL CONFERENCE e-SOCIETY 2007, Mar 2007, Lisbon, Portugal. pp.419-423. ⟨hal-00232719⟩
63 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More