Contextual ontological concepts extraction - CentraleSupélec
Conference paper, year: 2006

Contextual ontological concepts extraction

Lobna Karoui
  • Role: Author
Nacéra Bennacer Seghouani

Abstract

Ontologies provide a common layer that plays a major role in supporting information exchange and sharing. In this paper, we focus on the process of extracting ontological concepts from HTML documents. We propose an unsupervised hierarchical clustering algorithm, “Contextual Ontological Concept Extraction” (COCE), which applies a partitioning algorithm incrementally and is guided by a structural context. This context exploits the HTML structure and the location of words to select the semantically closest co-occurrents for each word and to improve word weighting. Guided by this context definition, we perform an incremental clustering that refines the word context of each cluster to extract semantically coherent concepts. The COCE algorithm can be run either automatically or interactively. We evaluate the COCE algorithm on French documents related to tourism. Our results show that our context-based algorithm improves the conceptual quality of the clusters.
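The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' COCE implementation: the tag weights, the two-seed partition step, the `max_size` stopping rule, and all function names are assumptions chosen only to show how a structural context could weight co-occurrences and how a partitioning step could be applied incrementally to build a hierarchy.

```python
from collections import defaultdict
from math import sqrt

# Hypothetical tag weights (an assumption, not taken from the paper): words
# sharing a title or heading count as closer than words sharing a paragraph.
TAG_WEIGHTS = {"title": 3.0, "h1": 2.0, "p": 1.0}


def context_vectors(blocks):
    """Build weighted co-occurrence vectors from (tag, text) blocks."""
    vectors = defaultdict(lambda: defaultdict(float))
    for tag, text in blocks:
        weight = TAG_WEIGHTS.get(tag, 1.0)
        words = sorted(set(text.lower().split()))
        for w in words:
            for other in words:  # includes w itself, so no vector is empty
                vectors[w][other] += weight
    return {w: dict(v) for w, v in vectors.items()}


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(a[k] * b.get(k, 0.0) for k in a)
    norm = sqrt(sum(x * x for x in a.values())) * sqrt(sum(x * x for x in b.values()))
    return dot / norm if norm else 0.0


def split(words, vectors):
    """Partition a cluster in two around its two least similar words."""
    members = set(words)
    # Refine each word's context to the co-occurrents inside this cluster.
    local = {w: {k: v for k, v in vectors[w].items() if k in members} for w in words}
    pairs = [(cosine(local[a], local[b]), a, b)
             for i, a in enumerate(words) for b in words[i + 1:]]
    _, seed_a, seed_b = min(pairs)
    left, right = [seed_a], [seed_b]
    for w in words:
        if w in (seed_a, seed_b):
            continue
        closer_to_a = cosine(local[w], local[seed_a]) >= cosine(local[w], local[seed_b])
        (left if closer_to_a else right).append(w)
    return left, right


def coce_sketch(words, vectors, max_size=3):
    """Re-partition clusters until each holds at most max_size words."""
    words = sorted(words)
    if len(words) <= max(max_size, 1):
        return [words]
    left, right = split(words, vectors)
    return coce_sketch(left, vectors, max_size) + coce_sketch(right, vectors, max_size)


# Toy French tourism snippets, invented for illustration only.
blocks = [
    ("title", "hotel chambre"),
    ("p", "hotel chambre restaurant"),
    ("h1", "plage mer"),
    ("p", "plage mer soleil"),
]
vectors = context_vectors(blocks)
clusters = coce_sketch(vectors.keys(), vectors, max_size=3)
print(clusters)
```

On this toy input the lodging words and the seaside words end up in separate clusters. An interactive mode, as mentioned in the abstract, could be approximated by asking the user whether to split a cluster instead of relying on `max_size`.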
File not deposited

Dates and versions

hal-00259905, version 1 (29-02-2008)

Identifiers

  • HAL Id: hal-00259905, version 1

Cite

Lobna Karoui, Nacéra Bennacer Seghouani, Marie-Aude Aufaure. Contextual ontological concepts extraction. Ninth International Conference on Discovery Science (DS-2006) and in Lecture Notes in Artificial Intelligence, Oct 2006, Barcelona, Spain. pp.306-310. ⟨hal-00259905⟩