Extracting Learning Objects from Web Pages
Abstract
In this paper, we propose a semi-automatic tool to build a learning object repository from HTML pages. Our extraction method which consists of applying the user-defined patterns defined in a configuration file using a pre-defined ontology, tries also to discover partially matched objects and to assign them proper tags conformed to an XML schema. An important part of the extraction process uses hypertext links in order to help the discovery of semantic and structural links between learning objects.