Data Extraction From Wikis

From AIRWiki

Jump to: navigation, search
Use of ontologies for the extraction of structured data from wikis
Short Description: Development of a Java application for the extraction of data from wikis and their reorganization inside an ontology
Coordinator:
Tutor: DavideEynard (eynard@elet.polimi.it)
Collaborator:
Students: CarloMiglierina (carlo.miglierina@gmail.com)
Research Area: Social Software and Semantic Web
Research Topic:
Start: 2008/10/28
End: 2009/09/5
Status: Closed
Level: Bs
Type: Thesis

Contents

Part 1: project profile

Project name

Use of ontologies for the extraction of structured data from wikis

Project short description

Wikipedia is the largest and most known example of wiki. There is a lot of information inside wikis that are built using its same technology, and a lot of users who create and edit their pages. But these free encyclopedias have a disadvantage: data is not structured and so it is not possible to do advanced researches. Moreover, computers cannot process these data. The aim of this project is to create a Java application that extracts semi-structured data from wiki templates and infoboxes and puts them inside an ontology, in order to have structured data. Using the ontology it is possible to do advanced researches, as computers can process these data. As an example, this application has been used to organize data about the characters of "The lord of the rings".

Dates

Start date: 2008/10/28

End date: 2009/09/05

People involved

Project Advisor

Davide Eynard

Students

Students currently working on the project

Carlo Miglierina

Part 2: project description

Project Documentation (in Italian)

Project Presentation (in italian)