This document describes an XML model for representing the information of WordNet (Fellbaum, 1998), a lexical database. The main reason for this work is that, given its recent wide diffusion, XML seems to be a good candidate for a common interchange format for people developing wordnets, or portions of them, in several languages. This way, an XML schema may greatly facilitate the import and export of wordnet-like data from one database to another, independently of the actual implementation of the database itself. In addition, the XML validation capabilities The XML model proposed in this report has been used as interchange format in the TAL – Italwordnet project (Roventini et al. 2000). The Italian wordnet developed in TAL builds from the Italian wordnet developed within the EuroWordNet project (Vossen, 1998), from which it inherits (with some extensions for adjectives) the underlying model. Initially, the XML schema was adopted to convert EuroWordNet data available in the Polaris export format into the TAL import format
An XML schema for WordNet
Magnini, Bernardo;Girardi, Christian
2001-01-01
Abstract
This document describes an XML model for representing the information of WordNet (Fellbaum, 1998), a lexical database. The main reason for this work is that, given its recent wide diffusion, XML seems to be a good candidate for a common interchange format for people developing wordnets, or portions of them, in several languages. This way, an XML schema may greatly facilitate the import and export of wordnet-like data from one database to another, independently of the actual implementation of the database itself. In addition, the XML validation capabilities The XML model proposed in this report has been used as interchange format in the TAL – Italwordnet project (Roventini et al. 2000). The Italian wordnet developed in TAL builds from the Italian wordnet developed within the EuroWordNet project (Vossen, 1998), from which it inherits (with some extensions for adjectives) the underlying model. Initially, the XML schema was adopted to convert EuroWordNet data available in the Polaris export format into the TAL import formatI documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.