knoWitiary: A Machine Readable Incarnation of Wiktionary

Nastase, Viviana Antonela;Strapparava, Carlo
2015-01-01

Abstract

knoWitiary is a resource that presents a reorganized version of Wiktionary’s information in machine-readable format. Wiktionary contains a plethora of information about words, including sense definitions, etymology, translations, derived terms and anagrams. Work similar to that reported here goes one step further than extracting information from Wiktionary: it maps that information onto WordNet, the NLP community’s de facto gold standard. Lexical and relation overlap shows that Wiktionary provides different types of information from WordNet, which implies that much is discarded in such a mapping. We make a case here for making space for “pure” resources alongside mapped ones, to preserve the unique information that idiosyncratic resources such as Wiktionary provide. This information may open up new avenues to explore for tasks that require varied and “unorthodox” information about words.
Files in this record:
JCL_2015.pdf (not available)
Type: Pre-print
License: DRM not defined
Size: 218.39 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11582/306377