This paper presents an experimental system architecture for Part-Of-Speech Tagging for the Italian language, able to manage a large tagset to provide both lexical and morphological information. The tagger was built as a cascade of four classifiers where each classifier in the cascade accepts data from an initial input or the guesses of the previous one, executes its annotation, and sends the resulting data to the next stage, or to the output of the cascade. At the EVALITA 2009 PoS-tagging task the combined classifier attained an accuracy of 96.06% on the open task and of 93.54% on the closed one.
A multistage PoS-tagger at the EVALITA 2009 PoS-tagging Task
Zanoli, Roberto;Pianta, Emanuele
2009-01-01
Abstract
This paper presents an experimental system architecture for Part-Of-Speech Tagging for the Italian language, able to manage a large tagset to provide both lexical and morphological information. The tagger was built as a cascade of four classifiers where each classifier in the cascade accepts data from an initial input or the guesses of the previous one, executes its annotation, and sends the resulting data to the next stage, or to the output of the cascade. At the EVALITA 2009 PoS-tagging task the combined classifier attained an accuracy of 96.06% on the open task and of 93.54% on the closed one.File in questo prodotto:
Non ci sono file associati a questo prodotto.
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.