This paper proposes an approach to full parsing approximation suitable for Information Extraction from texts. Sequences of cascades of finite-state rules deterministically analyze the text, building unambiguous structures. Initially basic chunks are analyzed; then clauses are recognized and nested; finally modifier attachment is performed and the global parse tree is built. The approach has been extensively proven to work mainly for Italian, but it was also tested for English and Russian. A parser based on such approach has been implemented as part of Pinocchio, an environment for developing and running IE applications

Full Parsing Approximation, Finite-State Cascades and Grammar Organization for Information Extraction

Lavelli, Alberto;
1999-01-01

Abstract

This paper proposes an approach to full parsing approximation suitable for Information Extraction from texts. Sequences of cascades of finite-state rules deterministically analyze the text, building unambiguous structures. Initially basic chunks are analyzed; then clauses are recognized and nested; finally modifier attachment is performed and the global parse tree is built. The approach has been extensively proven to work mainly for Italian, but it was also tested for English and Russian. A parser based on such approach has been implemented as part of Pinocchio, an environment for developing and running IE applications
1999
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/1849
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact