The new frontier of research on Information Extraction from texts is portability without any knowledge of Natural Language Processing. The market potential is very large in principle, provided that a suitable easy-to-use and effective methodology is provided. In this paper we describe LearningPinocchio, a system for adaptive Information Extraction from texts based on the idea above that is having good commercial and scientific success. Real world applications have been built and evaluation licenses have been released to external companies for application development. In this paper we present a number of applications developed with it and report about an evaluation performed by an independent company. Finally we discuss the suitability of the IE technology behind the system with respect to the requirements mentioned in the introduction and draw some conclusion
LearningPinocchio: Adaptive Information Extraction for Real World Applications
Lavelli, Alberto
2002-01-01
Abstract
The new frontier of research on Information Extraction from texts is portability without any knowledge of Natural Language Processing. The market potential is very large in principle, provided that a suitable easy-to-use and effective methodology is provided. In this paper we describe LearningPinocchio, a system for adaptive Information Extraction from texts based on the idea above that is having good commercial and scientific success. Real world applications have been built and evaluation licenses have been released to external companies for application development. In this paper we present a number of applications developed with it and report about an evaluation performed by an independent company. Finally we discuss the suitability of the IE technology behind the system with respect to the requirements mentioned in the introduction and draw some conclusionI documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.