This paper describes SIE (Simple Information Extraction), a modular information extraction system designed with the goal of being easily and quickly portable across tasks and domains. SIE is composed bya general purpose machine learning algorithm(SVM) combined with several customizable modules. A crucial role in the architecture is played by Instance Filtering,which allows to increase efficiency without reducing effectiveness. The results obtained by SIE on several standard data sets, representative of different tasks and domains, are reported. The experiments show that SIE achieves performance close to the best systems in all tasks, without using domain-specific knowledge.
Simple Information Extraction (SIE): A Portable and Effective IE System
Giuliano, Claudio;Lavelli, Alberto;Romano, Lorenza
2006-01-01
Abstract
This paper describes SIE (Simple Information Extraction), a modular information extraction system designed with the goal of being easily and quickly portable across tasks and domains. SIE is composed bya general purpose machine learning algorithm(SVM) combined with several customizable modules. A crucial role in the architecture is played by Instance Filtering,which allows to increase efficiency without reducing effectiveness. The results obtained by SIE on several standard data sets, representative of different tasks and domains, are reported. The experiments show that SIE achieves performance close to the best systems in all tasks, without using domain-specific knowledge.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.