Detecting Macro-patterns in the European Mediasphere

Turchi, Marco
2009-01-01

Abstract

The analysis of the contents of news outlets has been the focus of social scientists for a long time. However, content analysis is often performed on hand-coded documents, which limits the size of the data accessible to the investigation and consequently limits the possibility of detecting macro-trends. The use of text categorisation, clustering and statistical machine translation (SMT) enables us to operate automatically on vast amounts of news items, and consequently to analyse patterns in the content of outlets in different languages, over long time periods. We report on experiments involving hundreds of European media in 22 different languages, demonstrating how it is possible to detect similarities and differences between outlets, and between countries, based on the contents of their articles.
2009
978-1-4244-5331-3
Use this identifier to cite or link to this document: https://hdl.handle.net/11582/307919