
Presenter: Marc Spaniol

Abstract

Organizations like the Internet Archive have been capturing Web content for decades. This time-versioned content is a gold mine for analysts conducting longitudinal studies. One example application is tracking and analyzing a politician's public appearances over a decade. The LAWA project develops methods and tools for time-travel indexing and querying, entity detection and tracking along the time axis, and advanced analysis and knowledge discovery. For scalability, we rely on Hadoop-based distributed computation. We are also preparing reference data and will provide analytics services. We will offer a user workshop in late 2011 to present these capabilities and explore interesting use cases.