Istituto di Informatica e Telematica     
La Polla M. N., Gazzè D., Marchetti A., Tesconi M. CAPER: Crawling and Analysing Facebook for Intelligence Purposes. In: FOSINT - International Symposium on Foundations of Open Source Intelligence and Security Informatics (Beijing (China), 17-20 August 2014). Proceedings, pp. 665-669. IEEE, 2014.
Organised crime uses information technology systems to communicate, work or expand its influence. The Collaborative information, Acquisition, Processing, Exploitation and Reporting for the prevention of organised crime (CAPER) Project, created in cooperation with European Law Enforcement Agencies (LEAs), aims to build a common collaborative, information-sharing platform that exploits Open Source Intelligence (OSINT) to detect and prevent organised crime. LEAs are becoming more inclined to use OSINT tools, particularly tools able to manage data from Online Social Networks (OSNs). This paper presents the CAPER Facebook crawling and analysis subsystem. Heuristic algorithms have been implemented to extract specific properties of Facebook's social graph, in particular user interactions. To support analysis tasks, extensive effort has been spent on analysing textual user-generated content and on recognising named entities, in particular person names, locations and organisations. Relationships between users and the entities mentioned in posts and related comments are created and merged into the user networks extracted from the social graph. Finally, all entity relationships are visualised in user-friendly network graphs.
DOI: 10.1109/ASONAM.2014.6921656
Subject: Natural Language Processing
Open Source Intelligence (OSINT)
Social Network Analysis
I.2.7 Natural Language Processing; multilingual text annotation; semantic text; Online Social Networks

For further information, contact: Librarian http://puma.isti.cnr.it
