Istituto di Scienza e Tecnologie dell'Informazione     
Muntean C. I., Morar G. A., Moldovan D. Exploring the meaning behind Twitter hashtags through clustering. Witold Abramowicz, John Domingue, Krzysztof Węcel (eds.). (Lecture Notes in Business Information Processing, vol. 127). Heidelberg: Springer, 2012.
Social networks are generators of large amount of data produced by users, who are not limited with respect to the content of the information they exchange. The data generated can be a good indicator of trends and topic preferences among users. In our paper we focus on analyzing and representing hashtags by the corpus in which they appear. We cluster a large set of hashtags using K-means on map reduce in order to process data in a distributed manner. Our intention is to retrieve connections that might exist between different hashtags and their textual representation, and grasp their semantics through the main topics they occur with.
URL: http://link.springer.com/chapter/10.1007%2F978-3-642-34228-8_22?LI=true
DOI: 10.1007/978-3-642-34228-8_22
Subject k-means
1.2.7 Natural Language Processing
68T50 Natural language processing

Icona documento 1) Download Document PDF

Icona documento Open access Icona documento Restricted Icona documento Private


Per ulteriori informazioni, contattare: Librarian http://puma.isti.cnr.it

Valid HTML 4.0 Transitional