PUMA
Istituto di Scienza e Tecnologie dell'Informazione     
Muntean C. I., Morar G. A., Moldovan D. Exploring the meaning behind Twitter hashtags through clustering. Witold Abramowicz, John Domingue, Krzysztof Węcel (eds.). (Lecture Notes in Business Information Processing, vol. 127). Heidelberg: Springer, 2012.
 
 
Abstract
(English)
Social networks are generators of large amount of data produced by users, who are not limited with respect to the content of the information they exchange. The data generated can be a good indicator of trends and topic preferences among users. In our paper we focus on analyzing and representing hashtags by the corpus in which they appear. We cluster a large set of hashtags using K-means on map reduce in order to process data in a distributed manner. Our intention is to retrieve connections that might exist between different hashtags and their textual representation, and grasp their semantics through the main topics they occur with.
URL: http://link.springer.com/chapter/10.1007%2F978-3-642-34228-8_22?LI=true
DOI: 10.1007/978-3-642-34228-8_22
Subject k-means
Clustering
Hashtag
Twitter
1.2.7 Natural Language Processing
68T50 Natural language processing


Icona documento 1) Download Document PDF


Icona documento Open access Icona documento Restricted Icona documento Private

 


Per ulteriori informazioni, contattare: Librarian http://puma.isti.cnr.it

Valid HTML 4.0 Transitional