ICADL 2007 - LNCS 4822
   

Blog Classification Using Tags: An Empirical Study*

Aixin Sun1, Maggy Anastasia Suryanto1, and Ying Liu2

1Nanyang Technological University, Singapore
axsun@ntu.edu.sg

2Hong Kong Polytechnic University, Hong Kong, China
mfyliu@polyu.edu.hk

Abstract. With the exponential growth of weblogs (or blogs), many blog directories have appeared to help users locate topical blogs. As tags are commonly used to describe blogs, we study the effectiveness of tags in blog classification. Our experiments on 24,247 blogs showed that tags could yield better classification accuracy than titles and descriptions. Interestingly, more tags did not necessarily lead to better classification accuracy. To better describe blogs, we also propose a tag expansion algorithm that assigns a blog additional tags that often co-occur with those already associated with it. Our experiments showed that tag expansion improved the recall of blog classification at the cost of reduced precision.
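The abstract does not give the details of the tag expansion algorithm, so the following is only a minimal sketch of the general idea of expanding a blog's tags by co-occurrence. The function names (`cooccurrence_counts`, `expand_tags`), the conditional co-occurrence score, and the `min_conf` and `max_new` parameters are illustrative assumptions, not the authors' method.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(blog_tags):
    """Count how many blogs carry each tag and each ordered tag pair."""
    pair_counts = Counter()
    tag_counts = Counter()
    for tags in blog_tags:
        unique = set(tags)
        tag_counts.update(unique)
        for a, b in combinations(sorted(unique), 2):
            pair_counts[(a, b)] += 1
            pair_counts[(b, a)] += 1
    return pair_counts, tag_counts


def expand_tags(tags, pair_counts, tag_counts, min_conf=0.5, max_new=3):
    """Append up to max_new tags that often co-occur with the given tags.

    Assumed score: the fraction of blogs tagged `a` that are also tagged `b`,
    maximised over the blog's existing tags `a`. Hypothetical choice.
    """
    existing = set(tags)
    scores = {}
    for (a, b), n in pair_counts.items():
        if a in existing and b not in existing:
            conf = n / tag_counts[a]
            scores[b] = max(scores.get(b, 0.0), conf)
    # Keep candidates above the threshold, best score first, ties by name.
    candidates = sorted(
        (t for t, s in scores.items() if s >= min_conf),
        key=lambda t: (-scores[t], t),
    )
    return list(tags) + candidates[:max_new]


# Toy corpus, invented for illustration only.
corpus = [
    ["python", "programming"],
    ["python", "programming", "web"],
    ["python", "programming"],
    ["travel", "photo"],
]
pair_counts, tag_counts = cooccurrence_counts(corpus)
expanded = expand_tags(["python"], pair_counts, tag_counts)
```

Lowering `min_conf` admits more weakly related tags. This mirrors the trade-off reported in the abstract: a richer description can help recall, while loosely related tags can hurt precision.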

*This research is supported by grant SUG7/06, Nanyang Technological University, Singapore.

LNCS 4822, p. 307 ff.



© Springer-Verlag Berlin Heidelberg 2007