Hierarchical Multi-Label Classification of Social Text Streams
Tuesday, September 9, 2014, 1:30-2:30 pm

Hierarchical multi-label classification assigns a document to multiple hierarchical classes. In this paper we focus on hierarchical multi-label classification of social text streams. Concept drift, complicated relations among classes, and the limited length of documents in social text streams make this a challenging problem. Our approach includes three core ingredients: short document expansion, time-aware topic tracking, and chunk-based structural learning. We extend each short document in social text streams to a more comprehensive representation via state-of-the-art entity linking and sentence ranking strategies. From documents extended in this manner, we infer dynamic probabilistic distributions over topics by dividing topics into dynamic "global" topics and "local" topics. For the third and final phase we propose a chunk-based structural optimization strategy to classify each document into multiple classes. Extensive experiments conducted on a large real-world dataset show the effectiveness of our proposed method for hierarchical multi-label classification of social text streams.


Zhaochun Ren is a PhD candidate in ISLA, University of Amsterdam. His supervisor is Prof. Dr. Maarten de Rijke. His research interests focus on information retrieval, text mining, and social media mining. Before joining UvA, Zhaochun received his B.E. and M.E. from Shandong University, China, in 2009 and 2012, respectively.

This talk is organized by Jimmy Lin