Are Semantically Coherent Topic Models Useful for Ad Hoc Information Retrieval?
Romain Deveaud, Eric SanJuan and Patrice Bellot
The 51st Annual Meeting of the Association for Computational Linguistics - Short Papers (ACL Short Papers 2013)
Sofia, Bulgaria, August 4-9, 2013
Current topic modeling approaches for Information Retrieval do not allow query-oriented latent topics to be modeled explicitly. Moreover, the semantic coherence of the topics has never been considered in this field. We propose a model-based feedback approach that learns Latent Dirichlet Allocation topic models on the top-ranked pseudo-relevant feedback documents, and we measure the semantic coherence of those topics. We perform a first experimental evaluation using two major TREC test collections. Results show that retrieval performance tends to be better when using topics with higher semantic coherence.
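As an illustration of what measuring the semantic coherence of a topic can look like, the following minimal sketch scores a topic's top words by how often they co-occur in a small set of feedback documents. It uses a UMass-style document co-occurrence score, which is one common choice; the abstract does not state which coherence measure the authors use, so this choice is an assumption. The function name `umass_coherence`, the smoothing constant `eps`, and the toy documents are all hypothetical and serve only as an example.

```python
import math


def umass_coherence(top_words, documents, eps=1.0):
    """UMass-style coherence of a topic (illustrative, not the paper's code).

    Sums log((D(w_i, w_j) + eps) / D(w_j)) over all pairs of top words with
    j < i, where D counts the documents containing the given word(s).
    Higher (less negative) scores mean the words co-occur more often.
    """
    doc_sets = [set(doc) for doc in documents]

    def doc_freq(*words):
        # Number of documents that contain every word in `words`.
        return sum(1 for d in doc_sets if all(w in d for w in words))

    score = 0.0
    for i in range(1, len(top_words)):
        for j in range(i):
            d_j = doc_freq(top_words[j])
            if d_j == 0:
                # Skip pairs whose conditioning word never occurs.
                continue
            d_ij = doc_freq(top_words[i], top_words[j])
            score += math.log((d_ij + eps) / d_j)
    return score


# Toy stand-in for tokenized top-ranked pseudo-relevant feedback documents.
feedback_docs = [
    ["topic", "model", "retrieval", "query"],
    ["topic", "model", "latent", "dirichlet"],
    ["query", "retrieval", "feedback", "ranking"],
    ["football", "match", "score", "goal"],
]

coherent_topic = ["topic", "model", "retrieval"]
incoherent_topic = ["topic", "football", "goal"]

coherent_score = umass_coherence(coherent_topic, feedback_docs)
incoherent_score = umass_coherence(incoherent_topic, feedback_docs)
print(coherent_score, incoherent_score)
```

In this toy setting, the topic whose words appear together in the same documents receives the higher score. In practice the top words would come from a topic model learned on the feedback documents rather than being written by hand.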