Research Publications
Estimating Likelihoods for Topic Models Topic models are a discrete analogue to principle component analysis and independent component
analysis that model topic at the word level within a document. They have many variants
such as NMF, PLSI and LDA, and are used in many fields such as genetics, text and the web, image
analysis and recommender systems. However, only recently have reasonable methods for
estimating the likelihood of unseen documents, for instance to perform testing or model comparison,
become available. This paper explores a number of such methods, improving on the
recent Left-to-Right algorithm of Wallach. Keywords: topic models, likelihood, MCMC Details
| Related Project
Related People |
