Research Publications

 
Search | Show all
buntine
Narrow your search
« 1 2 3 »

Results per Page 10 25 50 100 250
30 result(s)
By Type
By Year
By Research Group
Twitter: the world of 140 characters poses serious challenges to the efficacy of topic models on short, messy text. While topic models such as Latent Dirichlet Allocation (LDA) have a long history of successful application to news articles and ...
The 36th Annual ACM SIGIR Conference - July 2013
Lan Du, Wray Buntine, Mark Johnson
We present a new hierarchical Bayesian model for unsupervised topic segmentation.~This new model integrates a point-wise boundary sampling algorithm used in Bayesian segmentation into a structured topic model that can capture a simple hierarchical ...
2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - June 2013
Shengbo Guo, Scott Sanner, Thore Graepel, Wray Buntine
We extend the Bayesian skill rating system of TrueSkill to accommodate score-based match outcomes. TrueSkill has proven to be a very effective algorithm for matchmaking --- the process of pairing competitors based on similar skill-level --- in ...
European Conference on Machine Learning - September 2012
Lan Du, Wray Buntine, Huidong Jin
Topic models are increasingly being used for text analysis tasks, often times replacing earlier semantic techniques such as latent seman- tic analysis. In this paper, we develop a novel adaptive topic model with the ability to adapt topics from both ...
Empirical Methods in Natural Language Processing (EMNLP) - July 2012
Changyou Chen, Nan Ding, Wray Buntine
We develop dependent hierarchical normalized random measures and apply them to dynamic topic modeling. The dependency arises via {\em superposition}, {\em subsampling} and {\em point transition} on the underlying Poisson processes of these measures. ...
International Conference on Machine Learning (ICML) - June 2012
The two parameter Poisson-Dirichlet Process (PDP), a generalisation of the Dirichlet Process, is increasingly being used for probabilistic modelling in discrete areas such as language technology, bioinformatics, and image analysis. There is a rich ...
ARXIV - February 2012
Topic models have the potential to improve search and browsing by extracting useful semantic themes from web pages and other text documents. When learned topics are coherent and interpretable, they can be valuable for faceted browsing, results set ...
Neural Information Processing Systems (NIPS) - December 2011
Changyou Chen, Lan Du, Wray Buntine
Hierarchical modeling and reasoning are fundamental in ma- chine intelligence, and for this the two-parameter Poisson-Dirichlet Pro- cess (PDP) plays an important role. The most popular MCMC sampling algorithm for the hierarchical PDP and hierarchical ...
European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD) - September 2011
Lan Du, Wray Buntine, Huidong Jin
Understanding how topics within a document evolve over its structure is an interesting and important problem. In this paper, we address this problem by presenting a novel variant of Latent Dirichlet Allocation (LDA): Sequential LDA (SeqLDA). This ...
IEEE International Conference on Data Mining (ICDM) - December 2010
James Petterson, Smola Alex, Tiberio Caetano, Wray Buntine, Narayanamurthy Shravan
We extend Latent Dirichlet Allocation (LDA) by explicitly allowing for the encoding of side information in the distribution over words. This results in a variety of new capabilities, such as improved estimates for infrequently occurring words, as well as ...
Neural Information Processing Systems - December 2010