Research Publications

Placeholder
Restoring punctuation and casing in English text
Tim Baldwin, Manuel Paul Anil Kumar Joseph
This paper explores the use of machine learning techniques to restore punctuation and case in English text, as part of which it investigates the co-dependence of case information and punctuation. We achieve an overall F-score of .619 for the task using a variety of lexical and contextual features, and iterative retagging.

Details

accepted
Conference Paper
Australasian Joint Conference on Artificial Intelligence
547—556
Melbourne
www.infotech.monash.edu.au/about/news/conferences/ai09/