Session I0830-1000
0830Keynote Address by Donna Harman. Some thoughts on failure analysis for noisy data
0905Invited Lecture by John Tait. Noise and information
0945Boasters of Poster Papers
Tea Break1000-1030
Session II: Errors and Effects1030-1300
1030Laurianne Sitbon and Patrice Bellot. How to cope with questions typed by dyslexic users
1050Daniel Lopresti. Optical character recognition errors and their effects on natural language processing
1110Ulrich Reffle, Annette Gotscharek, Christoph Ringlstetter and Klaus Schulz. Successfully detecting and correcting false friends using channel profiles
1130Rachit Arora and Balaraman Ravindran. Latent Dirichlet allocation based multi-document summarization
1150Amaresh Pandey and Tanveer Siddiqui. An Unsupervised Hindi stemmer with heuristic improvements
1210Discussion
Lunch1230-1330
Session III: Named Entities and Blogs1400-1530
1330Valentin Jijkoun, Mahboob Khalid, Maarten Marx and Maarten de Rijke. Named entity normalization in user generated content
1350Rema Ananthanarayanan, Vijil Chenthamarakshan, Prasad Deshpande and Raghu Krishnapuram. Rule based synonyms for entity extraction from noisy text
1410Jiyin He, Wouter Weerkamp, Martha Larson and Maarten de Rijke. Blogger, Stick to your story: Modeling topical noise in blogs with coherence measures
1430Robert McArthur. Uncovering deep user context from blogs
1450Discussion
1530-1600 Session IV: Posters Over Tea
*Jinfeng Zhuang, Steven C.H. Hoi and Aixin Sun. On profiling blogs with representative entries
*Soumya Datta and Sudeshna Sarkar. Comparative study of statistical features of language in blog-vs-splog
*Anurag Bhardwaj, Faisal Farooq, Huaigu Cao and Venu Govindaraju. Topic based language models for OCR correction
*Eiman Alshammari and Jessica Lin. A novel Arabic lemmatization algorithm
Session V: Noisy Environments1600-1800
1600Sreangsu Acharyya, Sumit Negi, L Venkata Subramaniam and Shourya Roy. Unsupervised learning of multilingual short message service (SMS) dialect from noisy examples
1620Antti Järvelin, Tuomas Talvensaari and Anni Järvelin. Data driven methods for improving mono- and cross-lingual IR performance in noisy environments
1640Lipika Dey and Mirajul Haque. Opinion mining from noisy text data
1700Discussion