Some Thoughts on Failure
Analysis for Noisy Data
Donna Harman
NIST, USA
A
critical piece of evaluation is doing a failure analysis of the experimental
results. This is necessary to improve results, but equally important, it is
necessary to understand where the problems REALLY are, as opposed to just
following "conventional" wisdom.
The talk will present some current failure analysis techniques and
examine how these techniques could be extended to retrieval from noisy data.
About Donna Harman
Donna Harman
graduated from Cornell University
with a degree in electrical engineering, but has been involved with research in
new search engine techniques for many years.
She retired from the National Institute of Standards and Technology in
2005 after leading a group that worked in the area of natural language access
to full text, both in search and browsing modes. In 1992 she started the Text Retrieval
Conference (TREC), an ongoing forum that brings together researchers from industry
and academia to test their search engines against a common corpora involving
over a million documents, with appropriate topics and relevance judgments. She received the 1999 Strix Award from the
U.K Institute of Information Scientists for this effort. Starting in 2000 she
worked with Paul Over (also at NIST) and the text summarization community to
form a new effort (DUC) to evaluate text summarization. She is currently a scientist emeritus at
NIST.