Automatically Characterizing Salience Using Readers' Feedback
Abstract

Salience is an important characteristic of information that influences users' cognitive and emotional states. For example, the salient parts of a document are those that readers find moving or provoking. This article examines the concept of salience and its meanings in linguistics and information retrieval. It then analyzes the main drawbacks of content-based techniques for automatically identifying salient passages in a document. A new context-based method for overcoming these difficulties is subsequently presented: it identifies the passages that readers have reacted to by analyzing their textual feedback. Our experiments with blog posts showed that the method is effective and can be applied to 90% of commented posts.