01886nas a2200157 4500008004100000245006400041210006400105260004600169520138200215653001601597653001901613653001001632653001301642100001901655856005401674 2007 eng d00aStructured text retrieval by means of affordances and genre0 aStructured text retrieval by means of affordances and genre aSwinton, UK, UKbBritish Computer Society3 a
This paper offers a proposal for some preliminary research on the retrieval of structured text, such as extensible mark-up language (XML). We believe that capturing the way in which a reader perceives the meaning of documents, especially genres of text, may have implications for information retrieval (IR) and in particular, for cognitive IR and relevance. Previous research on 'shallow' features of structured text has shown that categorization by form is possible. Gibson's theory of 'affordances' and genre offer the reader the meaning and purpose - through structure - of a text, before the reader has even begun to read it, and should therefore provide a good basis for the 'deep' skimming and categorization of texts. We believe that Gibson's 'affordances' will aid the user to locate, examine and utilize shallow or deep features of genres and retrieve relevant output. Our proposal puts forward two hypotheses, with a list of research questions to test them, and culminates in experiments involving the studies of human categorization behaviour when viewing the structures of emails and web documents. Finally, we will examine the effectiveness of adding structural layout cues to a Yahoo discussion forum (currently only a bag-of-words), which is rich in structure, but only searchable through a Boolean search engine.
10aaffordances10acategorization10agenre10askimming1 aClark, Malcolm uhttp://dl.acm.org/citation.cfm?id=2227895.2227912