Pittsburgh: Eighth RIAO International Conference
Large-Scale Semantic Access to Content (Text, Image, Video and Sound)
May 30 to June 1, 2007, Carnegie Mellon University http://www.riao.org



SCOPE OF THE CONFERENCE

Recent scientific advances in information retrieval: presentations,
posters, demonstrations of research prototypes as well as selected
cutting-edge innovative products



Chairs: David A Evans (Americas), Sadaoki Furui (Asia), Chantal
Soulé-Dupuy (Europe)

PROGRAM

WEDNESDAY MAY 30, 2007

Wed 8:00-9:00 am Registration

Wed 9:00-10:00 am Opening Session

Wed 10:00-10:45 am Invited Speaker Donna Harman-NIST, USA "It's time to
move on"

Wed 11:00-12:15 am Session 1 VIDEO AND LOG RETRIEVAL

Pic-A-Topic: Efficient Viewing of Informative TV Contents on Travel,
Cooking, Food and More

     Tetsuya Sakai, Toshiba/NewsWatch-Japan, Tatsuya Uehara, Taishi
Shimomori, Makoto Koyama, Mika Fukui, Toshiba-Japan

Multimodal Segmentation of Lifelog Data

     Aiden R Doherty, Alan F Smeaton, CDVP, Dublin City Univ-Ireland,
Keansub Lee, Daniel PW Ellis, LabROSA, Columbia Univ-USA

SportsAnno: What Do You Think?

     James Lanagan, Alan F Smeaton, CDVP, Dublin City Univ-Ireland

Wed 1:30-3:10 pm Session 2 SOUND AND MUSIC RETRIEVAL

ORCHIVE: Digitizing and Analyzing Orca Vocalizations

     George Tzanetakis, Mathieu Lagrange, Univ of Victoria-Canada,
Paul Spong, Helena Symonds, Orcalab-Canada

Optimizations of Local Edition for Evaluating Similarity Between
Monophonic Musical Sequences

      Pierre Hanna, Pascal Ferraro, LaBRI, Bordeaux 1 Univ-France

Multi-Modal Music Information Retrieval-Visualisation and Evaluation
of Clusterings by Both Audio and Lyrics

     Robert Neumayer, Andreas Rauber, Vienna Univ of
Technology-Austria

Recent Advances in Automatic Speech Summarization

     Sadaoki Furui, Tokyo Institute of Technology-Japan

Wed 3:10-3:40 pm Break and Demonstrations

Wed 3:40-5:40 pm POSTER SESSION 1

Semantic Domains and Supersense Tagging for Domain-Specific Ontology
Learning

     Davide Picca, Univ of Lausanne-Switzerland

Unified Access to Heterogeneous Data in Cultural Heritage

     Marijn Koolen, Avi Arampatzis, Jaap Kamps, Univ of Amsterdam-the
Netherlands, Vincent de Keijzer, Haags Gemeentemuseum, The Hague-the
Netherlands, Nir Nussbaum, Univ of Amsterdam-the Netherlands

An approach to Multi-Lingual Text Summarization

     Alkesh Patel, Tanveer Siddiqui, Umashankar Tiwary, Indian
Institute of Information Technology, Allahabad-India

Cross-Media Entity Recognition in Nearly Parallel Visual and Textual
Documents

      Koen Deschacht, Marie-Francine Moens, Wouter Robeyns, Katholieke
Universiteit Leuven-Belgium

Query Refinement based on Topical Term Clustering

      Hiromi Wakaki, The Univ of Tokyo-Japan, Tomonari Masada,
Atsuhiro Takasu, Jun Adachi, National Institute of Informatics, The
Univ of Tokyo-Japan

Discriminative Fields for Modeling Semantic Concepts in Video

      Ming-yu Chen, Alexander Hauptmann, Carnegie Mellon Univ-USA

Multi-modal Interview Concept Detection for Rushes Exploitation

      Anan Liu, Tianjin Univ, Chinese Academy of Sciences-China, Li
Jintao, Zhang Yongdong, Tang Sheng, Chinese Academy of Sciences-China,
Yang Zhaoxuan, Tianjin Univ, Chinese Academy of Sciences-China

Query clustering to decide the best system to use

      Désiré Kompaoré, IRIT, Toulouse 3 Univ, Josiane Mothe, IRIT,
IUFM, Alain Baccini, Sébastien Dejean, LSP, Toulouse Univ-France

An IE and IR Approach to deal with Geographic Information Scope in
Textual Documents

      Christian Sallaberry, LIUPPA, Pau Univ-France, Mustapha Baziz,
IRIT, LIUPPA, Pau Univ-France, Julien Lesbegueries, Mauro Gaio,
LIUPPA, Pau Univ-France

Comprehensible and Accurate Cluster Labels in Text Clustering

      Jerzy Stefanowski, Dawid Weiss, Poznan Univ of Technology-Poland

Summarizing non-textual events with 'Briefing' focus

      Mohit Kumar, Dipanjan Das, Alexander Rudnicky, Carnegie Mellon
Univ-USA

A Robust Linguistic Platform for Efficient and Domain specific Web
Content Analysis

      Thierry Hamon, Adeline Nazarenko, Thierry Poibeau, Sophie Aubin,
Julien Derivière, LIPN, Paris 13 Univ-France

Effectiveness of Rich Document Representation in XML Retrieval

      Fahimeh Raja, Mostafa Keikha, Maseud Rahgozar, Univ of
Teheran-Iran, Farhad Oroumchian, Univ of Wollongong-Dubai

Expanding a Test Collection for Citation-based IR Experiments

      Anna Ritchie, Univ of Cambridge-UK, Stephen Robertson, Microsoft
Research Ltd-UK, Simone Teufel, Univ of Cambridge-UK

XQuery++: a graphical querying language for XML semi-structured data

      Ikram Amous, Anis Jedidi, Faiez Gargouri, LARIM, Sfax-Tunisia,
Florence Sèdes, IRIT, Toulouse 3 Univ-France

Structured Audio Player: Supporting Radio Archive Workflows with
Automatically Generated Structure Metadata

      Martha Larson, ISLA, Univ of Amsterdam-the Netherlands, Joachim
Köhler, Fraunhofer IAIS-Germany

Evaluating A Personal Information Assistant

      Ioannis Psarras, Joemon Jose, Glasgow Univ-UK

Wed 05:40-06:30 pm Session 3 TEXTUAL QUERIES AND SEARCH 1

Selecting Automatically the Best Query Translations

      Pierre-Yves Berger, Jacques Savoy, Univ of Neuchatel-Switzerland

An Information Retrieval Driven by Ontology: from Query to Document
Expansion

      Mustapha Baziz, IRIT, LIUPPA Pau Univ-France, Mohand Boughanem,
IRIT, Toulouse 3 Univ-France, Gabriella Pasi, Università degli Studi
di Milano Bicocca-Italy, Henri Prade, IRIT-France

THURSDAY MAY 31, 2007

Thu 9:00-9:45 am Invited Speaker Alan Smeaton-Dublin City Univ,
Ireland "Bridging the Molecular-Digital Divide: Instrumented Living
Rooms and Social Media"

Thu 9:45-10:35 am Session 4 TEXTUAL QUERIES AND SEARCH 2

Modeling Information Scent: A Comparison of LSA, PMI and GLSA
Similarity Measures on Common Tests and Corpora

      Raluca Budiu, Christiaan Royer, Peter Pirolli, Palo Alto
Research Center-USA

Investigating Retrieval Performance with Manually-Built Topic Models

      Xing Wei, W Bruce Croft, Univ of Massachusetts Amherst-USA

Thu 10:35-11:00 am Break and Demonstrations

Thu 11:00-12:40 pm Session 5 MODELING LANGUAGE AND SEARCH

Document frequency and term specificity

      Hideo Joho, Univ of Glasgow-UK, Mark Sanderson, Univ of
Sheffield-UK

Context Based Word Prediction for Texting Language

      Sachin Agarwal, Shilpa Arora, Carnegie Mellon Univ-USA

Combining linguistic indexes to improve the performances of
information retrieval systems: a machine learning based solution

      Fabienne Moreau, Vincent Claveau, Pascale Sébillot, IRISA,
Rennes Univ-France

Using Markov Chains to Exploit Word Relationships in Information
Retrieval

      Guihong Cao, Jian-Yun Nie, Jing Bai, Univ of Montreal-Canada

Thu 1:30-3:30 pm APPLICATIONS SESSION

Thu 3:30-4:00 pm Break and Demonstrations

Thu 4:00-5:15 pm Session 6 XML IN RETRIEVAL

A Survey on XML Focussed Component Retrieval

      Karen Pinel-Sauvagnat, Mohand Boughanem, IRIT, Toulouse 3
Univ-France

XML Fragments Extended with Database Operators

      Yosi Mass, Dafna Sheinwald, Benjamin Sznajder, Sivan Yogev, IBM
Research Laboratory in Haifa-Israel

From Layout to Semantic: a Reranking Model for Mapping Web Documents
to Mediated XML Representations

      Guillaume Wisniewski, Patrick Gallinari, LIP6, Paris 6-France

Thu 5:15-6:05 pm Session 7 IMAGE RETRIEVAL 1

Similarity Beyond Distance Measurement

      Feng Kang, Rong Jin, Michigan State Univ-USA, Steven Hoi, Hong
Kong Chinese Univ-China

Image Retrieval Using a Multilingual Ontology

      Adrian Popescu, Commissariat à l'Énergie Atomique-France

FRIDAY JUNE 1, 2007

Fri 9:00-10:40 am Session 8 PEER-TO-PEER AND WEB SEARCH

A Co-operative Web Services Paradigm for Supporting Crawlers

      Aravind Chandramouli, Susan Gauch, Univ of Kansas-USA

Content-Based Peer-to-Peer Network Overlay for Full-Text Federated
Search

      Jie Lu, Jamie Callan, Carnegie Mellon Univ-USA

Homepage Finding in Hybrid Peer-to-Peer Networks

      Enrico Bragante, Massimo Melucci, Univ of Padua-Italy

Collective Annotation: Perspectives for Information Retrieval
Improvement

      Guillaume Cabanac, IRIT-France, Max Chevalier, IRIT, LGC,
Toulouse 3 Univ-France, Claude Chrisment, Christine Julien, IRIT,
Toulouse 3 Univ-France

Fri 10:40-11:10 am Break and Demonstrations

Fri 11:10-12:00 am Session 9 IMAGE RETRIEVAL 2

Toward Content-based Indexing and Retrieval of Functional Brain Images

      Bing Bai, Paul Kantor, Nicu Cornea, Deborah Silver, Rutgers
Univ-USA

Using the Knowledge of Object Colors to Segment Images and Improve
Web Image Search

      Christophe Millet, CEA-LIST-France, Isabelle Bloch,
GET-ENST-France

Fri 1:00-2:40 pm Session 10 LINKS, PRIOR INFORMATION IN SEARCH

Relevance Propagation Model for Large Hypertext Documents Collections

      Idir Chibane, Bich-Lien Doan, SUPÉLEC-France

Combination of Document Priors in Web Information Retrieval

      Jie Peng, Craig Macdonald, Ben He, Iadh Ounis, Univ of
Glasgow-UK

Contextual Search Using Ontology-Based User Profiles

      Vishnu Challam, Microsoft Corporation-USA, Susan Gauch, Aravind
Chandramouli, Univ of Kansas-USA

Evidence-Based Information Extraction for High Accuracy Citation and
Author Name Identification

      Brett Powley, Robert Dale, Macquarie Univ-Australia

Fri 2:40-4:00 pm POSTER SESSION 2

Extracting Useful Information from the Full Text of Fiction

      Sharon Givon, Maria Milosavljevic, Univ of Edinburgh-UK

Ontology-Aided vs Keyword-Based Web Searches: A Statistical
Comparative Analysis

      Magdi Kamel, Ann Lee, Ed Powers, Naval Postgraduate School,
Monterey-USA

Smart Qualitative Data (SQUAD): Information Extraction in a Large
Document Archive

      Maria Milosavljevic, Claire Grover, Univ of Edinburgh-UK, Louise
Corti, Univ of Essex-UK

Indexing Low Frequency Information for Answering Complex Questions

      Abolfazl Keighobadi Lamjiri, Julien Dubuc, Leila Kosseim, Sabine
Bergler, CLaC laboratory, Concordia Univ-Canada

Using Prior Information Derived from Citations in Literature Search

      Edgar Meij, Maarten de Rijke, Univ of Amsterdam-the Netherlands

Information Retrieval Techniques for Templated Queries

      Giridhar Kumaran, James Allan, Univ of Massachusetts Amherst-USA

Discovering Missing Values in Semi-Structured Databases

      Xing Yi, James Allan, Victor Lavrenko, Univ of Massachusetts
Amherst-USA

Audio Feature Engineering for Automatic Music Genre Classification

      Paolo Annesi, Roberto Basili, Raffaele Gitto, Alessandro
Moschitti, Riccardo Petitti, Univ of Rome Tor Vergata-Italy

Exploring Interactive Information Retrieval: An Integrated Approach to
Interface Design and Interaction Analysis

      Gheorghe Muresan, Rutgers Univ-USA

Construction of Vietnamese Corpora

      Thao Pham Thi Xuan, Faculty of Computer Sciences, Univ of
Information Technology-VNU of HCMC Vietnam-Vietnam, Ai Kawazoe,
National Institute of Informatics, Tokyo-Japan, Dien Dinh, Faculty of
Information Technology, Univ of Natural Sciences-VNU of HCMC
Vietnam-Vietnam, Nigel Collier, National Institute of Informatics,
Tokyo-Japan, Tri Tran Quoc, Faculty of Computer Sciences, Univ of
Information Technology-VNU of HCMC Vietnam-Vietnam

Using a Content-and-Structure Oriented Method for Relevance Feedback
in XML Retrieval

      Lobna Hlaoua, Mohand Boughanem, Karen Pinel-Sauvagnat, IRIT,
Toulouse 3 Univ-France

Research Methodology in Studies of Information Retrieval Evaluation

      Ben Carterette, James Allan, Univ of Massachusetts Amherst-USA

Fri 4:00-4:30 pm Break and Demonstrations

Fri 4:30-5:45 pm Session 11 QUESTION ANSWERING, FACTS, AND SUMMARIES

Logical Validation, Answer Merging and Witness Selection-A Study in
Multi-Stream Question Answering

      Ingo Glöckner, Sven Hartrumpf, Johannes Leveling, Univ of
Hagen-Germany

Estimating Importance Features for Fact Mining (With a Case Study in
Biography Mining)

      Sisay Fissaha Adafre, Maarten de Rijke, Informatics Institute,
Univ of Amsterdam-the Netherlands

Capturing Sentence Prior for Query-Based Multi-Document Summarization

      Prasad Pingali, International Institute of Information
Technology-India, Jagadeesh Jagarlamudi, Microsoft-USA

Fri 5:45-6:30 pm Concluding Ceremony





METIOREW - University of Málaga

Cleopatra - National Research Council

MOMINDUM Base- MOMINDUM

Vertical sorting: How to efficiently replace hand-filed documents with
simple stacks of paper - Xamance

MIND - Spiking neuron network for video indexing and image search -
SPIKENET Technology

The Arrowsmith Suite of Tools to enhance Biomedical Discovery and
Collaboration - University of Illinois

Region based image analysis toolkit - University of Essex

Histara - Reviews - Ecole Pratique des Hautes Etudes / University of
Bern

ACADI: Automatic Character (in Audiovisual Document) Indexing -
University of Toulouse

ADVESTIGATE - Advestigo

The SPLINE Platform for e-learning - Indian Institute of Management
Bangalore

EasyLayer - A Multi-Layered Citrix installation for heterogeneous user
groups - University West

6S: P2P Web index collecting and sharing application - University of
Indiana

METIORE-WISP version prototype - LORIA

Extracting semantic content from webpages - TheFind Inc.

Automatic Online Advertisement Targeting using Advanced Machine
Learning Techniques - Turn Inc.

Decider.track, the first "Man-Machine Co-Decider" - Co-Decision
Technology SAS

The Endeca Information Access Platform - Endeca

NpPicture - NewPhenix

Xfy - JustSystems Corporation



The RIAO conferences are organized every three years, in Europe and
North America, by the Center for Advanced Study of Information Systems
(CASIS) and the Centre de Hautes Etudes Internationales d'Informatique
Documentaire (CID). Contact: information@le-cid.org



The RIAO 2007 conference is held in cooperation with ACM-SIGIR.