| Day 1: Thursday June 17, 2004 |
| 2:30-3:45pm |
Keynote Talk |
| |
Speaker: Prabhakar Raghavan (Verity, Inc.)
Title: Text Centric Structure Extraction and Exploitation (slides)
Abstract: In this talk we look at the convergence
of three text-centric areas: entity extraction, semi-structured
querying and the integration of search results; we view
these in the context of text-centric XML applications.
The main focus of the talk will be on text in XML querying
and some recent work in approaching it from the perspective
of information retrieval.
|
| 3:45-4:00pm |
Coffee Break |
| 4:00-5:30pm |
Paper Session 1, Web Querying and Mining
Chair: Kevin C. Chang (U. of Illinois at Urbana-Champaign) |
| |
Spam, Damn Spam, and Statistics: Using Statistical Analysis to Locate Spam Web Pages (pdf)
Dennis Fetterly, Mark Manasse, and Marc Najork
Querying Bi-level Information (pdf)
Sudarshan Murthy, David Maier, and Lois Delcambre
Visualizing and Discovering Web Navigational Patterns (pdf)
Jiyang Chen, Lisheng Sun, Osmar R. Zaiane, and Randy Goebel
|
| 5:30-5:45pm |
Coffee Break |
| 5:45-6:45pm |
Paper Session 2, Peer-to-Peer Search Systems
Chair: Juliana Freire (OGI/OHSU) |
| |
One Torus to Rule Them All: Multidimensional Queries in P2P Systems (pdf)
Prasanna Ganesan, Beverly Yang, and Hector Garcia-Molina
Querying Peer-to-Peer Networks Using P-Trees (pdf)
Adina Crainiceanu, Prakash Linga, Johannes Gehrke, and Jayavel Shanmugasundaram |
| Day 2: Friday June 18, 2004 |
| 9:00-10:00am |
Paper Session 3, Data Dissemination
Chair: Alexandros Labrinidis (U. of Pittsburgh) |
| |
Scalable Dissemination: What's Hot and What's Not (pdf)
Jonathan Beaver, Nicholas Morsillo, Kirk Pruhs, Panos K. Chrysanthis, and Vincenzo Liberatore
Semantic Multicast for Content-based Stream Dissemination (pdf)
Olga Papaemmanouil and Ugur Cetintemel |
| 10:00-10:30am |
Coffee Break |
| 10:30am-12:00 noon |
Paper Session 4, XML Query Processing |
| |
Twig Query Processing over Graph-Structured XML Data (pdf)
Zografoula Vagena, Mirella M. Moro, and Vassilis J. Tsotras
Unraveling the Duplicate-Elimination Problem in XML-to-SQL Query Translation (pdf)
Rajasekar Krishnamurthy, Raghav Kaushik, and Jeffrey F. Naughton
Best-Match Querying from Document-Centric XML (pdf)
Jaap Kamps, Maarten Marx, Maarten de Rijke, and Borkur Sigurbjornsson |
| 12:00 noon-2:00pm |
Lunch (not provided by the workshop) |
| 2:00-3:30pm |
Paper Session 5, Approximate and Ranked Query Processing |
| |
Challenges in Selecting Paths for Navigational Queries: Trade-Off of Benefit of Path versus Cost of Plan (pdf)
Maria-Esther Vidal, Louiqa Raschid, and Julian Mestre
Content and Structure in Indexing and Ranking XML (pdf)
Felix Weigel, Holger Meuss, Klaus U. Schulz, and Francois Bry
Mining Approximate Functional Dependencies and Concept Similarities to Answer Imprecise Queries (pdf)
Ullas Nambiar and Subbarao Kambhampati |
| 3:30-3:45pm |
Coffee Break |
| 3:45-5:15pm |
Paper Session 6, XML Schemas and Validation |
| |
DTDs versus XML Schema: A Practical Study (pdf)
Geert Jan Bex, Frank Neven, and Jan Van den Bussche
On Validation of XML Streams Using Finite State Machines (pdf)
Cristiana Chitic and Daniela Rosu
Checking Potential Validity of XML Documents (pdf)
Ionut Emil Iacob, Alexander Dekhtyar, and Michael I. Dekhtyar |