• Article
  • | |
  • Metrics
  • |
  • Reference [13]
  • |
  • Related [20]
  • |
  • Cited by [21]
  • | |
  • Comments
    Abstract:

    How to efficiently evaluate massive XPaths set over an XML stream is a fundamental problem in applications of the data stream. The current methods can not fully support the commonly used features of XPath, or can not meet the space and time requirement of the data stream applications. In this paper, a new tree automata based machine, XEBT, is proposed to solve the problem. Different from traditional ones, XEBT has the following features: First, it is based on tree automata with a powerful expressiveness, which can support Xpath {[]} without extra states or intermediate results; Second, XEBT supports many optimization strategies, including DTD based XPath tree automata construction, partial determination to reduce the concurrent states at running time with limited extra space costs, and the combination of bottom-up and top-down evaluation. Experimental results show that XEBT supports the complex Xpath and outperforms the former work in both efficiency and space cost.

    Reference
    [1]Diao Y, Fischer P. Yfilter: Efficient and scalable filtering of XML documents. In: Proc. Of the 18th Int'l Conf. On Data Engineering. 2002. 341-345.
    [2]Chan C, Felber P, Garofalakis M, Rastogi R. Efficient filtering of XML document with Xpath expressions. In: Proc. Of the Int'l Conf. On Data Engineering. San Jose: IEEE Computer Society, 2002. 235-244.
    [3]Green TJ, Miklau G, Onizuka M, Suciu D. Processing XML streaming with deterministic automata. In: Calvanese D, Lenzerini M, Motwani R, eds. Proc. Of the Int'l Conf. On Data Theory. LNCS 2572, Springer-Verlag, 2003. 173-189.
    [4]Gupta AK, Suciu D. Stream processing of Xpath queries with predicates. In: Halevy AY, Ives ZG, Doan AH, eds. Proc. Of the 2003 ACM SIGMOD Int'l Conf. On Management of Data. ACM, 2003. 419-430.
    [5]Nguyen B, Abiteboul S, Cobena G, Preda M. Monitoring XML data on the Web. In: Aref WG, ed. Proc. Of the ACM/SIGMOD Conf. On Management of Data. 2001. 437-448.
    [6]Chen J, Dewitt D, Tian F, Wang Y. NiagaraCQ: A scalable continuous query system for internet databases. In: Chen WD, Naughton JF, Bernstein PA, eds. Proc. Of the ACM/SIGMOD Conf. Management of Data. ACM, 2000. 379-390.
    [7]Clark J. XML Path language (Xpath). 1999. Available from the W3C, http://www.w3.org/TR/Xpath
    [8]Neven F. Automata, logic, and XML. In: Proc. Of the 16th Int'l Workshop Computer Science Logic. CSL, 2002. 2-26.
    [9]Milo T, Suciu D, Vianu V. Typechecking for XML Transformers. In: Proc. Of the PODS 2000. ACM, 2000. 11-22.
    [10]Miklau G, Suciu D. Containment and equivalence for an Xpath fragment. In: Popa L, ed. Proc. Of the 21 Symp. On Principle of Database Systems. ACM, 2002. 65-76.
    [11]Gao J, Yang DQ, Tang SW, Wang TJ. DTD based deterministic Xpath rewriting and logical optimization. Journal of Software, 2004,15(12):1860-1868 (in Chinese with English abstract). Http://www.jos.org.cn/1000-9825/15/1860.htm
    [12]NASA's Astronomical Data Center. ADC XML Resource Page. Http://xml.gsfc.nasa.gov
    [13]高军,杨冬青,唐世渭,王腾蛟.一种基于DTD的XPath逻辑优化方法.软件学报,2004,15(12):1860?1868. http://www.jos.org.cn/ 1000-9825/15/1860.htm
    Comments
    Comments
    分享到微博
    Submit
Get Citation

高军,杨冬青,唐世渭,王腾蛟.基于树自动机的XPath在XML数据流上的高效执行.软件学报,2005,16(2):223-232

Copy
Share
Article Metrics
  • Abstract:5332
  • PDF: 5701
  • HTML: 0
  • Cited by: 0
History
  • Received:August 27,2003
  • Revised:May 08,2004
You are the first2044991Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063