Attribute Level Lineage and Probabilistic Computation of Uncertain Data

doi:10.13328/j.cnki.jos.004426

微信服务号

微信订阅号

2025-5-15- 19

Home > Archive>Volume 25, Issue 4, 2014 >863-879. DOI:10.13328/j.cnki.jos.004426

PDF HTML XML Export Cite reminder

Attribute Level Lineage and Probabilistic Computation of Uncertain Data
DOI:
                        10.13328/j.cnki.jos.004426
                    
Author:
                        WANG LiangWANG Liang
Computer School, Wuhan University, Wuhan 430072, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
ZHOU Guang-YanZHOU Guang-Yan
Computer School, Wuhan University, Wuhan 430072, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
WANG Li-WeiWANG Li-Wei
International School of Software, Wuhan University, Wuhan 430079, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
PENG Zhi-YongPENG Zhi-Yong
Computer School, Wuhan University, Wuhan 430072, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

In the traditional database applications, data is generally considered to be accurate and available. However, data uncertainty often occurs in the real world. Most of current methods usually use provenance information to track data uncertainty while placing focus on the uncertainty with tuple level rather than attribute level. Their main idea is to identify a tuple with a variable, and then construct Boolean expression based on provenance information to compute the probability of a tuple. For the tuple with lots of uncertain attributes, these methods can not help users rapidly and correctly identify the source of uncertainty. In this paper, attribute expressions are defined and used to construct the lineage expression for each result tuple. With the lineage expression, the new method can not only accurately traces the location where the uncertainty takes place, but also computes the probability of the result tuple. Meanwhile, the exchange algorithm of the lineage expression is proposed to guarantee the correctness of the probability computation. In order to improve the efficiency of the probability computation, a method is also provided to construct share paths, and compute the probability of atomic disjunctions during the period of constructing share paths. Experiments are performed to compare tuple level lineage expressions with the existing methods on both time and cost. The results show the feasibility and validity of the proposed method, and further verify the validity of utilizing share paths to speed up the probability computation.

Key words:uncertainty;attribute expression;lineage expression;probabilistic computation;share path

Get Citation

王梁,周光焱,王黎维,彭智勇.不确定关系数据属性级溯源表示与概率计算.软件学报,2014,25(4):863-879

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:September 26,2012
Revised:May 03,2013
Adopted:
Online: March 28,2014
Published:

You are the first2044714Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History