rjxb软件学报Journal of Software1000-9825软件学报编辑部中国北京675102204b61d8d34df3f9aabb0723f17e18b8e7301228fe0969969c46769b73b90210.13328/j.cnki.jos.006751数据定价与交易研究综述Survey on Data Pricing and Trading Research江东JIANGDong
In the big data era, an enormous amount of data is collected in every industry with the development of information technology. Data is the foundation of the digital economy, containing great value. However, for the lack of efficient and feasible data-sharing mechanisms, data owners seldom communicate with each other, which leads to the formation of data islands and is unfavorable to the healthy development of the big data industry. Hence, allocating a proper price to data and designing an efficient data market platform have become important ways to eliminate data islands and secure sufficient data flow. This study systematically sorts out the technical issues regarding data pricing and trading. Specifically, the difficulties and related principles of data pricing and trading are introduced. The life cycle of data in the data market is divided into four stages: data collection and integration, data management and analysis, data pricing, and data trading. Upon the research on big data management, related methods applicable to the first two stages are elaborated. After that, data pricing methods are categorized, and usage scenarios, advantages, and shortcomings of these methods are analyzed. Moreover, the classification of data markets is introduced, and the impact of market types and participants’ behavior in data trading on the trading process and prices is studied with game theory and auctions as examples. Finally, future research directions of data pricing and trading are discussed.
数据定价数据交易数据市场定价模型数据管理data pricingdata tradingdata marketpricing modeldata management国家自然科学基金(61932004, 61732003, 62072087, U2001211); 中央高校基本科研基金(N181605012)National Natural Science Foundation of China (61932004, 61732003, 62072087, U2001211); Research Funds for the Central Universities (N181605012)
Huang XRFrom complexity science to big data technique20142925910.3969/j.issn.1672-934X.2014.02.001
Huang XR. From complexity science to big data technique. Journal of Changsha University of Science & Technology (Social Science), 2014, 29(2): 5–9 (in Chinese with English Abstract). [doi: 10.16573/j.cnki.1672-934x.2014.02.001]
Zhang XW, Jiang D, Yuan YA survey of game theory and auction-based data pricing2021746179
Zhang XW, Jiang D, Yuan Y. A survey of game theory and auction-based data pricing. Big Data Research, 2021, 7(4): 61–79 (in Chinese with English Abstract). [doi: 10.11959/j.issn.2096-0271.2021039]
Pei JA survey on data pricing: From economics to data science202234104586460810.1109/TKDE.2020.3045927
Pei J. A survey on data pricing: From economics to data science. IEEE Transactions on Knowledge and Data Engineering, 2022, 34(10): 4586–4608. [doi: 10.1109/TKDE.2020.3045927]
Liu N, Hao XJ, Chen YHA review and comparative analysis of domestic and foreign research on big data pricing methods2021768910210.11959/j.issn.2096-0271.2021063
Liu N, Hao XJ, Chen YH. A review and comparative analysis of domestic and foreign research on big data pricing methods. Big Data Research, 2021, 7(6): 89–102 (in Chinese with English Abstract). [doi: 10.11959/j.issn.2096-0271.2021063]
Cai L, Huang ZH, Liang Y, Zhu YYSurvey of data pricing20211591595160610.3778/j.issn.1673-9418.2103069
Cai L, Huang ZH, Liang Y, Zhu YY. Survey of data pricing. Journal of Frontiers of Computer Science and Technology, 2021, 15(9): 1595–1606 (in Chinese with English Abstract). [doi: 10.3778/j.issn.1673-9418.2103069]
Liu ZYAnalysis on pricing of big data20161576410.13366/j.dik.2016.01.057
Liu ZY. Analysis on pricing of big data. Documentation, Information & Knowledge, 2016, (1): 57–64 (in Chinese with English Abstract). [doi: 10.13366/j.dik.2016.01.057]
Zhang M, Arafa A, Huang JW, Poor HVPricing fresh data20213951211122510.1109/JSAC.2021.3065088
Zhang M, Arafa A, Huang JW, Poor HV. Pricing fresh data. IEEE Journal on Selected Areas in Communications, 2021, 39(5): 1211–1225. [doi: 10.1109/JSAC.2021.3065088]
Tang SS, Liu YTChina’s big data transaction urgently needs a breakthrough201613192110.3969/j.issn.1673-033X.2016.13.007
Tang SS, Liu YT. China's big data transaction urgently needs a breakthrough. China Development Observation, 2016, (13): 19–21 (in Chinese with English abstract). [doi:DOI: 10.3969/j.issn.1673-033X.2016.13.007] (查阅所有网上资料, 未找到对应的英文翻译, 请联系作者确认)
Hu YLResearch on status quo and pricing issue of big data trade201712161910.14076/j.issn.1006-2025.2017.12.04
Hu YL. Research on status quo and pricing issue of big data trade. Prices Monthly, 2017, (12): 16–19 (in Chinese with English abstract). [doi: 10.14076/j.issn.1006-2025.2017.12.04]
Cai H, Zhu YM, Li J, Yu JDDouble auction for a data trading market with preferences and conflicts of interest201962101490150410.1093/comjnl/bxz025
Cai H, Zhu YM, Li J, Yu JD. Double auction for a data trading market with preferences and conflicts of interest. The Computer Journal, 2019, 62(10): 1490–1504. [doi: 10.1093/comjnl/bxz025]
Xiong ZH, Niyato D, Wang P, Han Z, Zhang YDynamic pricing for revenue maximization in mobile social data market with network effects20201931722173710.1109/TWC.2019.2957092
Xiong ZH, Niyato D, Wang P, Han Z, Zhang Y. Dynamic pricing for revenue maximization in mobile social data market with network effects. IEEE Transactions on Wireless Communications, 2020, 19(3): 1722–1737. [doi: 10.1109/TWC.2019.2957092]
Khokhar RH, Iqbal F, Fung BCM, Bentahar JEnabling secure trustworthiness assessment and privacy protection in integrating data for trading person-specific information202168114916910.1109/TEM.2020.2974210
Khokhar RH, Iqbal F, Fung BCM, Bentahar J. Enabling secure trustworthiness assessment and privacy protection in integrating data for trading person-specific information. IEEE Transactions on Engineering Management, 2021, 68(1): 149–169. [doi: 10.1109/TEM.2020.2974210]
Delgado-Segura S, Pérez-Solà C, Navarro-Arribas G, Herrera-Joancomartí JA fair protocol for data trading based on Bitcoin transactions202010783284010.1016/j.future.2017.08.021
Delgado-Segura S, Pérez-Solà C, Navarro-Arribas G, Herrera-Joancomartí J. A fair protocol for data trading based on Bitcoin transactions. Future Generation Computer Systems, 2020, 107: 832–840. [doi: 10.1016/j.future.2017.08.021]
Shapley LSA value for n-person games199769
Shapley LS. A value for n-person games. Classics in Game Theory, 1997, 69. (查阅所有网上资料, 未找到本条文献信息, 请联系作者确认)
Jia RX, Dao D, Wang BX, Hubis FA, Gürel NM, Li B, Zhang C, Spanos CJ, Song DEfficient task-specific data valuation for nearest neighbor algorithms201912111610162310.14778/3342263.3342637
Jia RX, Dao D, Wang BX, Hubis FA, Gürel NM, Li B, Zhang C, Spanos CJ, Song D. Efficient task-specific data valuation for nearest neighbor algorithms. Proceedings of the VLDB Endowment, 2019, 12(11): 1610–1623. [doi: 10.14778/3342263.3342637]
Balazinska M, Howe B, Suciu DData markets in the cloud: An opportunity for the database community20114121482148510.14778/3402755.3402801
Balazinska M, Howe B, Suciu D. Data markets in the cloud: An opportunity for the database community. Proceedings of the VLDB Endowment, 2011, 4(12): 1482–1485. [doi: 10.14778/3402755.3402801]
Li C, Li DY, Miklau G, Suciu DA theory of pricing private data20143943410.1145/2691190.2691191
Li C, Li DY, Miklau G, Suciu D. A theory of pricing private data. ACM Transactions on Database Systems, 2014, 39(4): 34. [doi: 10.1145/2691190.2691191]
Lin BR, Kifer DOn arbitrage-free pricing for general data queries20147975776810.14778/2732939.2732948
Lin BR, Kifer D. On arbitrage-free pricing for general data queries. Proceedings of the VLDB Endowment, 2014, 7(9): 757–768. [doi: 10.14778/2732939.2732948]
Babaioff M, Immorlica N, Lucier B, Weinberg SMA simple and approximately optimal mechanism for an additive buyer20206742410.1145/3398745
Babaioff M, Immorlica N, Lucier B, Weinberg SM. A simple and approximately optimal mechanism for an additive buyer. Journal of the ACM, 2020, 67(4): 24. [doi: 10.1145/3398745]
Chawla S, Deep S, Koutris P, Teng YFRevenue maximization for query pricing201913111410.14778/3357377.3357378
Chawla S, Deep S, Koutris P, Teng YF. Revenue maximization for query pricing. Proceedings of the VLDB Endowment, 2019, 13(1): 1–14. [doi: 10.14778/3357377.3357378]
Fernandez RC, Subramaniam P, Franklin MJData market platforms: Trading data assets to solve data problems202013121933194710.14778/3407790.3407800
Fernandez RC, Subramaniam P, Franklin MJ. Data market platforms: Trading data assets to solve data problems. Proceedings of the VLDB Endowment, 2020, 13(12): 1933–1947. [doi: 10.14778/3407790.3407800]
Dalvi N, Kumar R, Soliman MAutomatic wrappers for large scale web extraction20114421923010.14778/1938545.1938547
Dalvi N, Kumar R, Soliman M. Automatic wrappers for large scale web extraction. Proceedings of the VLDB Endowment, 2011, 4(4): 219–230. [doi: 10.14778/1938545.1938547]
Cafarella MJ, Halevy A, Wang DZ, Wu E, Zhang YWebTables: Exploring the power of tables on the web20081153854910.14778/1453856.1453916
Cafarella MJ, Halevy A, Wang DZ, Wu E, Zhang Y. WebTables: Exploring the power of tables on the web. Proceedings of the VLDB Endowment, 2008, 1(1): 538–549. [doi: 10.14778/1453856.1453916]
Cafarella M, Halevy A, Lee H, Madhavan J, Yu C, Wang DZ, Wu ETen years of webtables201811122140214910.14778/3229863.3240492
Cafarella M, Halevy A, Lee H, Madhavan J, Yu C, Wang DZ, Wu E. Ten years of webtables. Proceedings of the VLDB Endowment, 2018, 11(12): 2140–2149. [doi: 10.14778/3229863.3240492]
Elmeleegy H, Madhavan J, Halevy AHarvesting relational tables from lists on the Web2009211078108910.14778/1687627.1687749
Elmeleegy H, Madhavan J, Halevy A. Harvesting relational tables from lists on the web. Proceedings of the VLDB Endowment, 2009, 2(1): 1078–1089. [doi: 10.14778/1687627.1687749]
Zhu C, Cao JSummary and prospect on entity resolution2015423812, 1810.11896/j.issn.1002-137X.2015.3.002
Zhu C, Cao J. Summary and prospect on entity resolution. Computer Science, 2015, 42(3): 8–12, 18 (in Chinese with English Abstract). [doi: 10.11896/j.issn.1002-137X.2015.3.002]
Yin XX, Han JW, Yu PSTruth discovery with multiple conflicting information providers on the Web200820679680810.1109/TKDE.2007.190745
Yin XX, Han JW, Yu PS. Truth discovery with multiple conflicting information providers on the web. IEEE Transactions on Knowledge and Data Engineering, 2008, 20(6): 796–808. [doi: 10.1109/TKDE.2007.190745]
Domshlak C, Gal A, Roitman HRank aggregation for automatic schema matching200719453855310.1109/TKDE.2007.1010
Domshlak C, Gal A, Roitman H. Rank aggregation for automatic schema matching. IEEE Transactions on Knowledge and Data Engineering, 2007, 19(4): 538–553. [doi: 10.1109/TKDE.2007.1010]
Li YL, Li Q, Gao J, Su L, Zhao B, Fan W, Han JWConflicts to harmony: A framework for resolving conflicts in heterogeneous data by truth discovery20162881986199910.1109/TKDE.2016.2559481
Li YL, Li Q, Gao J, Su L, Zhao B, Fan W, Han JW. Conflicts to harmony: A framework for resolving conflicts in heterogeneous data by truth discovery. IEEE Transactions on Knowledge and Data Engineering, 2016, 28(8): 1986–1999. [doi: 10.1109/TKDE.2016.2559481]
http://wp.sigmod.org/?p=1629]]>
Liang F, Yu W, An D, Yang QY, Fu XW, Zhao WA survey on big data market: Pricing, trading and protection20186151321515410.1109/ACCESS.2018.2806881
Liang F, Yu W, An D, Yang QY, Fu XW, Zhao W. A survey on big data market: Pricing, trading and protection. IEEE Access, 2018, 6: 15132–15154. [doi: 10.1109/ACCESS.2018.2806881]
Upadhyaya P, Balazinska M, Suciu DPrice-optimal querying with data APIs20169141695170610.14778/3007328.3007335
Upadhyaya P, Balazinska M, Suciu D. Price-optimal querying with data APIs. Proceedings of the VLDB Endowment, 2016, 9(14): 1695–1706. [doi: 10.14778/3007328.3007335]
Wang XW, Wei XH, Liu YY, Gao SOn pricing approximate queries201845319821510.1016/j.ins.2018.04.036
Wang XW, Wei XH, Liu YY, Gao S. On pricing approximate queries. Information Sciences, 2018, 453: 198–215. [doi: 10.1016/j.ins.2018.04.036]
Shen YC, Guo B, Shen Y, Duan XL, Dong XQ, Zhang HA pricing model for big personal data201621548249010.1109/TST.2016.7590317
Shen YC, Guo B, Shen Y, Duan XL, Dong XQ, Zhang H. A pricing model for big personal data. Tsinghua Science and Technology, 2016, 21(5): 482–490. [doi: 10.1109/TST.2016.7590317]
Yu HF, Zhang MXData pricing strategy based on data quality201711211010.1016/j.cie.2017.08.008
Yu HF, Zhang MX. Data pricing strategy based on data quality. Computers & Industrial Engineering, 2017, 112: 1–10. [doi: 10.1016/j.cie.2017.08.008]
Yang J, Zhao CC, Xing CXBig data market optimization pricing model based on data quality20192019596406810.1155/2019/5964068
Yang J, Zhao CC, Xing CX. Big data market optimization pricing model based on data quality. Complexity, 2019, 2019: 5964068. [doi: 10.1155/2019/5964068]
Liu K, Qiu XY, Chen WH, Chen X, Zheng ZBOptimal pricing mechanism for data market in Blockchain-enhanced internet of things2019669748976110.1109/JIOT.2019.2931370
Liu K, Qiu XY, Chen WH, Chen X, Zheng ZB. Optimal pricing mechanism for data market in Blockchain-enhanced internet of things. IEEE Internet of Things Journal, 2019, 6(6): 9748–9761. [doi: 10.1109/JIOT.2019.2931370]
Kang X, Zhang R, Motani MPrice-based resource allocation for spectrum-sharing femtocell networks: A Stackelberg game approach201230353854910.1109/JSAC.2012.120404
Kang X, Zhang R, Motani M. Price-based resource allocation for spectrum-sharing femtocell networks: A Stackelberg game approach. IEEE Journal on Selected Areas in Communications, 2012, 30(3): 538–549. [doi: 10.1109/JSAC.2012.120404]
Yao HP, Mai TL, Wang JJ, Ji Z, Jiang CX, Qian YResource trading in Blockchain-based industrial Internet of Things20191563602360910.1109/TII.2019.2902563
Yao HP, Mai TL, Wang JJ, Ji Z, Jiang CX, Qian Y. Resource trading in Blockchain-based industrial internet of things. IEEE Transactions on Industrial Informatics, 2019, 15(6): 3602–3609. [doi: 10.1109/TII.2019.2902563]
Wu QW, Zhou MC, Zhu QS, Xia YNVCG auction-based dynamic pricing for multigranularity service composition201815279680510.1109/TASE.2017.2695123
Wu QW, Zhou MC, Zhu QS, Xia YN. VCG auction-based dynamic pricing for multigranularity service composition. IEEE Transactions on Automation Science and Engineering, 2018, 15(2): 796–805. [doi: 10.1109/TASE.2017.2695123]
Gao WC, Yu W, Liang F, Hatcher WG, Lu CPrivacy-preserving auction for big data trading using homomorphic encryption20207277679110.1109/TNSE.2018.2846736
Gao WC, Yu W, Liang F, Hatcher WG, Lu C. Privacy-preserving auction for big data trading using homomorphic encryption. IEEE Transactions on Network Science and Engineering, 2020, 7(2): 776–791. [doi: 10.1109/TNSE.2018.2846736]
Cao XY, Chen Y, Ray Liu KJData trading with multiple owners, collectors, and users: An iterative auction mechanism20173226828110.1109/TSIPN.2017.2668144
Cao XY, Chen Y, Ray Liu KJ. Data trading with multiple owners, collectors, and users: An iterative auction mechanism. IEEE Transactions on Signal and Information Processing over Networks, 2017, 3(2): 268–281. [doi: 10.1109/TSIPN.2017.2668144]
Xiong W, Xiong LAnti-collusion data auction mechanism based on smart contract202155538640910.1016/j.ins.2020.10.053
Xiong W, Xiong L. Anti-collusion data auction mechanism based on smart contract. Information Sciences, 2021, 555: 386–409. [doi: 10.1016/j.ins.2020.10.053]
Wang RY, Strong DMBeyond accuracy: What data quality means to data consumers199612453310.1080/07421222.1996.11518099
Wang RY, Strong DM. Beyond accuracy: What data quality means to data consumers. Journal of Management Information Systems, 1996, 12(4): 5–33. [doi: 10.1080/07421222.1996.11518099]
Sen ARational fools: A critique of the behavioral foundations of economic theory197764317344
Sen A. Rational fools: A critique of the behavioral foundations of economic theory. Philosophy and Public Affairs, 1977, 6(4): 317–344.
McAfee RP. A dominant strategy double auction. Journal of Economic Theory, 1992, 56(2): 434–450. [doi: 10.1016/0022-0531(92)90091-U]
Niyato D, Hoang DT, Luong NC, Wang P, Kim DI, Han ZSmart data pricing models for the Internet of Things: A bundling strategy approach2016302182510.1109/MNET.2016.7437020
Niyato D, Hoang DT, Luong NC, Wang P, Kim DI, Han Z. Smart data pricing models for the internet of things: A bundling strategy approach. IEEE Network, 2016, 30(2): 18–25. [doi: 10.1109/MNET.2016.7437020]
Nash J. Non-cooperative games. Annals of mathematics, 1951, 54(2): 286–295. [doi: 10.2307/1969529]
Luong NC, Hoang DT, Wang P, Niyato D, Kim DI, Han ZData collection and wireless communication in internet of things (IoT) using economic analysis and pricing models: A survey20161842546259010.1109/COMST.2016.2582841
Luong NC, Hoang DT, Wang P, Niyato D, Kim DI, Han Z. Data collection and wireless communication in internet of things (IoT) using economic analysis and pricing models: A survey. IEEE Communications Surveys & Tutorials, 2016, 18(4): 2546–2590. [doi: 10.1109/COMST.2016.2582841]
Li ZN, Yang ZY, Xie SLComputing resource trading for edge-cloud-assisted Internet of Things20191563661366910.1109/TII.2019.2897364
Li ZN, Yang ZY, Xie SL. Computing resource trading for edge-cloud-assisted internet of things. IEEE Transactions on Industrial Informatics, 2019, 15(6): 3661–3669. [doi: 10.1109/TII.2019.2897364]
Simaan M, Cruz JBOn the Stackelberg strategy in nonzero-sum games197311553355510.1007/BF00935665
Simaan M, Cruz JB. On the Stackelberg strategy in nonzero-sum games. Journal of Optimization Theory and Applications, 1973, 11(5): 533–555. [doi: 10.1007/BF00935665]
Haddadi S, Ghasemi APricing-based Stackelberg game for spectrum trading in self-organised heterogeneous networks201610111374138310.1049/iet-com.2016.0033
Haddadi S, Ghasemi A. Pricing-based Stackelberg game for spectrum trading in self-organised heterogeneous networks. IET Communicatons, 2016, 10(11): 1374–1383. [doi: 10.1049/iet-com.2016.0033]
Lv XY, Zhang RT, Yue JJCompetition and cooperation between participants of the internet of things industry value chain201241140641210.4156/aiss.vol4.issue11.50
Lv XY, Zhang RT, Yue JJ. Competition and cooperation between participants of the internet of things industry value chain. International Journal on Advances in Information Sciences and Service Sciences, 2012, 4(11): 406–412. [doi: 10.4156/aiss.vol4.issue11.50] (查阅所有网上资料, 未找到对应的期号信息, 请联系作者确认)
Mao YX, Cheng T, Zhao HY, Shen NA strategic bargaining game for a spectrum sharing scheme in cognitive radio-based heterogeneous wireless sensor networks20171712273710.3390/s17122737
Mao YX, Cheng T, Zhao HY, Shen N. A strategic bargaining game for a spectrum sharing scheme in cognitive radio-based heterogeneous wireless sensor networks. Sensors, 2017, 17(12): 2737. [doi: 10.3390/s17122737]
Moulik S, Misra S, Gaurav ACost-effective mapping between wireless body area networks and cloud service providers based on multi-stage bargaining20171661573158610.1109/TMC.2016.2571286
Moulik S, Misra S, Gaurav A. Cost-effective mapping between wireless body area networks and cloud service providers based on multi-stage bargaining. IEEE Transactions on Mobile Computing, 2017, 16(6): 1573–1586. [doi: 10.1109/TMC.2016.2571286]
Azimi SM, Manshaei MH, Hendessi FCooperative primary-secondary dynamic spectrum leasing game via decentralized bargaining201622375576410.1007/s11276-015-0999-8
Azimi SM, Manshaei MH, Hendessi F. Cooperative primary-secondary dynamic spectrum leasing game via decentralized bargaining. Wireless Networks, 2016, 22(3): 755–764. [doi: 10.1007/s11276-015-0999-8]
An D, Yang QY, Yu W, Yang XY, Fu XW, Zhao WSto2Auc: A stochastic optimal bidding strategy for microgrids2017462260227410.1109/JIOT.2017.2764879
An D, Yang QY, Yu W, Yang XY, Fu XW, Zhao W. Sto2Auc: A stochastic optimal bidding strategy for microgrids. IEEE Internet of Things Journal, 2017, 4(6): 2260–2274. [doi: 10.1109/JIOT.2017.2764879]
An D, Yang QY, Yu W, Yang XY, Fu XW, Zhao WSODA: Strategy-proof online double auction scheme for multimicrogrids bidding20184871177119010.1109/TSMC.2017.2651072
An D, Yang QY, Yu W, Yang XY, Fu XW, Zhao W. SODA: Strategy-proof online double auction scheme for multimicrogrids bidding. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2018, 48(7): 1177–1190. [doi: 10.1109/TSMC.2017.2651072]
Nisan N, Ronen AComputationally feasible VCG mechanisms200729194710.1613/jair.2046
Nisan N, Ronen A. Computationally feasible VCG mechanisms. Journal of Artificial Intelligence Research, 2007, 29: 19–47. [doi: 10.1613/jair.2046]
Kirchkamp O, Poen E, Reiß JPOutside options: Another reason to choose the first-price auction200953215316910.1016/j.euroecorev.2008.03.005
Kirchkamp O, Poen E, Reiß JP. Outside options: Another reason to choose the first-price auction. European Economic Review, 2009, 53(2): 153–169. [doi: 10.1016/j.euroecorev.2008.03.005]
Edelman B, Ostrovsky M, Schwarz MInternet advertising and the generalized second-price auction: Selling billions of dollars worth of keywords2007971242259
Edelman B, Ostrovsky M, Schwarz M. Internet advertising and the generalized second-price auction: Selling billions of dollars worth of keywords. American Economic Review, 2007, 97(1): 242–259.