期刊文献+

不确定性数据管理技术研究综述 被引量:185

A Survey on the Management of Uncertain Data
在线阅读 下载PDF
导出
摘要 随着数据采集和处理技术的进步,人们对数据的不确定性的认识也逐步深入.在诸如经济、军事、物流、金融、电信等领域的具体应用中,数据的不确定性普遍存在.不确定性数据的表现形式多种多样,它们可以以关系型数据、半结构化数据、流数据或移动对象数据等形式出现.目前,根据应用特点与数据形式差异,研究者已经提出了多种针对不确定数据的数据模型.这些不确定性数据模型的核心思想都源自于可能世界模型.可能世界模型从一个或多个不确定的数据源演化出诸多确定的数据库实例,称为可能世界实例,而且所有实例的概率之和等于1.尽管可以首先分别为各个实例计算查询结果,然后合并中间结果以生成最终查询结果,但由于可能世界实例的数量远大于不确定性数据库的规模,这种方法并不可行.因此,必须运用排序、剪枝等启发式技术设计新型算法,以提高效率.文中介绍了不确定性数据管理技术的概念、特点与挑战,综述了数据模型、数据预处理与集成、存储与索引、查询处理等方面的工作. The importance of the data uncertainty was studied deeply with the rapid development in data gathering and processing in various fields, inclusive of economy, military, logistic, finance and telecommunication, etc. Uncertain data has many different styles, such as relational data, semistructured data, streaming data, and moving objects. According to scenarios and data characteristics, tens of data models have been developed, stemming from the core possible world model that contains a huge number of the possible world instances with the sum of probabilities equal to 1. However, the number of the possible world instances is far greater than the volume of the uncertain database, making it infeasible to combine medial results generated from all of possible world instances for the final query results. Thus, some heuristic techniques, such as ordering, pruning, must be used to reduce the computation cost for the high efficiency. This paper introduces the concepts, characteristics and challenges in uncertain data management, proposes the advance of the research on uncertain data management, including data model, preprocessing, in- tegrating, storage, indexing, and query processing.
出处 《计算机学报》 EI CSCD 北大核心 2009年第1期1-16,共16页 Chinese Journal of Computers
基金 国家自然科学基金(60803020) 上海市重点学科建设项目(B412)资助
关键词 不确定性数据 可能世界模型 数据集成 世系 不确定数据流 uncertain data possible world model data integration lineage uncertain stream
  • 相关文献

参考文献98

  • 1Deshpande A, viprin C, Madden S, Hellerstein J M, Hong W. Model-driven data acquisition in sensor networks// Proceedings of the 30th International Conference on Very Large Data Bases. Toronto, 2004:588-599
  • 2李建中,李金宝,石胜飞.传感器网络及其数据管理的概念、问题与进展[J].软件学报,2003,14(10):1717-1727. 被引量:622
  • 3谷峪,于戈,张天成.RFID复杂事件处理技术[J].计算机科学与探索,2007,1(3):255-267. 被引量:54
  • 4Madhavan J, Cohen S, Xin D, Halevy A, Jeffery S, Ko D, Yu C. Web-scale data integration: You can afford to pay as you go//Proceedings of the 33rd Biennial Conference on Innovative Data Systems Research. Asilomar, 2007:342-350
  • 5Liu Ling. From data privacy to location privacy: Models and algorithms (tutorial)//Proceedings of the 33rd International Conference on Very Large Data bases. Vienna, 2007: 1429- 1430
  • 6Samarati P, Sweeney L. Generalizing data to provide anonymity when disclosing information (abstract)//Proeeedings of the 17th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. Seattle, 1998:188
  • 7Cavallo R, Pittarelli M. The theory of probabilistic databases//Proceedings of the 13th International Conference on Very Large Data Bases. Brighton, 1987:71-81
  • 8Barbara D, Garcia-Molina H, Porter D. The management of probabilistic data. IEEE Transactions on Knowledge and Data Engineering, 1992, 4(5): 487-502
  • 9Fuhr N, Rolleke T. A probabilistic relational algebra for the integration of information retrieval and database systems. ACM Transactions on Information Systems, 1997, 15(1): 32-66
  • 10Zimanyi E. Query evaluation in probabilistic databases. Theoretical Computer Science, 1997, 171(1-2): 179-219

二级参考文献103

  • 1Ganesan D, Govindan R, Shenker S, Estrin D. Highly-Resilient, energy-efficient multipath muting in wireless sensor networks.Mobile Computing and Communications Review, 2002,1(2):295-298.
  • 2Braginsky D, Estrin D. Rumor routing algorithm for sensor networks. In: Raghavendra CS, ed. Proceedings of the 1st Workshop on Sensor Networks and Applications. New York: ACM Press, 2002.
  • 3Girod L, Bychkovskiy V, Elson J, Estrin D. Locating tiny sensors in time and space: A case study. In: Manoli Y, Kim KS, eds.Proceedings of the International Conference on Computer Design. Piscataway: IEEE Press, 2002. 195-204.
  • 4Bulusu N, Estrin D, Girod L, Heidemann J. Scalable coordination for wireless sensor networks: Self-Configuring localization systems. 2001. http://lecs.cs.ucla.edu/-bulusu/papers/Bulusu01c.html.
  • 5Cerpa A, Estrin D. ASCENT: Adaptive self-configuring sensor networks topologies. In: Kermani P, ed. Proceedings of the 21st International Annual Joint Conference of the IEEE Computer and Communications Societies. Piscataway: IEEE Press, 2002.101-111
  • 6Elson J. Time synchronization services for wireless sensor networks. In: Kumar V, ed. Proceedings of the 15th International Parallel & Distributed Processing Symposium. 2001. Los Alamitos: IEEE Computer Press, 2001. 1965-1970.
  • 7Ye W, Heidemann J, Estrin D. An energy-efficient MAC protocol for wireless sensor networks. In: Kermani P, ed. Proceedings of the 21st International Annual Joint Conference of the IEEE Computer and Communications Societies. Piscataway: IEEE Press,2002.91-100.
  • 8Heidemann J, Silva F, Intanagonwiwat C. Building efficient wireless sensor networks with low level naming. In: Marzullo K, ed.Proceedings of the 18th ACM Symposium on Operating System Principles. New York: ACM Press, 2001. 146-159.
  • 9Intanagonwiwat C, Govindan R, Estrin D, Heidemann J, Silva F. Directed diffusion for wireless sensor networking. ACM/IEEE Transactions on Networking, 2002, 11(1):2-16.
  • 10Liu J, Cheung P, Ouibas L, Zhao F. A dual-space approach to tracking and sensor management in wireless sensor networks. In:Reghavendrv CS, ed. Proceedings of the ACM International Workshop on Wireless Sensor Networks and Applications. New York:ACM Press, 2002. 162-173.

共引文献833

同被引文献2058

引证文献185

二级引证文献2170

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部