A Multi-dimensional Index Structure Based on Improved VA-file and CAN in the Cloud 被引量：2

A Multi-dimensional Index Structure Based on Improved VA-file and CAN in the Cloud

导出

摘要 Currently,the cloud computing systems use simple key-value data processing,which cannot support similarity search efectively due to lack of efcient index structures,and with the increase of dimensionality,the existing tree-like index structures could lead to the problem of"the curse of dimensionality".In this paper,a novel VF-CAN indexing scheme is proposed.VF-CAN integrates content addressable network(CAN)based routing protocol and the improved vector approximation fle(VA-fle) index.There are two index levels in this scheme:global index and local index.The local index VAK-fle is built for the data in each storage node.VAK-fle is thek-means clustering result of VA-fle approximation vectors according to their degree of proximity.Each cluster forms a separate local index fle and each fle stores the approximate vectors that are contained in the cluster.The vector of each cluster center is stored in the cluster center information fle of corresponding storage node.In the global index,storage nodes are organized into an overlay network CAN,and in order to reduce the cost of calculation,only clustering information of local index is issued to the entire overlay network through the CAN interface.The experimental results show that VF-CAN reduces the index storage space and improves query performance efectively. Currently,the cloud computing systems use simple key-value data processing,which cannot support similarity search efectively due to lack of efcient index structures,and with the increase of dimensionality,the existing tree-like index structures could lead to the problem of"the curse of dimensionality".In this paper,a novel VF-CAN indexing scheme is proposed.VF-CAN integrates content addressable network(CAN)based routing protocol and the improved vector approximation fle(VA-fle) index.There are two index levels in this scheme:global index and local index.The local index VAK-fle is built for the data in each storage node.VAK-fle is thek-means clustering result of VA-fle approximation vectors according to their degree of proximity.Each cluster forms a separate local index fle and each fle stores the approximate vectors that are contained in the cluster.The vector of each cluster center is stored in the cluster center information fle of corresponding storage node.In the global index,storage nodes are organized into an overlay network CAN,and in order to reduce the cost of calculation,only clustering information of local index is issued to the entire overlay network through the CAN interface.The experimental results show that VF-CAN reduces the index storage space and improves query performance efectively.

作者 Chun-Ling Cheng Chun-Ju Sun Xiao-Long Xu Deng-Yin Zhang

机构地区 College of Computer Jiangsu High Technology Research Key Laboratory for Wireless Sensor Networks Key Lab of Broadband Wireless Communication and Sensor Network Technology

出处《International Journal of Automation and computing》 EI CSCD 2014年第1期109-117,共9页 国际自动化与计算杂志（英文版）

基金 supported by National Natural Science Foundation of China(No.61071093) Research and Innovation Projects for Graduates of Jiangsu Province(Nos.CXZZ12 0483 and CXLX12 0481) Science and Technology Support Program of Jiangsu Province(No.BE2012849) Priority Academic Program Development of Jiangsu Higher Education Institutions(No.yx002001)

关键词 Cloud computing index similarity search clustering vector approximation fle(VA-fle) content addressable network(CAN) Cloud computing index similarity search clustering vector approximation fle(VA-fle) content addressable network(CAN)

分类号 TP391.3 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1孟必平,王腾蛟,李红燕,杨冬青.分片位图索引:一种适用于云数据管理的辅助索引机制[J].计算机学报,2012,35(11):2306-2316. 被引量：30
2陈康,郑纬民.云计算:系统实例与研究现状[J].软件学报,2009,20(5):1337-1348. 被引量：1314

二级参考文献48

1Sims K. IBM introduces ready-to-use cloud computing collaboration services get clients started with cloud computing. 2007. http://www-03.ibm.com/press/us/en/pressrelease/22613.wss
2Boss G, Malladi P, Quan D, Legregni L, Hall H. Cloud computing. IBM White Paper, 2007. http://download.boulder.ibm.com/ ibmdl/pub/software/dw/wes/hipods/Cloud_computing_wp_final_8Oct.pdf
3Zhang YX, Zhou YZ. 4VP+: A novel meta OS approach for streaming programs in ubiquitous computing. In: Proc. of IEEE the 21st Int'l Conf. on Advanced Information Networking and Applications (AINA 2007). Los Alamitos: IEEE Computer Society, 2007. 394-403.
4Zhang YX, Zhou YZ. Transparent Computing: A new paradigm for pervasive computing. In: Ma JH, Jin H, Yang LT, Tsai JJP, eds. Proc. of the 3rd Int'l Conf. on Ubiquitous Intelligence and Computing (UIC 2006). Berlin, Heidelberg: Springer-Verlag, 2006. 1-11.
5Barroso LA, Dean J, Holzle U. Web search for a planet: The Google cluster architecture. IEEE Micro, 2003,23(2):22-28.
6Brin S, Page L. The anatomy of a large-scale hypertextual Web search engine. Computer Networks, 1998,30(1-7): 107-117.
7Ghemawat S, Gobioff H, Leung ST. The Google file system. In: Proc. of the 19th ACM Symp. on Operating Systems Principles. New York: ACM Press, 2003.29-43.
8Dean J, Ghemawat S. MapReduce: Simplified data processing on large clusters. In: Proc. of the 6th Symp. on Operating System Design and Implementation. Berkeley: USENIX Association, 2004. 137-150.
9Burrows M. The chubby lock service for loosely-coupled distributed systems. In: Proc. of the 7th USENIX Symp. on Operating Systems Design and Implementation. Berkeley: USENIX Association, 2006. 335-350.
10Chang F, Dean J, Ghemawat S, Hsieh WC, Wallach DA, Burrows M, Chandra T, Fikes A, Gruber RE. Bigtable: A distributed storage system for structured data. In: Proc. of the 7th USENIX Symp. on Operating Systems Design and Implementation. Berkeley: USENIX Association, 2006. 205-218.

共引文献1342

1查伟,孙燕琼,郑继平.基于云测试架构的FIVP解决方案[J].铁路技术创新,2021(S01):82-86.
2林少伟.人工智能法律主体资格实现路径:以商事主体为视角[J].中国政法大学学报,2021(3):165-177. 被引量：7
3胡祖林,肇杰.云计算下的网盘安全[J].计算机产品与流通,2020,0(1):164-164.
4张盛,任伟,王玉,黄金明,陈旭彤.基于Web的重力异常正演建模工具[J].地质论评,2023,69(S01):595-597.
5赵文韬.基于5G技术的黑龙江云计算产业发展[J].电子技术（上海）,2020,49(9):186-187.
6Longfei He,Mei Xue,Bin Gu.Internet-of-things enabled supply chain planning and coordination with big data services:Certain theoretic implications[J].Journal of Management Science and Engineering,2020,5(1):1-22. 被引量：6
7吴劲松,陈孚.云计算发展及应用研究[J].广西通信技术,2011(2):9-13. 被引量：5
8黄纬,温志萍,程初.云计算中基于K-均值聚类的虚拟机调度算法研究[J].南京理工大学学报,2013,37(6):807-812. 被引量：17
9孙凌宇,欧阳春娟,冷明,刘昌鑫,夏洁武.云计算与高等教育管理信息服务系统构建[J].山西财经大学学报,2012,34(S1). 被引量：9
10王荣荣.云计算技术基础上数字图书馆云服务平台的实现[J].河北北方学院学报（社会科学版）,2013,29(4):72-74. 被引量：2

同被引文献4

1支晓栋,林宗坚,苏国中,钟良.基于改进四叉树的LiDAR点云数据组织研究[J].计算机工程与应用,2010,46(9):71-74. 被引量：20
2杨寒冰,赵龙,贾金原.HBase数据库迁移工具的设计与实现[J].计算机科学与探索,2013,7(3):236-246. 被引量：11
3冯义从,岑敏仪,杨晓芸,张同刚.基于3D格网与哈希表的车载LiDAR点云八叉树索引[J].测绘科学,2014,39(6):104-107. 被引量：6
4周维,路劲,周可人,王世普,姚绍文.基于并发跳表的云数据处理双层索引架构研究[J].计算机研究与发展,2015,52(7):1531-1545. 被引量：5

引证文献2

1朱锐,王宏志,崔双双,张恺欣,燕钰.面向元宇宙的云边端协同大数据管理[J].大数据,2023,9(1):63-77. 被引量：8
2刘星平,罗湘运,杨海.基于HBase的高效交通数据云索引技术[J].控制工程,2016,23(4):560-564. 被引量：3

二级引证文献11

1张新兴.基于云计算的科学数据资源聚合系统研究[J].图书馆学研究,2017(21):60-64. 被引量：7
2罗海艳,杨勇,王珏,于海龙.基于云计算的移动用户上网行为分析系统[J].控制工程,2018,25(2):218-223. 被引量：3
3李剑锋,陈世平,段林茂,钮亮.一种支持范围查询的云数据空间索引研究[J].小型微型计算机系统,2018,39(5):967-972. 被引量：2
4李欣悦.移动终端智能及其在推荐场景的应用[J].互联网周刊,2023(6):54-56. 被引量：1
5王威.事件驱动模式下物联网数据交换平台的研究[J].福建电脑,2023,39(7):47-51.
6李相俊,刘晓宇,韩雪冰,杨佳涛,李睿.电化学储能电站数字化智能化技术及其应用展望[J].供用电,2023,40(8):3-12. 被引量：8
7王智,夏树涛,毛睿.基于边缘智能的沉浸式元宇宙关键技术与展望[J].大数据,2024,10(1):35-45. 被引量：2
8李宗辉.元宇宙产业发展的技术、模式与专利战略[J].科技管理研究,2023,43(22):221-227. 被引量：4
9王冀彬,杨海龙,冯凯,孙欣,张敏达,雷克伦,肖智文,张逸飞,吴佳熙.面向大数据场景的系统性能优化实践[J].大数据,2024,10(4):21-33.
10金瑶,张毅.数智技术赋能城市社区政民互动的技术路径与优化策略——以元宇宙技术为例[J].合肥工业大学学报（社会科学版）,2024,38(6):66-76.

1再掀多屏普及之风——蓝宝HD6450 FleX显卡[J].电脑迷,2012(1):38-38.
2黄寿孟.基于Flex的数据通信技术研究与应用[J].中国现代教育装备,2016(17):12-15. 被引量：8
3李焰峰,李真,李汉斌,张学杰.基于Content-Addressable Network的对等网络研究[J].云南大学学报（自然科学版）,2007,29(S2):249-253.
4陈苏海.基于VB的排序算法研究[J].电脑编程技巧与维护,2015(21):33-34.
5张展鹏,邹卫军.多总线数据记录与回放系统的设计与实现[J].工业控制计算机,2016,29(9):8-9. 被引量：2
6Minghe YU,Guoliang LI,Dong DENG,Jianhua FENG.String similarity search and join： a survey[J].Frontiers of Computer Science,2016,10(3):399-417. 被引量：4
7汪卫,王宇君,施伯乐.Dynamic Interval Index Structure in Constraint Database Systems[J].Journal of Computer Science & Technology,2000,15(6):542-551. 被引量：1
8徐晨.网页居中布局解决方案研究[J].科学咨询,2016,0(18):64-65.
9平小艳,陈华,史小春.基于Kademlia的结构化对等网络原理及其应用[J].科技信息,2008(24).
10近200家客户受益于开创性的InforFlex计划[J].CAD/CAM与制造业信息化,2010(4):3-3.

International Journal of Automation and computing

2014年第1期

浏览历史

内容加载中请稍等...