北京邮电大学学报

  • EI核心期刊

北京邮电大学学报 ›› 2018, Vol. 41 ›› Issue (4): 16-22.doi: 10.13190/j.jbupt.2017-234

• 论文 • 上一篇    下一篇

一种基于查询聚类的物化视图动态调整策略

冯霞1,2, 张江2, 左海超1   

  1. 1. 中国民航信息技术科研基地, 天津 300300;
    2. 中国民航大学 计算机科学与技术学院, 天津 300300
  • 收稿日期:2017-11-08 出版日期:2018-08-28 发布日期:2018-10-09
  • 作者简介:张江(1992-),男,硕士生,E-mail:jzhangmike@163.com;冯霞(1970-),女,教授,硕士生导师.
  • 基金资助:
    国家自然科学基金项目(61502499);中央高校科研业务费专项资金项目(3122015z007)

A Dynamic Adjustment Strategy of Materialized Views Based on Query Clustering

FENG Xia1,2, ZHANG Jiang2, ZUO Hai-chao1   

  1. 1. Information Technology Research Base of Civil Aviation Administration of China, Tianjin 300300, China;
    2. College of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300, China
  • Received:2017-11-08 Online:2018-08-28 Published:2018-10-09

摘要: 为了提高数据仓库的查询响应性能,避免视图集频繁调整引发的"抖动性",提出了一种基于查询聚类的物化视图动态调整策略,运用关联规则挖掘方法计算属性字段相似性,进而计算查询语句相似性,并对一个查询周期内的查询语句集进行聚类,产生候选视图集,根据效益模型计算候选视图的效益,再运用物化视图动态调整算法生成物化视图.在航空公司机票结算数据集上的实验结果表明,在单机环境和分布式环境下,较基准算法相比,所提出的方法均能显著提升数据仓库的查询响应性能,尤其是对高频查询语句的响应性能.

关键词: 数据仓库, 物化视图集, 动态选择, 查询聚类, 属性字段相似度

Abstract: In order to improve performance of query response of data warehouse, and avoid the frequent "jitter" phenomenon for materialized views set caused by immediate adjustment algorithm, a dynamic adjustment strategy of materialized views based on query clustering is presented. Firstly, attribute similarity can be calculated based on method of mining association rules, then queries similarity can be calculated and candidate views set can be generated by clustering the queries set during a statistical time, and then the benefits of candidate views can be calculated according to benefit model. Finally, the latest materialized views can be selected using dynamic management algorithm of materialized views. Based on the experimental results with data of air ticket settlement recorded by airlines. Whether in single-machine environment or distributed environment, compared to other benchmark algorithms, the overall performance of query response of data warehouse has been improved greatly, especially for high frequency queries.

Key words: data warehouse, materialized views set, dynamic selection, query clustering, attribute similarity

中图分类号: