可重构媒体处理器任务编译器的前端设计

doi:10.13190/jbupt.201103.108.364

北京邮电大学学报 ›› 2011, Vol. 34 ›› Issue (3): 108-112.doi: 10.13190/jbupt.201103.108.364

可重构媒体处理器任务编译器的前端设计

殷崇勇,尹首一,刘雷波,杨超,朱敏,魏少军

1清华大学移动计算研究中心, 北京 100084; 2清华大学微电子学研究所, 北京 100084;  3清华信息科学与技术国家实验室, 北京 100084

收稿日期:2010-05-28 修回日期:2011-03-17 出版日期:2011-06-28 发布日期:2011-03-29
通讯作者: 殷崇勇 E-mail:yin-cy06@mails.tsinghua.edu.cn
基金资助:
国家高技术研究发展计划项目(2009AA011702)；国家自然科学基金项目(60803018)

FrontEnd Design of Task Compiler for Reconfigurable Multimedia Processor

Received:2010-05-28 Revised:2011-03-17 Online:2011-06-28 Published:2011-03-29

摘要/Abstract

摘要：

为了解决算法程序自动映射到可重构媒体处理器的问题，有效提高程序并行执行的效率，提出一种具有自动并行化的任务编译前端. 该任务编译前端通过展开核心循环可提高并行执行度，在数据依赖分析确保运算正确执行的基础上，对循环体内的数组访问进行标量替换，以优化数据传输开销. 实验结果表明，该任务编译前端能有效提高代码并行性和优化数据传输能力，与Garp C编译器的编译前端相比，该任务编译前端设计的性能可提升约2~4倍.

关键词: 可重构计算, 任务编译器, 循环展开, 标量替换

Abstract:

In order to automatically map algorithms onto reconfigurable multimedia processor and improve the parallel efficiency of algorithms, a task compiler frontend is designed. The kernel loop unrolling is introduced to improve the degree of parallelism; and the scalar replacement technique based on data dependence analysis is used to optimize the cost of data transmission. Experiments show that the task compiler frontend improves the degree of parallelism effectively, its performance can be, compared with the frontend of Garp C compiler, sharply increased up to 2~4 times in the whole system.

Key words: reconfigurable computing, task compiler, loop unrolling, scalar replacement

殷崇勇,尹首一,刘雷波,杨超,朱敏,魏少军. 可重构媒体处理器任务编译器的前端设计[J]. 北京邮电大学学报, 2011, 34(3): 108-112.

可重构媒体处理器任务编译器的前端设计

FrontEnd Design of Task Compiler for Reconfigurable Multimedia Processor

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 0

编辑推荐

Metrics

本文评价