论文标题
基于软SIMD的节能计算微体系结构
A Soft SIMD Based Energy Efficient Computing Microarchitecture
论文作者
论文摘要
当今机器学习算法的尺寸和计算复杂性不断增加,对基础硬件的压力越来越大。从这个角度来看,需要新颖和专用的架构解决方案来通过利用机会(例如算法的内在并行性和鲁棒性)来优化能源效率。我们在这里通过引入灵活的两阶段计算管道来应对这一挑战。该管道可以通过软件支持的单个指令多个数据(SIMD)操作来支持细粒的操作数量化。此外,由于零钉和规范的签名数字(CSD)编码,它可以有效地对SIMD子字上的顺序乘法。最后,轻巧的重新包装单元允许在运行时动态更改子字的位。这些功能在能量和区域预算紧张的范围内实施。确实,实验结果表明,我们的方法在区域和能源需求方面都大大优于传统硬件SIMD。尤其是,我们的管道比硬件SIMD高达53.1%,同时执行乘法高达88.8%。
The ever-increasing size and computational complexity of today's machine-learning algorithms pose an increasing strain on the underlying hardware. In this light, novel and dedicated architectural solutions are required to optimize energy efficiency by leveraging opportunities (such as intrinsic parallelism and robustness to quantization errors) exposed by algorithms. We herein address this challenge by introducing a flexible two-stages computing pipeline. The pipeline can support fine-grained operand quantization through software-supported Single Instruction Multiple Data (SIMD) operations. Moreover, it can efficiently execute sequential multiplications over SIMD sub-words thanks to zero-skipping and Canonical Signed Digit (CSD) coding. Finally, a lightweight repacking unit allows changing the bitwidth of sub-words at run-time dynamically. These features are implemented within a tight energy and area budget. Indeed, experimental results showcase that our approach greatly outperforms traditional hardware SIMD ones both in terms of area and energy requirements. In particular, our pipeline occupies up to 53.1% smaller than a hardware SIMD one supporting the same sub-word widths, while performing multiplication up to 88.8% more efficiently.