Speed-optimized Implementation of HIGHT Block Cipher Algorithm

Vol. 22, No. 3, pp. 495-504, Jun. 2012
DOI: 10.13089/JKIISC.2012.22.3.495
Keywords: HIGHT, block cipher, Bit-slice, Byte-slice, GPGPU, CUDA
Abstract

This paper presents various speed optimization techniques for software implementations of the HIGHT block cipher on CPUs and GPUs. For the CPU implementations, we considered both 32-bit and 64-bit operating systems. After applying the bit-slicing and byte-slicing techniques to HIGHT, we recorded an encryption speed of 1.48 Gbps on an Intel Core i7 920 CPU with a 64-bit operating system, which is up to 2.4 times faster than the previous implementation. We also implemented HIGHT on a CUDA-capable NVIDIA GPU and applied various optimization techniques, such as storing the most frequently used data (the subkeys and the F lookup table) in shared memory and using coalesced accesses when reading data from global memory. To our knowledge, this is the first result that implements and optimizes HIGHT on a GPU. We verified that the byte-slicing technique guarantees a speed-up of more than 20%, resulting in a speed that is 31 times faster than that on a CPU.
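
The byte-slicing idea mentioned above can be illustrated with a small sketch. It is not the authors' implementation; it only shows the data layout under stated assumptions: eight independent blocks are processed at once, the same byte position of each block is packed into one 64-bit word (lane 0 in the lowest byte), and HIGHT's byte-wise operations are then applied to all lanes with a few word-wide instructions. The round function F0(x) = (x <<< 1) XOR (x <<< 2) XOR (x <<< 7) follows the published HIGHT specification; all function and constant names (`pack`, `rotl8_lanes`, `add8_lanes`, and so on) are hypothetical. Python is used for readability only; a speed-oriented version would be written in C or CUDA with native 64-bit registers.

```python
LANES = 8                   # assumption: eight blocks share one 64-bit word
ONES = 0x0101010101010101   # 0x01 replicated into every byte lane


def pack(lane_bytes):
    """Pack one byte from each of 8 blocks into a 64-bit word (lane 0 lowest)."""
    word = 0
    for i, b in enumerate(lane_bytes):
        word |= (b & 0xFF) << (8 * i)
    return word


def unpack(word):
    """Split a 64-bit word back into its 8 byte lanes."""
    return [(word >> (8 * i)) & 0xFF for i in range(LANES)]


def rotl8_lanes(x, r):
    """Rotate every byte lane left by r bits (0 < r < 8) at once."""
    hi = ONES * ((0xFF << r) & 0xFF)   # bits that stay inside the lane after <<
    lo = ONES * (0xFF >> (8 - r))      # bits that wrap around to the bottom
    return ((x << r) & hi) | ((x >> (8 - r)) & lo)


def f0_lanes(x):
    """HIGHT F0 applied to all 8 lanes: (x<<<1) ^ (x<<<2) ^ (x<<<7)."""
    return rotl8_lanes(x, 1) ^ rotl8_lanes(x, 2) ^ rotl8_lanes(x, 7)


def add8_lanes(a, b):
    """Add mod 2^8 in every lane without carries crossing lane boundaries."""
    high = ONES * 0x80
    low = ONES * 0x7F
    # Add the low 7 bits of each lane, then fold in the top bits with XOR.
    return ((a & low) + (b & low)) ^ ((a ^ b) & high)


def f0_scalar(x):
    """Reference F0 on a single byte, used to check the sliced version."""
    rot = lambda v, r: ((v << r) | (v >> (8 - r))) & 0xFF
    return rot(x, 1) ^ rot(x, 2) ^ rot(x, 7)
```

For example, `unpack(f0_lanes(pack(values)))` should agree lane by lane with `[f0_scalar(v) for v in values]` for any list of eight byte values. The gain in a compiled implementation comes from replacing eight byte-wide operations with one word-wide sequence of shifts, masks, and XORs.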


Cite this article
[IEEE Style]
E. Baek and M. Lee, "Speed-optimized Implementation of HIGHT Block Cipher Algorithm," Journal of The Korea Institute of Information Security and Cryptology, vol. 22, no. 3, pp. 495-504, 2012. DOI: 10.13089/JKIISC.2012.22.3.495.

[ACM Style]
Eun-Tae Baek and Mun-Kyu Lee. 2012. Speed-optimized Implementation of HIGHT Block Cipher Algorithm. Journal of The Korea Institute of Information Security and Cryptology, 22, 3, (2012), 495-504. DOI: 10.13089/JKIISC.2012.22.3.495.