JCSE, vol. 18, no. 3, pp.144-151, 2024
DOI: http://dx.doi.org/10.5626/JCSE.2024.18.3.144
A Robust Decryption Technique Using Letter Frequency Analysis for Short Monoalphabetic Substitution Ciphers
Dayeong Kang and Jiyeon Lee
School of Computer Science and Engineering, Kyungpook National University, Daegu, Korea
Abstract: Substitution ciphers?봶here characters are systematically replaced with different ones?봦ave represented a vital tool in safeguarding sensitive information for centuries, eventually laying the foundation for modern encryption techniques. Despite their large key space, substitution ciphers are susceptible to frequency analysis that leverages common English letter patterns. While existing research has suggested certain methods that can improve decryption accuracy using ngram frequencies, these methods face difficulties when used with short ciphertexts due to incomplete letter distribution representation. The present study examines the limitations of current frequency analysis in decrypting short ciphertexts, with the results revealing that deterministic bigram approaches can reduce accuracy in certain cases. To address this shortcoming, we introduce a novel algorithm that uses randomized index selection based on letter distribution to generate multiple candidate keys. We also present a word-level key guessing method using these candidates that maps prominent English words to uncover a secret key. The results of tests with 200 ciphertexts of varying lengths showed an average decryption accuracy of 84.1% for 200-character ciphertexts, an improvement of 147.1% over existing methods. In experiments without dictionary-based decryption, an accuracy of 77.6% was achieved with a decryption time of approximately 0.27 seconds, which is a reasonable completion time. Altogether, these results highlight the efficiency and practicality of our approach for decrypting short ciphertexts.
Keyword:
Cryptography; Cryptoanalysis; Monoalphabetic cipher; Letter frequency analysis
Full Paper: 29 Downloads, 64 View
|