English-Korean Jailbreak Prompt Dataset Construction and Performance Analysis of Jailbreak Prompt Classification Models
Vol. 35, No. 3, pp. 613-622, June 2025
DOI: 10.13089/JKIISC.2025.35.3.613
Keywords: LLM, Jailbreak Attack, text classifier, data augmentation
Cite this article
[IEEE Style]
박대얼, 최대선, 장현준, 윤두식, "English-Korean Jailbreak Prompt Dataset Construction and Performance Analysis of Jailbreak Prompt Classification Models," Journal of The Korea Institute of Information Security and Cryptology, vol. 35, no. 3, pp. 613-622, 2025. DOI: 10.13089/JKIISC.2025.35.3.613.
[ACM Style]
박대얼, 최대선, 장현준, and 윤두식. 2025. English-Korean Jailbreak Prompt Dataset Construction and Performance Analysis of Jailbreak Prompt Classification Models. Journal of The Korea Institute of Information Security and Cryptology, 35, 3, (2025), 613-622. DOI: 10.13089/JKIISC.2025.35.3.613.