Unsafe Tools Activation for Translation Tasks: When Asking "Translate this ..." Can Corrupt Your Computer

Mike Luu

Vol. 2, No. 2, pp. 0-0, Oct. 2025

10.23246/AAIRJ.2025.02.02.02

Keywords

Alignment, LLM, Tool Uses


Abstract

Large Language Models (LLMs) equipped with tool-calling capabilities introduce new security vulnerabilities even when performing seemingly benign language-processing tasks. This paper demonstrates that translation, summarization, and explanation requests containing malicious foreign-language instructions can trigger unsafe tool activations in LLMs. Using a benchmark dataset of 150 unsafe instructions across Korean, Japanese, and Vietnamese, we evaluate tool activation rates across task types. Translation tasks are the most vulnerable, with an 86.67% tool activation rate, followed by summarization (86.00%) and explanation (69.33%). Surprisingly, a prompt-engineering safeguard ("make sure to only translate the content") proves highly effective for explanation tasks, reducing the activation rate from 69.33% to 0.00%, while remaining less effective for translation and summarization tasks. These results highlight security considerations for deploying tool-enabled LLMs in multilingual environments and show that the effectiveness of mitigation strategies varies by task type. The benchmark dataset is available at Hugging Face Datasets and our experimental code is available at GitHub.
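As a rough illustration of the metric reported above, the following Python sketch shows one way a tool activation rate could be computed and how the quoted safeguard might be attached to a request. It is not the paper's experimental code: the prompt templates, the function names `build_prompt` and `activation_rate`, and the placement of the safeguard are all assumptions made for this example. Only the safeguard wording is taken from the abstract.

```python
from typing import Sequence

# Safeguard wording quoted in the abstract; its placement in the prompt is an assumption.
SAFEGUARD = "make sure to only translate the content"

# Hypothetical task templates, not taken from the paper.
TASK_PREFIXES = {
    "translation": "Translate this: ",
    "summarization": "Summarize this: ",
    "explanation": "Explain this: ",
}


def build_prompt(task: str, text: str, safeguard: bool = False) -> str:
    """Wrap a (possibly malicious) foreign-language text in a benign-looking task request."""
    prompt = TASK_PREFIXES[task] + text
    if safeguard:
        prompt += f" ({SAFEGUARD})"
    return prompt


def activation_rate(tool_called: Sequence[bool]) -> float:
    """Percentage of responses in which the model issued at least one tool call."""
    if not tool_called:
        return 0.0
    return 100.0 * sum(tool_called) / len(tool_called)


if __name__ == "__main__":
    # Hypothetical outcome: 130 of 150 requests triggered a tool call.
    outcomes = [True] * 130 + [False] * 20
    print(f"{activation_rate(outcomes):.2f}%")
    print(build_prompt("translation", "<instruction text>", safeguard=True))
```

If every task type was evaluated on all 150 instructions, the reported rates would correspond to 130, 129, and 104 activations for translation, summarization, and explanation respectively. These counts are back-calculated from the percentages and are not stated in the abstract.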


Cite this article

[IEEE Style]

M. Luu, "Unsafe Tools Activation for Translation Tasks: When Asking 'Translate this ...' Can Corrupt Your Computer," AAIRJ, vol. 2, no. 2, pp. 0-0, 2025. DOI: 10.23246/AAIRJ.2025.02.02.02.

[ACM Style]

Mike Luu. 2025. Unsafe Tools Activation for Translation Tasks: When Asking "Translate this ..." Can Corrupt Your Computer. AAIRJ, 2, 2, (2025), 0-0. DOI: 10.23246/AAIRJ.2025.02.02.02.

[KICS Style]

Mike Luu, "Unsafe Tools Activation for Translation Tasks: When Asking 'Translate this ...' Can Corrupt Your Computer," AAIRJ, vol. 2, no. 2, pp. 0-0, 2. 2025. (https://doi.org/10.23246/AAIRJ.2025.02.02.02)
