Unsafe Tools Activation for Translation Tasks: When Asking "Translate this ..." Can Corrupt Your Computer 


Vol. 2, No. 2, pp. 0-0, Oct. 2025
10.23246/AAIRJ.2025.02.02.02


  Abstract

Large Language Models (LLMs) equipped with tool-calling capabilities introduce new security vulnerabilities when performing seemingly benign language processing tasks. This paper demonstrates how translation, summarization, and explanation requests containing malicious foreign-language instructions can trigger unsafe tool activations in LLMs. Using a benchmark dataset of 150 unsafe instructions across Korean, Japanese, and Vietnamese, we evaluate tool activation rates across different task types. Our findings reveal that translation tasks are the most vulnerable, with an 86.67% tool activation rate, followed by summarization (86.00%) and explanation (69.33%). Surprisingly, prompt engineering safeguards ("make sure to only translate the content") prove highly effective for explanation tasks, reducing the activation rate from 69.33% to 0.00%, while remaining less effective for translation and summarization tasks. This research highlights security considerations for deploying tool-enabled LLMs in multilingual environments and shows that the effectiveness of mitigation strategies varies across task types. The benchmark dataset is available on Hugging Face Datasets and our experimental code is available on GitHub.
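
As a rough illustration of how the reported percentages relate to the 150-instruction benchmark, the sketch below computes a tool activation rate as the share of prompts that triggered a tool call. The per-task counts (130, 129, 104) are not stated in the abstract; they are back-computed from the reported rates on the assumption that every task type was evaluated on all 150 instructions. The function name `activation_rate` is hypothetical and is not taken from the authors' code.

```python
def activation_rate(activated: int, total: int) -> float:
    """Return the percentage of prompts that triggered a tool call."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100.0 * activated / total, 2)


# Benchmark size stated in the abstract.
TOTAL_INSTRUCTIONS = 150

# ASSUMPTION: counts inferred from the reported percentages,
# not reported directly in the abstract.
assumed_counts = {
    "translation": 130,
    "summarization": 129,
    "explanation": 104,
}

rates = {
    task: activation_rate(count, TOTAL_INSTRUCTIONS)
    for task, count in assumed_counts.items()
}

for task, rate in rates.items():
    print(f"{task}: {rate:.2f}%")
```

Under these assumed counts, the computed rates match the figures quoted above, which suggests each percentage corresponds to a whole number of activated prompts out of 150.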



  Cite this article

[IEEE Style]

M. Luu, "Unsafe Tools Activation for Translation Tasks: When Asking "Translate this ..." Can Corrupt Your Computer," AAIRJ, vol. 2, no. 2, pp. 0-0, 2025. DOI: 10.23246/AAIRJ.2025.02.02.02.

[ACM Style]

Mike Luu. 2025. Unsafe Tools Activation for Translation Tasks: When Asking "Translate this ..." Can Corrupt Your Computer. AAIRJ, 2, 2, (2025), 0-0. DOI: 10.23246/AAIRJ.2025.02.02.02.

[KICS Style]

Mike Luu, "Unsafe Tools Activation for Translation Tasks: When Asking "Translate this ..." Can Corrupt Your Computer," AAIRJ, vol. 2, no. 2, pp. 0-0, 2. 2025. (https://doi.org/10.23246/AAIRJ.2025.02.02.02)