Critical ReviewComputer Systems

PROMPTFLUX: When Malware Uses LLMs at Runtime

The first malware families—PROMPTFLUX and PROMPTSTEAL—that invoke large language models at runtime have been documented, marking a shift from static attack scripts to adaptive, language-model-driven intrusion chains.

By Sean K.S. Shin

This blog summarizes research trends based on published paper abstracts. Specific numbers or findings may contain inaccuracies. For scholarly rigor, always consult the original papers cited in each post.

Malware has historically operated on deterministic logic. A payload executes a predefined sequence—exploit a vulnerability, establish persistence, exfiltrate data—using code that was static from the moment of compilation. The attacker's intelligence was embedded before deployment; the malware itself was, in computational terms, unintelligent.

That assumption no longer holds. The MDPI AI survey on AI-driven cybersecurity, cross-referenced with Trend Micro's first-half 2025 threat intelligence report, documents the emergence of PROMPTFLUX and PROMPTSTEAL—the first malware families that invoke large language models at runtime. These are not AI-assisted attack tools where a human operator uses ChatGPT to write phishing emails. These are autonomous malicious programs that call LLM APIs during execution to adapt their behavior, generate context-appropriate social engineering content, and modify their attack strategies based on the target environment.

The distinction matters. A static phishing template can be fingerprinted and blocked. A malware instance that generates unique, contextually tailored communications on every execution—drawing on an LLM's ability to mimic writing styles, respond to security prompts, and construct plausible pretexts—presents a fundamentally different detection challenge.

The Research Landscape: AI in Cybersecurity, Both Sides

The dual-use nature of AI in security is not new in principle, but the scale of adoption in 2025 has accelerated on both offense and defense simultaneously.

Defensive adoption: AI cybersecurity tool adoption has increased substantially according to the surveyed literature, driven primarily by the need to process alert volumes that exceed human analyst capacity. Security operations centers (SOCs) now routinely deploy ML-based anomaly detection, natural language processing for log analysis, and automated incident triage. The economic logic is straightforward: the volume of network telemetry, endpoint events, and application logs generated by modern infrastructure exceeds what human teams can review, and attack dwell times punish slow detection.

Offensive evolution: On the adversarial side, LLMs have lowered the skill barrier for constructing sophisticated attacks. Pre-LLM social engineering required linguistic competence in the target's language, knowledge of organizational context, and the patience to craft individualized messages. LLMs commoditize all three capabilities. What PROMPTFLUX and PROMPTSTEAL add is the removal of the human operator from the loop entirely—the malware itself handles the adaptive communication.

Institutional awareness: a large majority of security leaders surveyed report that they are preparing for routine AI-powered attacks, suggesting that the threat is no longer hypothetical in the minds of practitioners, even if documented in-the-wild cases remain limited to these initial families.

Critical Analysis

Claim	Source Evidence	Verdict
First malware families using LLMs at runtime have been identified (PROMPTFLUX, PROMPTSTEAL)	Documented in Trend Micro 1H 2025 threat intelligence and corroborated in MDPI AI survey	✅ Supported — novel finding with named malware families
AI cybersecurity tool adoption increased substantially	Reported in MDPI AI survey synthesis across multiple industry studies	⚠️ Aggregated metric — methodology of aggregation across studies may vary
a large majority of security leaders preparing for routine AI-powered attacks	Survey data reported in the literature synthesis	⚠️ Survey-dependent — sample composition and question framing affect interpretation
LLMs are used for both attack and defense in cybersecurity	Multiple documented use cases on both sides	✅ Supported — well-established dual-use pattern

What the Evidence Shows

PROMPTFLUX represents a qualitative shift rather than a quantitative escalation. The malware does not simply use AI to be "better" at what malware already did—it introduces a new capability class. By invoking LLMs at runtime, the malware can:

Generate contextually appropriate communications tailored to the specific target, including mimicking internal communication styles observed during reconnaissance

Adapt evasion strategies based on detected security controls, potentially reformulating payloads or communication patterns when initial approaches are blocked

Respond to security analyst interactions during incident response, potentially extending dwell time by producing plausible explanations for anomalous activity

PROMPTSTEAL, the second documented family, focuses specifically on credential harvesting, using LLM-generated prompts to construct convincing authentication pages and session hijacking pretexts.

What Remains Uncertain

The prevalence of LLM-runtime malware in the wild is difficult to assess. The documented cases may represent the visible edge of a larger trend, or they may be proof-of-concept outliers that have not yet achieved widespread adoption in the criminal ecosystem. The substantial adoption growth for defensive AI tools, while specific, aggregates across heterogeneous studies with different measurement approaches, making direct comparison difficult.

The cost structure also matters. Calling commercial LLM APIs from malware creates a financial trail and depends on API access that providers can revoke. Whether attackers will shift to locally hosted open-weight models (eliminating the API dependency) or develop novel access methods remains an open question.

Open Questions

Detection methodology: How should security tools identify LLM API calls originating from malware as distinct from legitimate application usage? Distinguishing malicious from legitimate calls within the same enterprise environment requires behavioral context that current tools may not capture.

Open-weight model implications: If attackers migrate from API-based LLMs to locally hosted models (Llama, Mistral, and others), API-level detection strategies become irrelevant. The security community has not yet developed approaches for detecting locally hosted LLM inference within malware execution environments.

Escalation dynamics: Will defensive AI and offensive AI enter a co-evolutionary arms race? Historical precedent from other dual-use technologies (cryptography, network security) suggests co-evolution rather than decisive advantage.

Closing Reflection

The appearance of PROMPTFLUX and PROMPTSTEAL marks a transition point—malware that thinks, adapts, and communicates using the same language models that power enterprise productivity tools. The substantial increase in defensive AI adoption suggests that defenders are not standing still, but the fundamental asymmetry of security—where attackers need to find one weakness while defenders must protect all surfaces—is amplified when the attacker's tool can generate novel approaches at machine speed. The question is no longer whether AI will reshape cybersecurity, but whether defensive applications of the same technology can maintain pace with adversarial innovation.

면책 조항: 이 게시물은 정보 제공 목적의 연구 동향 개요이다. 특정 연구 결과, 통계 및 주장은 학술 연구에서 인용하기 전에 원본 논문을 통해 검증해야 한다.

PROMPTFLUX: 악성코드가 런타임에 LLM을 활용할 때

악성코드는 역사적으로 결정론적 논리에 기반하여 작동해 왔다. 페이로드는 취약점 악용, 지속성 확립, 데이터 유출 등 사전에 정의된 일련의 과정을 실행하며, 이에 사용되는 코드는 컴파일 시점부터 정적인 상태로 고정된다. 공격자의 지능은 배포 이전에 내재되었으며, 악성코드 자체는 계산적 관점에서 지능이 없었다.

그 전제는 더 이상 유효하지 않다. AI 기반 사이버보안에 관한 MDPI AI 조사는 Trend Micro의 2025년 상반기 위협 인텔리전스 보고서와의 교차 참조를 통해, 런타임에 대규모 언어 모델(LLM)을 호출하는 최초의 악성코드 패밀리인 PROMPTFLUX와 PROMPTSTEAL의 등장을 기록하고 있다. 이것들은 인간 운영자가 ChatGPT를 사용하여 피싱 이메일을 작성하는 AI 보조 공격 도구가 아니다. 이것들은 실행 중 LLM API를 호출하여 자신의 행동을 적응시키고, 상황에 맞는 소셜 엔지니어링 콘텐츠를 생성하며, 대상 환경에 따라 공격 전략을 수정하는 자율적인 악성 프로그램이다.

이 차이는 중요하다. 정적인 피싱 템플릿은 핑거프린팅되어 차단될 수 있다. 그러나 매번 실행 시마다 고유하고 상황에 맞게 조정된 통신을 생성하는 악성코드 인스턴스—LLM의 문체 모방 능력, 보안 프롬프트에 대한 응답, 그럴듯한 구실 구성 능력을 활용하는—는 근본적으로 다른 탐지 문제를 제기한다.

연구 현황: 사이버보안에서의 AI, 양면

보안 분야에서 AI의 이중 사용 특성은 원칙적으로 새로운 것이 아니지만, 2025년의 도입 규모는 공격과 방어 양측에서 동시에 가속화되었다.

방어적 도입: 조사된 문헌에 따르면, AI 사이버보안 도구의 도입은 인간 분석가의 처리 용량을 초과하는 경보량을 처리해야 할 필요성에 주로 이끌려 상당히 증가하였다. 보안 운영 센터(SOC)는 현재 ML 기반 이상 탐지, 로그 분석을 위한 자연어 처리, 자동화된 인시던트 분류를 일상적으로 배포하고 있다. 경제적 논리는 명확하다. 현대 인프라에서 생성되는 네트워크 텔레메트리, 엔드포인트 이벤트 및 애플리케이션 로그의 양은 인간 팀이 검토할 수 있는 수준을 초과하며, 공격 체류 시간은 느린 탐지에 불이익을 준다.

공격적 진화: 적대적 측면에서 LLM은 정교한 공격 구성을 위한 기술 장벽을 낮추었다. LLM 이전의 소셜 엔지니어링은 대상 언어에 대한 언어 능력, 조직적 맥락에 대한 지식, 그리고 개별 메시지를 작성할 인내심을 필요로 했다. LLM은 이 세 가지 능력을 모두 상품화한다. PROMPTFLUX와 PROMPTSTEAL이 추가하는 것은 인간 운영자를 루프에서 완전히 제거하는 것이다—악성코드 자체가 적응형 통신을 처리한다.

기관의 인식: 조사 대상 보안 리더의 대다수는 일상적인 AI 기반 공격에 대비하고 있다고 응답하였으며, 이는 실제 문서화된 사례가 이 초기 패밀리들로 제한되더라도 실무자들의 인식 속에서 해당 위협이 더 이상 가상의 것이 아님을 시사한다.

비판적 분석

주장	출처 근거	판정
런타임에 LLM을 사용하는 최초의 악성코드 패밀리가 식별됨 (PROMPTFLUX, PROMPTSTEAL)	Trend Micro 2025년 상반기 위협 인텔리전스에 기록되었으며 MDPI AI 조사에서도 확인됨	✅ 지지됨 — 명명된 악성코드 패밀리를 포함한 새로운 발견
AI 사이버보안 도구 도입이 상당히 증가함	여러 업계 연구에 걸친 MDPI AI 조사 종합에서 보고됨	⚠️ 집계된 지표 — 연구 간 집계 방법론이 상이할 수 있음
보안 리더의 대다수가 일상적인 AI 기반 공격에 대비 중	문헌 종합에서 보고된 설문 데이터	⚠️ 설문 의존적 — 표본 구성 및 질문 구성 방식이 해석에 영향을 미침
LLM은 사이버보안에서 공격과 방어 모두에 활용된다	양측 모두에 걸친 다수의 문서화된 사례	✅ 지지됨 — 잘 확립된 이중 사용 패턴

증거가 보여주는 것

PROMPTFLUX는 양적 확장이 아닌 질적 전환을 나타낸다. 이 악성코드는 단순히 AI를 활용하여 기존 악성코드가 하던 일을 "더 잘" 수행하는 것이 아니라, 새로운 기능 범주를 도입한다. 런타임에 LLM을 호출함으로써 이 악성코드는 다음을 수행할 수 있다.

맥락에 적합한 통신 생성: 정찰 과정에서 파악한 내부 커뮤니케이션 방식을 모방하는 등 특정 대상에 맞춤화된 통신을 생성한다.

탐지된 보안 통제에 기반한 회피 전략 적응: 초기 접근이 차단될 경우 페이로드나 통신 패턴을 잠재적으로 재구성한다.

침해사고 대응 과정에서 보안 분석가와의 상호작용에 대응: 이상 활동에 대해 그럴듯한 설명을 생성함으로써 잠복 시간을 연장할 가능성이 있다.

두 번째로 문서화된 악성코드 계열인 PROMPTSTEAL은 자격 증명 탈취에 특화되어 있으며, LLM이 생성한 프롬프트를 활용하여 설득력 있는 인증 페이지와 세션 하이재킹 구실을 구성한다.

여전히 불확실한 것

야생에서 LLM 런타임 악성코드의 유병률을 평가하기는 어렵다. 문서화된 사례들은 더 큰 트렌드의 가시적인 최전선을 나타낼 수도 있고, 범죄 생태계에서 아직 광범위하게 채택되지 않은 개념 증명 단계의 사례들일 수도 있다. 방어용 AI 도구의 상당한 채택 성장은 구체적인 수치이긴 하지만, 서로 다른 측정 방식을 사용하는 이질적인 연구들을 집계한 것이므로 직접적인 비교가 어렵다.

비용 구조 또한 중요하다. 악성코드에서 상용 LLM API를 호출하면 금융 추적 경로가 생성되고, 제공업체가 취소할 수 있는 API 접근에 의존하게 된다. 공격자들이 로컬에 호스팅된 오픈 웨이트 모델(API 의존성 제거)로 전환할 것인지, 아니면 새로운 접근 방법을 개발할 것인지는 아직 열린 질문으로 남아 있다.

열린 질문들

탐지 방법론: 보안 도구는 악성코드에서 발생하는 LLM API 호출을 정상적인 애플리케이션 사용과 어떻게 구별해야 하는가? 동일한 기업 환경 내에서 악의적인 호출과 정상적인 호출을 구별하려면 현재 도구들이 포착하지 못할 수 있는 행동적 맥락이 필요하다.

오픈 웨이트 모델의 함의: 공격자들이 API 기반 LLM에서 로컬에 호스팅된 모델(Llama, Mistral 등)로 이전할 경우, API 수준의 탐지 전략은 무의미해진다. 보안 커뮤니티는 아직 악성코드 실행 환경 내에서 로컬에 호스팅된 LLM 추론을 탐지하는 방법을 개발하지 못했다.

확전 역학: 방어용 AI와 공격용 AI는 공진화적 군비 경쟁에 돌입하게 될 것인가? 다른 이중 사용 기술(암호학, 네트워크 보안)에서의 역사적 선례는 결정적 우위보다는 공진화를 시사한다.

마치며

PROMPTFLUX와 PROMPTSTEAL의 등장은 하나의 전환점을 표시한다. 이는 기업 생산성 도구를 구동하는 것과 동일한 언어 모델을 사용하여 생각하고, 적응하고, 소통하는 악성코드의 출현이다. 방어용 AI 채택의 상당한 증가는 방어자들이 정체되어 있지 않음을 시사하지만, 보안의 근본적인 비대칭성—공격자는 한 가지 약점만 찾으면 되는 반면 방어자는 모든 표면을 보호해야 한다—은 공격자의 도구가 기계 속도로 새로운 접근 방식을 생성할 수 있을 때 더욱 증폭된다. 이제 문제는 AI가 사이버보안을 재편할 것인지 여부가 아니라, 동일한 기술의 방어적 응용이 적대적 혁신의 속도를 따라잡을 수 있는지 여부이다.

References (3)

[1] Ahi, K. & Valizadeh, S. (2025). Large Language Models (LLMs) and Generative AI in Cybersecurity and Privacy: A Survey of Dual-Use Risks, AI-Generated Malware, Explainability, and Defensive Strategies. IEEE Silicon Valley Cybersecurity Conference.

DOI Scholar

[2] Google Threat Intelligence Group (2025). PROMPTFLUX Malware That Uses Gemini AI to Rewrite Its Code. The Hacker News.

Scholar

Silva, F. A. d. (2025). Navigating the dual-edged sword of generative AI in cybersecurity. Brazilian Journal of Development, 11(1), e76869.