Multimodal and Large Language Model Approaches in Cybersecurity: A Systematic Review

Main Article Content

Gupreet Singh
Trina Banerjee
Piyush
Mukthikka V

Abstract

The rapid evolution of cyber threats demands increasingly sophisticated defensive mechanisms. In recent years, Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs) have gained traction as valuable tools across multiple cybersecurity domains, offering capabilities that extend well beyond traditional rule-based and classical machine learning approaches. This systematic review provides a detailed analysis of 55 research papers published between 2019 and 2025, examining the application of LLMs and multimodal AI across eight key cybersecurity domains: vulnerability detection, malware analysis, phishing detection, network intrusion detection, cyber threat intelligence, security operations, penetration testing, and deepfake detection. We present a unified taxonomy that categorizes these approaches by their architectural type, covering encoder-only models (BERT variants), decoder-only models (GPT family), and multimodal architectures, as well as by their application domains. Our comparative analysis shows that while LLMs demonstrate strong capabilities in code comprehension, threat classification, and automated security analysis, notable challenges persist in areas such as hallucination, adversarial robustness, and the dual-use nature of these technologies. We further examine the security vulnerabilities present in LLMs themselves, including prompt injection and jailbreaking attacks. This review identifies open research gaps and proposes future directions, including agentic AI workflows, privacy-preserving security models, and the development of domain-specific foundation models for cybersecurity.

Article Details

Section

Articles

Author Biographies

Trina Banerjee

Trina Banerjee is an Integration Bus (IIB) Developer at Saks Global, where she works on enterprise integration and data-driven systems. Her professional and academic interests include data science, computational analytics, and emerging applications of artificial intelligence. She is particularly interested in interdisciplinary research that combines intelligent systems, data modeling, and advanced computational methods to address complex real-world challenges.

Piyush

Piyush is an undergrad student in University of Delhi, who is currently doing research in the field of MultiModality 

Mukthikka V

Mukthikka is currently a student at Bharath Institute of Higher Education and Research , Chennai, India. Her main research areas are multimodality

How to Cite

Multimodal and Large Language Model Approaches in Cybersecurity: A Systematic Review. (2026). International Journal of Engineering, Science and Environment (IJESE), 1(1). https://ijese.in/journal/article/view/18

References