Skip to main navigation Skip to search Skip to main content

Short paper: Evaluating the capabilities of AI-based penetration testing tools

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Advances in AI are extending the capabilities of tools for penetration testing. However, due to a fragmented market and rapid technical developments the extent of capabilities and maturity of available tools are not well understood. This short paper provides an overview of this area by reviewing recent academic literature and proposing several assessment criteria. The literature review identifies an active and growing research field, with numerous advancements in the past 18 months. A focus on LLM agents has progressed capabilities towards exploiting zero-day vulnerabilities. Frontier models show superior performance when combined with a focused knowledge base and multi-agent architectures. However, in most cases human involvement is still required, and fully autonomous solutions are not yet evident. To evaluate the maturity of tools, 12 assessment criteria within four broad categories are proposed: AI sophistication, action capabilities, features, and requirements. Maturity levels are proposed for each criterion which enables an objective benchmark of tool capabilities.
Original languageEnglish
Title of host publicationComputer Security. ESORICS 2025 International Workshops
Subtitle of host publicationANUBIS 2025, SSECAI 2025, SecAssure 2025, STMUS 2025, Toulouse, France, September 22–24, 2025, Revised Selected Papers, Part II
EditorsRomain Laborde, Joaquin Garcia-Alfaro, Gregory Blanc, Pierre-François Gimenez, Harsha Kalutarage, Naoto Yanai, Ankur Shukla, Sandeep Pirbhulal, Joachim Posegga, Kwok-Yan Lam
Place of PublicationCham
PublisherSpringer
Pages296-305
Number of pages10
VolumePart II
ISBN (Electronic)9783032160928
ISBN (Print)9783032160911
DOIs
Publication statusPublished - 1 May 2026
EventWorkshop on Security and Artificial Intelligence - Toulouse, France
Duration: 25 Sept 202526 Sept 2025
https://sites.google.com/view/secai2025/home

Publication series

NameLecture Notes in Computer Science (LNCS)
PublisherSpringer
Volume16232
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Workshop

WorkshopWorkshop on Security and Artificial Intelligence
Abbreviated titleSECAI 2025
Country/TerritoryFrance
CityToulouse
Period25/09/2526/09/25
Internet address

Keywords

  • AI
  • Penetration testing
  • Assessment criteria

Fingerprint

Dive into the research topics of 'Short paper: Evaluating the capabilities of AI-based penetration testing tools'. Together they form a unique fingerprint.

Cite this