Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider Including RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents #5

Open
laysakura opened this issue Feb 25, 2025 · 1 comment

Comments

@laysakura
Copy link

Hi,

We would like to suggest adding our recent paper to the Awesome-LLM4Cybersecurity repository.

Summary: RapidPen is a fully autonomous penetration testing framework that uses an LLM-based agent to achieve initial system access (IP-to-Shell) without any human intervention. It employs advanced ReAct-style task planning with a retrieval-augmented exploit knowledge base and an execution-feedback loop to autonomously scan services, identify attack vectors, and execute exploits. In a HackTheBox evaluation, RapidPen obtained shell access within minutes (≈200–400 seconds) and achieved ~60% success when leveraging prior success-case data, illustrating the potential of truly automated pentesting for both novices and experts.

Suggested Category: LLM Assisted Attack (applications of LLMs in cybersecurity)

Additional Reference: [Demo] RapidPen Automatically Gains a Shell (HTB Blue Machine)

Thank you for considering our work!

@tmylla
Copy link
Owner

tmylla commented Feb 26, 2025

Thank you for your suggestion!

We appreciate your contribution and find that this work is highly relevant to the LLM Assisted Attack theme.

We will include your paper in our next repository update, which is scheduled to be released within the next few days, likely by the end of this month or early next month.

Once again, thank you for bringing this to our attention, and we look forward to featuring your work in the Awesome-LLM4Cybersecurity repository!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants