LogoHackDB
icon of subwiz

subwiz

A lightweight GPT model trained to discover subdomains, aiding in reconnaissance and bug bounty hunting.

Introduction

Subwiz is a lightweight transformer model designed for subdomain discovery. It leverages a generative pre-trained transformer (GPT) architecture, specifically based on nanoGPT, to predict subdomains from a given set of known subdomains. Key features include:

  • Lightweight Model: With only 17.3M parameters, Subwiz offers efficient performance without requiring extensive computational resources.
  • Trained on Subdomain Data: The model is trained on 26M tokens of subdomain lists from passive sources, enhancing its ability to predict valid subdomains.
  • Custom Tokenizer: Utilizes a tokenizer trained on the same subdomain lists (8192 tokens) for optimized performance.
  • Beam Search Inference: Employs a beam search algorithm to predict the N most likely subdomain sequences, improving the accuracy of subdomain discovery.

Use cases:

  • Reconnaissance: Discovering subdomains for a target organization to expand the attack surface.
  • Bug Bounty Hunting: Identifying potential vulnerabilities in previously unknown subdomains.
  • Security Assessments: Augmenting existing subdomain enumeration techniques with AI-driven predictions.

Information

Newsletter

Join the Community

Subscribe to our newsletter for the latest news and updates