THUGON Website Scanner
Overview:
The THUGON Website Scanner is a desktop-based structural website analysis tool designed to crawl and map publicly accessible website architecture. It builds a structured inventory of reachable pages, linked resources, and exposed files starting from a defined entry URL.
This tool utilizes
Selenium browser automation to render modern websites exactly as a real user browser would, including JavaScript-driven and dynamically generated content. Unlike traditional crawlers, it analyzes fully rendered DOM output rather than raw source code.
The scanner operates as a
single-threaded crawler to reduce aggressive request patterns and minimize the likelihood of automated detection or blocking by web servers.
Download:
Download ZIP SHA256: D2D1AE03522423F4E011251DD0083B79D53A8A3E998B891AD088FBC588B8E192
How-To Video:
Watch on YouTube
What This Tool Does
This application systematically scans a website from a specified starting point and recursively explores internal links within the same domain. It evaluates HTTP response status codes (200, 404, 403, 500, etc.) and captures dynamically rendered content that may not be visible to basic crawlers.
During scanning, the tool collects and organizes:
- Accessible internal URLs with HTTP status codes
- External domain references
- Discovered file resources across all detected file extensions
- Malformed or broken links encountered during crawling
- Technical crawl errors or execution issues
Why This Tool Is Useful
Modern websites often contain hundreds or thousands of interconnected assets. Over time, content accumulates, structures change, and visibility into what is actually exposed to the public becomes limited.
The THUGON Website Scanner provides transparency into the real, publicly accessible structure of a website. By rendering pages through a real browser engine and capturing HTTP responses, it produces an accurate structural inventory that reflects how the website behaves in the real world.
This allows organizations to better understand:
- What pages and files are publicly reachable
- Which links return errors or invalid responses
- Where external dependencies exist
- How site architecture is interconnected
- Whether unexpected file types are exposed
Who Uses This Tool
Regulators
Regulatory bodies can review website structure to confirm required disclosures, public-facing materials, and compliance documentation are accessible and properly linked.
Security Professionals
Security teams can gain visibility into publicly reachable endpoints, file resources, and external dependencies that contribute to overall exposure and attack surface awareness.
Compliance Teams
Compliance professionals can document website structure and demonstrate due diligence through structured reporting of accessible resources.
Web Designers & Developers
Developers and designers can analyze legacy or complex site architectures to assist with restructuring, migration planning, and content validation.
Key Features
- Real browser-based rendering (Selenium-powered)
- Single-threaded controlled crawling
- Internal link discovery
- External domain tracking
- Dynamic detection of all file extensions
- HTTP status code analysis (200 / 403 / 404 / 500)
- Malformed link detection
- Structured Excel report generation
Output
After completion, the tool generates a structured Excel report containing:
- Visited URLs and corresponding HTTP status codes
- Detected crawl issues and execution errors
- External links grouped by domain
- Discovered file resources across all extensions
Disclaimer:
This software is provided "as is" and "where is" without warranties of any kind, either express or implied, including but not limited to accuracy, completeness, reliability, or fitness for a particular purpose. The user assumes all risks associated with its use.
By downloading or using this software, the user agrees to release and hold harmless the developer from any and all liability, damages, losses, or claims arising from its use or misuse. Users are solely responsible for ensuring they have proper authorization before scanning any website or domain.
This tool performs structural analysis only. It does not conduct vulnerability scanning, exploitation, penetration testing, or security intrusion activities.
This disclaimer may be modified at any time without prior notice.