HTTrack is a free, open-source tool used for offline browsing and website mirroring. It allows users to download entire websites (or specific parts) to local storage for offline viewing, analysis, or archival purposes. HTTrack is available for Windows and Linux (including Kali Linux).
HTTrack in Windows
Installation
-
Download the installer from the official website:
🔗 https://www.httrack.com -
Run the installer and follow the setup wizard.
-
Launch WinHTTrack (GUI version) or use the command line (
httrack).
Usage (GUI – WinHTTrack)
-
Open WinHTTrack.
-
Set a project name and destination path.
-
Enter the URL(s) of the website(s) to mirror.
-
Configure options:
-
Mirroring depth (how many levels deep to download).
-
Filters (include/exclude specific files).
-
Robots.txt compliance (respect or ignore).
-
-
Click “Next” and start the mirroring process.
Command-Line Usage (Windows CMD)
-
-O: Output directory -
-v: Verbose mode
HTTrack in Kali Linux
Installation
HTTrack is pre-installed in Kali Linux. If missing, install via:
Usage (CLI)
Basic command:
Common Options
| Option | Description |
|---|---|
-O <dir> |
Output directory |
-%v |
Verbose mode |
-rN |
Max recursion depth (e.g., -r5 for 5 levels) |
-F <UA> |
Custom user-agent (e.g., -F "Mozilla/5.0") |
--robots=0 |
Ignore robots.txt restrictions |
-i |
Continue interrupted mirroring |
Example (Advanced)
GUI Version in Kali (Optional)
Install the GUI version (webhttrack):
Then run:
Key Features of HTTrack
✅ Mirror entire websites (HTML, images, CSS, JS).
✅ Resume interrupted downloads.
✅ Custom filters (include/exclude file types).
✅ Proxy support (for anonymity).
✅ Cross-platform (Windows, Linux, macOS).
Legal & Ethical Considerations
⚠ Use responsibly!
-
Mirroring publicly available websites for personal use is generally acceptable.
-
Avoid unauthorized scraping of private/copyrighted content.
-
Check the website’s
robots.txtbefore mirroring.