pio
all i've ever wanted
- Joined
- Oct 26, 2025
- Posts
- 706
- Reputation
- 703
Hello guys i made this for @Hernan in case there's an apocalypse or something, but you still want to ldar on org.
https://docs.google.com/document/d/1_hBTGnvq1rM7hUMDACPbtKWEAeZeanTmF2f-cFQW_kg/edit?usp=sharing. ( FULL GUIDE HERE WITH .py to paste)
https://www.python.org/downloads/ + make sure to check "Add Python to PATH" to download pip.
Run in command center/terminal: “pip install requests beautifulsoup4 cloudscraper”
Save this script as “looksmax_archiver.py”
https://gofile.io/d/mX9bpJ
In case the gofile link expires ill paste the full script in another tab. (Just paste it into a .txt and rename it to looksmax_archiver.py)
Now run the script: “python looksmax_archiver.py”
It will run automatically and:
Discover all forums from the homepage, it will go through every page, through every thread listing. Scrape every post on every thread page. Save everything to looksmax_archive.db (SQLite)
import sqlite3
conn = sqlite3.connect("looksmax_archive.db")
# Search all posts for a keyword
results = conn.execute(
"SELECT author, content FROM posts WHERE content LIKE '%looksmaxxing%'"
).fetchall()
# Get all threads in a forum
threads = conn.execute(
"SELECT title, reply_count FROM threads WHERE forum_id='123'"
).fetchall()
lmk if this works it should
https://docs.google.com/document/d/1_hBTGnvq1rM7hUMDACPbtKWEAeZeanTmF2f-cFQW_kg/edit?usp=sharing. ( FULL GUIDE HERE WITH .py to paste)
Step 1.
Install your Python dependencies.https://www.python.org/downloads/ + make sure to check "Add Python to PATH" to download pip.
Run in command center/terminal: “pip install requests beautifulsoup4 cloudscraper”
Step 2.
Save this script as “looksmax_archiver.py”
https://gofile.io/d/mX9bpJ
In case the gofile link expires ill paste the full script in another tab. (Just paste it into a .txt and rename it to looksmax_archiver.py)
Now run the script: “python looksmax_archiver.py”
Step 3.
It will run automatically and:
Discover all forums from the homepage, it will go through every page, through every thread listing. Scrape every post on every thread page. Save everything to looksmax_archive.db (SQLite)
- Resume support — if it crashes or you stop it, just re-run and it picks up exactly where it left off. Nothing is re-scraped twice.
- Cloudflare bypass — uses cloudscraper which handles the JS challenge automatically
- Polite delays — 2–4 second random waits between requests so you don't get IP banned
- Retry logic — exponential backoff on failed requests
- Logging — writes to both terminal and archiver.log
Run this to browse:
import sqlite3
conn = sqlite3.connect("looksmax_archive.db")
# Search all posts for a keyword
results = conn.execute(
"SELECT author, content FROM posts WHERE content LIKE '%looksmaxxing%'"
).fetchall()
# Get all threads in a forum
threads = conn.execute(
"SELECT title, reply_count FROM threads WHERE forum_id='123'"
).fetchall()
lmk if this works it should