TAG

Posts tagged "Engineering"

4 posts

Le Cheval de Troie
Engineering8 min read

Headless browser fingerprinting in 2026: how detection works and what to do

A technical breakdown of the signals anti-bot services use to detect headless browsers, and the patches that close the gap.

The Eclipse in Venice
Engineering8 min read

Scraping JavaScript-heavy SPAs: Next.js, Nuxt, and React in 2026

Why plain HTTP fetching returns empty pages on modern frontends, what render targets work, and how to recover server-shipped data without a headless browser.

Seascape Study with Rain Cloud
Engineering7 min read

How to scrape Cloudflare-protected sites without getting blocked

A practical, layered approach to defeating Cloudflare's bot challenges in 2026. TLS fingerprints, hardened headless, cookie persistence, and when to escalate.

An Experiment on a Bird in the Air Pump
Engineering7 min read

LLM extraction vs CSS selectors: why selector-based scraping is dead at scale

Selectors break when sites redesign. LLMs extract by semantic meaning. Here's why the tradeoff has flipped, with cost numbers from real workloads.