Blog
Technical write-ups on backend engineering, data systems, and performance work.
- Achieving 100x Compression on Scraped Pricing Data in ScyllaDB
How we designed a storage layer for competitor hotel prices that compresses ~30 TB of raw scraped data into ~300 GB in ScyllaDB.
- From Custom Python per Customer to a Parameterized Pricing Engine
How we moved from per-customer pipelines (startup-speed trade-offs) to one parameterized engine, migrated gradually with no downtime, and shipped previews before saving changes.
- Building a Convergent Event Deduplication Pipeline with Graph Algorithms
How I built a system to deduplicate scraped events (concerts, shows, etc.) across multiple sources using connected components and geospatial clustering.