Web Crawling at Large Scale

Web crawling at scale is essential for aggregating data, monitoring websites, and feeding downstream applications such as machine learning models and analytics. This project developed a scalable crawling and re-crawling pipeline with Scrapy, stored the results in MongoDB, and exposed a machine learning model through a Flask API. The entire pipeline was Dockerized and deployed on AWS EC2 for robust, highly available operation.
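The re-crawling half of such a pipeline needs a policy for deciding when a page is due for another visit. The source does not specify the policy used here, so the following is a minimal, stdlib-only sketch of one common approach: a priority queue keyed by next-due timestamp, with a fixed revisit interval. The class name, the fixed interval, and the example URLs are all hypothetical; a production version would persist this state (e.g. in MongoDB) and vary the interval per site.

```python
import heapq
import time


class RecrawlScheduler:
    """Hypothetical sketch: schedule URLs for re-crawl after a
    fixed revisit interval, using an in-memory min-heap keyed by
    the timestamp at which each URL next becomes due."""

    def __init__(self, interval_seconds):
        self.interval = interval_seconds
        self._heap = []  # entries are (next_due_timestamp, url)

    def mark_crawled(self, url, now=None):
        """Record that `url` was just crawled; it becomes due
        again `interval_seconds` from `now`."""
        now = time.time() if now is None else now
        heapq.heappush(self._heap, (now + self.interval, url))

    def due_urls(self, now=None):
        """Pop and return every URL whose due time has passed,
        in order of how overdue it is."""
        now = time.time() if now is None else now
        due = []
        while self._heap and self._heap[0][0] <= now:
            _, url = heapq.heappop(self._heap)
            due.append(url)
        return due


scheduler = RecrawlScheduler(interval_seconds=3600)
scheduler.mark_crawled("https://example.com/a", now=0)
scheduler.mark_crawled("https://example.com/b", now=1800)
print(scheduler.due_urls(now=3600))  # only /a is due an hour in
print(scheduler.due_urls(now=5400))  # /b becomes due at 5400
```

In the real pipeline this logic would sit between the data store and the crawler: a periodic job asks the scheduler for due URLs and feeds them back to Scrapy as fresh requests.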