Search on blog:

Interesting links [2021.01.09]

  • toscrape.com

    Web Scraping Sandbox created by Scrapinghub

    A fictional bookstore that desperately wants to be scraped. It's a safe place for beginners learning web scraping and for developers validating their scraping technologies as well.

    Available at: books.toscrape.com

    A website that lists quotes from famous people. It has many endpoints showing the quotes in many different ways, each of them including new scraping challenges for you, - Default - Microdata and pagination - Scroll - infinite scrolling pagination - JavaScript - JavaScript generated content - Delayed - Same as JavaScript but with a delay (?delay=10000) - Tableful - a table based messed-up layout - Login - login with CSRF token (any user/passwd works) - ViewState - an AJAX based filter form with ViewStates - Random - a single random quote

    Available at: quotes.toscrape.com

  • Scrapy

    An open source and collaborative framework for extracting the data you need from websites. In a fast, simple, yet extensible way. Maintained by Scrapinghub and many other contributors

    Documentation: Scrapy

  • Scrapy Video Tutorials

    Free Scrapy Tutorials To Learn Web Scraping created by Scrapinghub

If you like it
Buy a Coffee