Web browser automation tools
Open source tools
Automa
- Automa is a browser extension for browser automation. It covers tasks such as auto-filling forms, running repetitive tasks, taking screenshots, and scraping data from websites
- Workflow is executed using the local browser
Scrapy
Scrapy is an open source and collaborative Python framework for web scraping, extracting the data you need from websites.
It is widely regarded as the de facto standard for web scraping in Python, and the best choice for websites that do not use any particular anti-bot protection.
Playwright
Playwright is a testing tool for web applications that is also useful for web scraping. It was first released in 2020.
Best choice when a fully rendered browser is needed to scrape a website.
This setup needs more computing power than a plain Scrapy installation, so use it only when a fully rendered browser is required.
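As a rough illustration of the "fully rendered browser" case, the sketch below uses Playwright's synchronous Python API to load a page in headless Chromium and return the HTML after JavaScript has run. The function name `fetch_rendered_html` and the `timeout_ms` default are assumptions made for this example.

```python
def fetch_rendered_html(url: str, timeout_ms: int = 30_000) -> str:
    """Return the page HTML after JavaScript has run in headless Chromium."""
    # Imported here so this module can be loaded even where Playwright is not installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            page = browser.new_page()
            # "networkidle" waits until network activity has settled,
            # which is a common (not universal) signal that rendering is done.
            page.goto(url, wait_until="networkidle", timeout=timeout_ms)
            return page.content()
        finally:
            browser.close()
```

It assumes `pip install playwright` followed by `playwright install chromium`. A call such as `fetch_rendered_html("https://example.com")` then returns the rendered markup, which can be handed to any HTML parser.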
Puppeteer
Puppeteer is a Node.js library which provides a high-level API to control Chrome or Chromium over the DevTools Protocol.
Puppeteer is a browser automation tool useful for web scraping. Its features are very similar to Playwright.
Best choice when we need a full browser to render the page and Playwright and Selenium can't help, or when we want to control the browser from JavaScript instead of Python.
Selenium
Selenium WebDriver is a web application testing suite that is also used for web scraping.
Playwright is preferred to Selenium WebDriver because it behaves more like a real user and is easier to install and configure.
PageProbe
- extension for Firefox only
- Unlimited number of trackers. Fully local solution - does not require an account
- Create automatic monitors for detecting and scanning changes on web pages
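The idea behind change monitors like this can be sketched with the Python standard library alone: keep a fingerprint of the last downloaded page and compare it with the next one. This is a generic technique, not a description of how PageProbe works internally, and the function names are hypothetical.

```python
import hashlib
import urllib.request
from typing import Optional


def fingerprint(content: bytes) -> str:
    """Return a SHA-256 hex digest of the page content."""
    return hashlib.sha256(content).hexdigest()


def fetch(url: str) -> bytes:
    """Download the raw body of a page (no JavaScript is executed)."""
    with urllib.request.urlopen(url, timeout=30) as resp:
        return resp.read()


def has_changed(previous_fingerprint: Optional[str], content: bytes) -> bool:
    """True when content differs from the last stored fingerprint.

    A missing previous fingerprint (first run) is not reported as a change.
    """
    if previous_fingerprint is None:
        return False
    return previous_fingerprint != fingerprint(content)
```

A scheduler would call `fetch`, pass the result to `has_changed` together with the stored fingerprint, and then save the new fingerprint. Hashing the whole body flags any byte-level difference, so pages with rotating ads or timestamps usually need the relevant element extracted first.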
Headless Recorder
- extension for Chromium only
- open-source extension that records browser interactions and generates Puppeteer and Playwright scripts
AutoHotkey
- AutoHotkey is a free, open-source scripting language for Windows that allows users to easily create small to complex scripts for all kinds of tasks such as: form fillers, auto-clicking, macros, etc.
- AutoHotkey gives you the freedom to automate any desktop task.
Commercial tools
Microsoft Power Automate
- Pricing starts at USD 15/month. If you are a Windows 10/11 user, you have access to personal desktop flows in Power Automate for desktop for free.
- This tool automates not only the web browser but also other processes.
Rocketride
- Pricing starts at USD 9/month. Free for 500 local runs and 30 cloud runs
- The task runner uses Playwright
- Typical things to automate: Web scraping, Fill out forms, Download files, Periodically check for changes, Automate actions
- Hacker News discussion
Browserflow
- Pricing starts at USD 49/month. Free for desktop runs, limited to 1 minute per run
- Automatically perform actions on websites as if you were doing them
visualping
- Pricing starts at USD 14/month. Free for limited use
- Website change detection and alerts
Distill.io
- Pricing starts at USD 15/month. Free for limited use
- automated tracking of website updates
ScrapFly
- Pricing starts at USD 15/month. Free for 1000 API calls / month
ScrapingBee
- Pricing starts at USD 49/month. Free for 1000 API calls
ScraperAPI
- Pricing starts at USD 29/month. Free for 1000 web pages per month.
Morph.io
- Free to use. But can subscribe as a supporter, starting at USD 14/month.
Scraping Fish
- Pricing starts at USD 0.002 per API call.
Crawly - Web Crawler by Diffbot
- Turn websites into data. Crawly spiders and extracts complete structured data from an entire website.
Browse AI
- Train a robot to scrape any website in 2 mins with no-code
- Pricing starts at USD 49/month. Limited free plan available
Scrape Owl
- Simple and powerful web scraping API that manages proxies, headless browsers, and HTML parsing. Simply specify the website and the element you want
- Pricing starts at USD 29/month
Web Automation
- the largest marketplace to find ready-made no code web scrapers. With only a few clicks and a few seconds you can start extracting data from your favourite site without coding or building from scratch
- Pricing starts at USD 49/month
EasyScrape
- With EasyScrape you can scrape the articles from any webpage with just one click. You can upload a text file containing a list of URLs, or provide a single URL or a keyword, and the articles will be scraped from those URLs with just a click.
Scrape All
- Automated web data scraper. Extract any kind of web data without needing coding skills
- Pricing starts at USD 10 / 3K credits
ProxyScrape
- Scrape websites without limits, with up to 60 000 datacenter proxies
APIFY
- web scraping and automation platform
- Pricing starts at USD 49/month. Limited free plan available
Zyte (formerly Scrapinghub)
- Pricing is not clear at the moment (2022-05-09)
Price2Spy
- Scraping tools for e-commerce (pricing bot)
- Pricing starts at USD 24/month
Bright Data
- complicated pricing page
- Demo video
Simplescraper
- need to install extension on Chrome
- no code style, point and click to select element
- pricing starts at USD 35/month. Free plan available
Automatio
- need to install extension on Chrome
- the scraper bot runs in the cloud not your local browser
- pricing page is buried inside its web app
Related resources
Tutorials
- ScrapFly | Web Scraping With Python Tutorial
- ScrapFly | How to Scrape Without Getting Blocked Tutorial
- ScrapFly | Parsing HTML with Xpath
- ScrapFly | Parsing HTML with CSS Selectors
- ScrapingBee | Web Scraping with Python: Everything you need to know (2022)
- ScrapingBee | Web Scraping using Selenium and Python
- ScrapingBee | Using Parsel to Extract Text from HTML in Python
- While libraries like Beautiful Soup, Scrapy, and Selenium might be overkill, Parsel is a great option for simple web scraping. Parsel’s simple methods and Selectors provide all of the functionality required for basic scraping scripts, whereas other libraries contain a slew of features that you’ll never use in a simple scraping script—such as browser automation, telnet console, logging, and emails, which aren’t required when all you want to do is extract content from a static website.
- ScrapingBee | Easy web scraping with Scrapy
- Python BeautifulSoup Tutorial: Web Scraping In 20 Lines Of Code
- Batch Downloading With Python
- enable clicking on the "show more" button on medium blog site via selenium
- Dataquest [YouTube] | Web Scraping Beginner Tutorial: BeautifulSoup, Playwright, And Python
- realpython | Python Web Scraping Learning Path