🔥 Search, scrape, and clean the web for AI agents.
Updated May 21, 2026 - TypeScript
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
🔥 The open-source no-code platform for web scraping, crawling, search and AI data extraction • Turn websites into structured APIs in minutes 🔥
Clone any website with one command using AI coding agents
Self-hosted web scraper.
Undetected version of the Playwright testing and automation library.
The undetected self-hosted browser automation platform. Powered by Camoufox (Firefox) for 0% detection rates. Built for speed, privacy, and scalability.
🔥 AI-powered web monitoring platform. Create automated scouts that search the web and send email alerts when they find what you're looking for.
A JavaScript library for generating random user agents with data that's updated daily.
Undetected NodeJS version of the Playwright testing and automation library.
A Playwright-based Node.js tool that bypasses search engine anti-scraping mechanisms to execute Google searches. Local alternative to SERP APIs with MCP server integration.
100% free and fully open-source edge Firecrawl alternative with better link extraction for agents, which you can deploy to Cloudflare or Vercel yourself.
Open source web infrastructure for AI. Scrape, crawl, and automate the web, clean markdown, browser sessions, ready for your agents.
Build complex browser workflows visually and execute them via API.
Model Context Protocol (MCP) Server for Graphlit Platform
Full-content web fetcher for AI agents — Chrome TLS fingerprinting, browser impersonation, and multi-strategy article extraction
High-performance web crawler API optimized for LLMs. Turn any search or website into clean Markdown using remote browsers. Firecrawl alternative
Anti-detection browser server for AI agents — REST API wrapping Camoufox engine with OpenClaw plugin support
n8n node to interact with a browserless instance
⚡ Ayakashi.io - The next generation web scraping framework