Crawlee is a powerful web scraping and browser automation library for Node.js, enabling developers to create reliable crawlers in JavaScript and TypeScript. It supports data extraction for AI applications, including LLMs and GPTs, and offers features like proxy rotation and both headful and headless modes.
claude install apify/crawleehttps://crawlee.dev/docs
["1. Install Crawlee in your Node.js project using npm: `npm install crawlee`.","2. Choose the appropriate crawler type (e.g., CheerioCrawler for static pages or PuppeteerCrawler for dynamic content).","3. Define your scraping targets, including URLs and data extraction rules. Use selectors to target specific HTML elements.","4. Implement proxy rotation by configuring the `proxyConfiguration` option with a list of proxy URLs.","5. Run your crawler in headless mode by setting `headless: true` in the crawler configuration. Monitor performance and adjust settings as needed."]
Extracting product data from e-commerce sites for price comparison and market analysis.
Gathering competitive intelligence from industry websites to inform business strategies.
Automating data collection for research purposes, such as gathering statistics or survey results.
Downloading media files from various sources for analysis and content creation.
claude install apify/crawleegit clone https://github.com/apify/crawleeCopy the install command above and run it in your terminal.
Launch Claude Code, Cursor, or your preferred AI coding agent.
Use the prompt template or examples below to test the skill.
Adapt the skill to your specific use case and workflow.
Using Crawlee, scrape the latest product reviews from [ECOMMERCE_SITE] for [PRODUCT_CATEGORY]. Extract key metrics like average rating, common praise points, and frequent complaints. Save the data in a structured JSON format for analysis. Use proxy rotation to avoid IP blocking and run in headless mode for efficiency.
Successfully scraped 1,243 product reviews from Amazon for the 'Bluetooth Headphones' category. The average rating is 4.2 stars, with common praise points including 'comfortable fit' (mentioned 312 times) and 'excellent sound quality' (287 mentions). Frequent complaints include 'battery life could be better' (143 mentions) and 'connectivity issues' (98 mentions). Data saved in structured JSON format with fields for review text, rating, and date. Proxy rotation successfully avoided IP blocking, and headless mode ensured efficient scraping.
Spreadsheet with built-in API integrations and automation
Your one-stop shop for church and ministry supplies.
Unlock data insights with interactive dashboards and collaborative analytics capabilities.
Automate your browser workflows effortlessly
Real-time collaborative writing platform
Enhance performance monitoring and root cause analysis with real-time distributed tracing.
Take a free 3-minute scan and get personalized AI skill recommendations.
Take free scan