Overview ↗
noOriginal Documentation
Documentation Index#
Fetch the complete documentation index at: https://docs.crewai.com/llms.txt Use this file to discover all available pages before exploring further.
Extract data from websites and automate browser interactions with powerful scraping tools
These tools enable your agents to interact with the web, extract data from websites, and automate browser-based tasks. From simple web scraping to complex browser automation, these tools cover all your web interaction needs.
Available Tools#
General-purpose web scraping tool for extracting content from any website.
Target specific elements on web pages with precision scraping capabilities.
Crawl entire websites systematically with Firecrawl’s powerful engine.
High-performance web scraping with Firecrawl’s advanced capabilities.
Search and extract specific content using Firecrawl’s search features.
Browser automation and scraping with Selenium WebDriver capabilities.
Professional web scraping with ScrapFly’s premium scraping service.
Graph-based web scraping for complex data relationships.
Comprehensive web crawling and data extraction capabilities.
Cloud-based browser automation with BrowserBase infrastructure.
Fast browser interactions with HyperBrowser’s optimized engine.
Intelligent browser automation with natural language commands.
Access web data at scale with Oxylabs.
SERP search, Web Unlocker, and Dataset API integrations.
Common Use Cases#
- Data Extraction: Scrape product information, prices, and reviews
- Content Monitoring: Track changes on websites and news sources
- Lead Generation: Extract contact information and business data
- Market Research: Gather competitive intelligence and market data
- Testing & QA: Automate browser testing and validation workflows
- Social Media: Extract posts, comments, and social media analytics
Quick Start Example#
from crewai_tools import ScrapeWebsiteTool, FirecrawlScrapeWebsiteTool, SeleniumScrapingTool
# Create scraping tools
simple_scraper = ScrapeWebsiteTool()
advanced_scraper = FirecrawlScrapeWebsiteTool()
browser_automation = SeleniumScrapingTool()
# Add to your agent
agent = Agent(
role="Web Research Specialist",
tools=[simple_scraper, advanced_scraper, browser_automation],
goal="Extract and analyze web data efficiently"
)Scraping Best Practices#
- Respect robots.txt: Always check and follow website scraping policies
- Rate Limiting: Implement delays between requests to avoid overwhelming servers
- User Agents: Use appropriate user agent strings to identify your bot
- Legal Compliance: Ensure your scraping activities comply with terms of service
- Error Handling: Implement robust error handling for network issues and blocked requests
- Data Quality: Validate and clean extracted data before processing
Tool Selection Guide#
- Simple Tasks: Use
ScrapeWebsiteToolfor basic content extraction - JavaScript-Heavy Sites: Use
SeleniumScrapingToolfor dynamic content - Scale & Performance: Use
FirecrawlScrapeWebsiteToolfor high-volume scraping - Cloud Infrastructure: Use
BrowserBaseLoadToolfor scalable browser automation - Complex Workflows: Use
StagehandToolfor intelligent browser interactions