Crawl4AI

Crawl4AI

Wraps the Crawl4AI open-source library for retrieving text content from websites.

AIApify

🔥🕷️ Crawl4AI: Crawl Smarter, Faster, Freely. For AI.

Crawl4AI is an Apify Actor that wraps the powerful Crawl4AI library, providing you with a feature-packed web crawler and scraper with additional functionalities like link-following and automatic retries for failed requests.

The Actor can:

  • Crawl and scrape websites with precision using CSS, XPath, or LLM-based extraction methods.
  • Generate clean Markdown output, suitable for RAG pipelines or direct ingestion into large language models.
  • Automatically follow links to explore websites further without manual intervention.
  • Retry failed requests to ensure maximum data collection with minimal effort.

Usage

Scraping with Crawl4AI is straightforward. Just follow these steps to get your data quickly:

  1. Input your target URLs.
  2. Set your extraction method (optional - CSS, XPath, or LLM-based).
  3. Configure advanced options like proxies or session settings (optional).
  4. Run the Actor to start crawling, link-following, and retrying failed requests automatically.
  5. Retrieve your data in structured Markdown format for further use in your projects.

How much will it cost?

Apify provides $5 free usage credits every month on the Apify Free plan. With Crawl4AI, you can enjoy a certain number of results per month for free.

For larger data needs, consider upgrading to the $49/month Starter plan for increased monthly results volume. Or opt for the Scale plan for even higher result limits.

Results

Here is an example of the data that the Actor produces:

1[{
2  "url": "https://docs.crawl4ai.com/",
3  "markdown": "https://api.apify.com/v2/key-value-stores/m1Sqnke1KWM0AI8co/records/content_4242424242.md",
4  "html": "https://api.apify.com/v2/key-value-stores/m1Sqnke1KWM0AI8co/records/content_4242424242.html",
5  "metadata": {
6    "title": "Home - Crawl4AI Documentation (v0.5.x)",
7    "description": "🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper"
8  }
9},
10{
11  "url": "https://docs.crawl4ai.com/advanced/ssl-certificate/",
12  "markdown": "https://api.apify.com/v2/key-value-stores/m1Sqnke1KWM0AI8co/records/content_4242424242.md",
13  "html": "https://api.apify.com/v2/key-value-stores/m1Sqnke1KWM0AI8co/records/content_4242424242.html",
14  "metadata": {
15    "title": "SSL Certificate - Crawl4AI Documentation (v0.5.x)",
16    "description": "🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper"
17  }
18},
19// ...
20]

Frequently Asked Questions

Is it legal to scrape job listings or public data?

Yes, if you're scraping publicly available data for personal or internal use. Always review Websute's Terms of Service before large-scale use or redistribution.

Do I need to code to use this scraper?

No. This is a no-code tool — just enter a job title, location, and run the scraper directly from your dashboard or Apify actor page.

What data does it extract?

It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.

Can I scrape multiple pages or filter by location?

Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.

How do I get started?

You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!