IMDb Movies Scraper

IMDb Movies Scraper

IMDb Movies Scraper is an Apify Actor that extracts movie details from IMDb's Top 250. It gathers titles, ratings, release years, durations, and certifications into structured JSON. Leveraging Puppeteer and proxy support, it enables efficient, reliable headless data extraction.

DEVELOPER_TOOLSAUTOMATIONNEWSApify

🎬 IMDb Movies Scraper

IMDb Movies Scraper is an Apify Actor that scrapes movie details from IMDb's Top Chart. It extracts key information such as the movie title, rating, release year, duration, and certification. This Actor processes the IMDb Top 250 chart and visits each movie's detail page to gather structured data.

🚀 Features

  • Scrapes movie details: title, rating, release year, duration, and certification.
  • Extracts movie links: Starts from the IMDb Top Chart page and enqueues individual movie pages.
  • Handles multiple layouts: Uses fallback selectors to account for changes in IMDb's page structure.
  • Uses Puppeteer and Proxies: Supports proxy configurations and headless browsing for efficient scraping.
  • Stores data in Apify Dataset: Outputs structured JSON data for further processing and export.

📥 Input Parameters

The Actor accepts the following input parameters:

ParameterTypeDescriptionDefault Value
chartUrlstringThe URL of the IMDb chart page to scrape (e.g., "https://www.imdb.com/chart/top/")."https://www.imdb.com/chart/top/"
max_moviesintegerThe maximum number of movies to process from the chart page.250

Example Input

1{
2  "chartUrl": "https://www.imdb.com/chart/top/",
3  "max_movies": 250
4}

📤 Output

The Actor outputs a JSON dataset containing movie details for each processed movie. A sample output object is shown below:

1{
2  "title": "The Shawshank Redemption",
3  "rating": "9.3",
4  "year": "1994",
5  "duration": "2h 22m",
6  "certification": "R",
7  "url": "https://www.imdb.com/title/tt0111161/"
8}

This project is intended for educational and research purposes only. The use of this Actor must comply with IMDb's Terms of Service and robots.txt policies.

  • Usage Compliance: You are responsible for ensuring your use case does not violate IMDb's website terms.
  • Respectful Scraping: Avoid aggressive scraping that could negatively impact IMDb's infrastructure.
  • Data Ownership: IMDb's content and trademarks are the property of IMDb, Inc.
  • Commercial Use: For commercial purposes, consider contacting IMDb for official data access.

Frequently Asked Questions

Is it legal to scrape job listings or public data?

Yes, if you're scraping publicly available data for personal or internal use. Always review Websute's Terms of Service before large-scale use or redistribution.

Do I need to code to use this scraper?

No. This is a no-code tool — just enter a job title, location, and run the scraper directly from your dashboard or Apify actor page.

What data does it extract?

It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.

Can I scrape multiple pages or filter by location?

Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.

How do I get started?

You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!