Video Link Crawler

Discover and extract video links from any website in seconds with the Video Link Crawler. Starting from a specified URL, it navigates through web pages, identifies video content, and compiles the links into a structured dataset. Try it now!

🎥 Video Link Crawler

The Video Link Crawler is a powerful web crawler designed to extract 🎞️ video links from websites. Starting from a specified URL, it navigates through web pages, discovers video content, and compiles the extracted links into a structured dataset. It supports customizable crawling settings and efficient video extraction. Whether you're building a video catalog or analyzing video trends, this tool is your go-to solution for video discovery.


📝 Description

The Video Link Crawler specializes in extracting video URLs from various HTML elements, including:

  • <video> and <source> tags
  • <embed> and <object> tags
  • <iframe> tags (e.g., YouTube, Vimeo)
  • <a> tags linking to video files (e.g., .mp4, .webm)
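
To make the tag list above concrete, here is a minimal sketch of tag-based extraction using only Python's standard library. It is not the Actor's actual implementation: the class and function names, the `EMBED_HOSTS` list, and the choice of attributes (`src`, `data`, `href`) are assumptions made for illustration.

```python
from html.parser import HTMLParser

# Extensions named in the list above; extend as needed.
VIDEO_EXTENSIONS = (".mp4", ".webm")
# Assumed list of embed hosts used to decide whether an <iframe> is a video.
EMBED_HOSTS = ("youtube.com", "youtu.be", "vimeo.com")


class VideoLinkParser(HTMLParser):
    """Collects candidate video URLs from the tags listed above."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag in ("video", "source", "embed"):
            self._add(attr.get("src"))
        elif tag == "object":
            self._add(attr.get("data"))
        elif tag == "iframe":
            src = attr.get("src") or ""
            if any(host in src for host in EMBED_HOSTS):
                self._add(src)
        elif tag == "a":
            href = attr.get("href") or ""
            # Ignore query string and fragment when checking the extension.
            path = href.split("?", 1)[0].split("#", 1)[0]
            if path.lower().endswith(VIDEO_EXTENSIONS):
                self._add(href)

    def _add(self, url):
        # Skip missing attributes and duplicates, keep first-seen order.
        if url and url not in self.links:
            self.links.append(url)


def extract_video_links(html):
    parser = VideoLinkParser()
    parser.feed(html)
    return parser.links
```

Note that this sketch is deliberately loose: it would also pick up a `<source>` inside an `<audio>` element or an `<embed>` that is not a video, so a real crawler would likely add type checks.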

With advanced features like proxy support, crawl depth control, and retry mechanisms, it adapts seamlessly to your web scraping needs. Extracted data is stored in an Apify Dataset, ready for analysis or integration into your workflows.


✨ Key Features

  • Customizable Crawling
    ⬩ Define starting URLs
    ⬩ Restrict crawling using glob patterns
    ⬩ Set maximum crawl depth and request limits

  • Video URL Extraction
    ⬩ Extract video links from multiple HTML tags
    ⬩ Convert relative URLs to absolute links for seamless results

  • Proxy Support
    ⬩ Ensure anonymity with Apify Proxy
    ⬩ Optionally specify proxy groups

  • Error Handling
    ⬩ Detailed logs for failed requests
    ⬩ Handles redirect loops and HTTP 403 errors gracefully

  • Data Output
    ⬩ Structured and easy-to-use dataset with video links
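
Two of the features above, glob-based restriction and relative-to-absolute URL conversion, can be sketched as follows. This is an illustration under assumptions, not the Actor's code: the helper names are hypothetical, and the glob semantics (`**` crosses slashes, `*` stays within one path segment) are a simplified reading that may differ from the matching library the Actor actually uses.

```python
import re
from urllib.parse import urljoin


def glob_to_regex(pattern):
    """Translate a glob into a regex: `**` spans slashes, `*` and `?` do not."""
    parts = []
    i = 0
    while i < len(pattern):
        if pattern.startswith("**", i):
            parts.append(".*")
            i += 2
        elif pattern[i] == "*":
            parts.append("[^/]*")
            i += 1
        elif pattern[i] == "?":
            parts.append("[^/]")
            i += 1
        else:
            parts.append(re.escape(pattern[i]))
            i += 1
    return re.compile("".join(parts) + r"\Z")


def matches_any_glob(url, globs):
    """True if the URL matches at least one of the glob patterns."""
    return any(glob_to_regex(g).match(url) for g in globs)


def to_absolute(page_url, link):
    """Resolve a possibly relative link against the page it was found on."""
    return urljoin(page_url, link)
```

For example, `to_absolute("https://example.com/blog/post", "../videos/a.mp4")` resolves to `https://example.com/videos/a.mp4`, and `matches_any_glob` with the pattern `https://example.com/**` accepts that URL while rejecting links to other domains.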


⚙️ How It Works

  • Initialization
    Reads input parameters such as startUrl, glob patterns, and crawl depth to configure the crawler.

  • Crawling
    Begins at the startUrl, follows links matching the glob patterns, and stops upon reaching the crawl limits.

  • Video Extraction
    Detects video links from supported tags and resolves relative URLs to absolute ones.

  • Data Storage
    Each crawled page URL and the video links found on it are saved to an Apify Dataset.

  • Error Logging
    Logs all failed or skipped requests for review and troubleshooting.
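
The steps above can be sketched as a breadth-first loop. This is a simplified model under assumptions, not the Actor's implementation: `fetch` is a hypothetical callable that returns `(page_links, video_links)` for a URL or raises on failure, `fnmatchcase` stands in for whatever glob matcher the Actor uses, and the default limits are arbitrary.

```python
from collections import deque
from fnmatch import fnmatchcase


def crawl(start_url, globs, fetch, max_requests=50, max_depth=3):
    """Breadth-first crawl bounded by a request limit and a depth limit.

    `fetch(url)` is a hypothetical callable returning (page_links, video_links)
    with absolute URLs, or raising an exception when the request fails.
    """
    queue = deque([(start_url, 0)])
    seen = {start_url}
    dataset, failed = [], []
    requests_made = 0

    while queue and requests_made < max_requests:
        url, depth = queue.popleft()
        requests_made += 1
        try:
            page_links, video_links = fetch(url)
        except Exception as exc:
            # Error logging step: record the failure and move on.
            failed.append({"url": url, "error": str(exc)})
            continue

        # Data storage step: one record per crawled page.
        dataset.append({"url": url, "videoLinks": video_links})

        if depth >= max_depth:
            continue  # Do not enqueue links beyond the depth limit.
        for link in page_links:
            if link not in seen and any(fnmatchcase(link, g) for g in globs):
                seen.add(link)
                queue.append((link, depth + 1))

    return dataset, failed
```

Passing `fetch` in as a parameter keeps the loop testable without network access: a dictionary-backed stub can play the role of the website.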


🔧 Input Configuration

The Video Link Crawler accepts the following input parameters in JSON format:

| Field | Type | Required | Description |
|---|---|---|---|
| startUrl | string | Yes | The URL where the crawler begins. |
| globs | string[] | Yes | Array of glob patterns to restrict crawling to specific domains or paths. |
| maxRequestsPerCrawl | number | No | Maximum number of requests the crawler processes. |
| maxCrawlDepth | number | No | Maximum depth of pages to crawl from the starting URL. |
| proxyConfig | object | No | Proxy configuration for anonymized crawling (e.g., Apify Proxy). |

🛡️ Proxy Configuration

  • useApifyProxy (boolean): Enable Apify Proxy for anonymized crawling.
  • apifyProxyGroups (string[]): Specify proxy groups for more granular control (optional).

📝 Example Input

{
    "startUrl": "https://example.com",
    "globs": ["https://example.com/**"],
    "maxRequestsPerCrawl": 50,
    "maxCrawlDepth": 3,
    "proxyConfig": {
        "useApifyProxy": true,
        "apifyProxyGroups": ["SHARED"]
    }
}
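
Before starting a run, it can help to check an input document against the fields described above. The following is a minimal sketch, assuming the documented types and required flags are the only constraints. The function name `load_input` and the exact error messages are hypothetical, and the Actor's own validation may be stricter or looser.

```python
import json


def load_input(text):
    """Parse a JSON input string and check it against the documented fields."""
    raw = json.loads(text)
    errors = []

    # Required: startUrl (string). The http(s) prefix check is an assumption.
    start_url = raw.get("startUrl")
    if not isinstance(start_url, str) or not start_url.startswith(("http://", "https://")):
        errors.append("startUrl must be an http(s) URL string")

    # Required: globs (array of strings).
    globs = raw.get("globs")
    if not isinstance(globs, list) or not all(isinstance(g, str) for g in globs):
        errors.append("globs must be an array of strings")

    # Optional numeric limits.
    for field in ("maxRequestsPerCrawl", "maxCrawlDepth"):
        value = raw.get(field)
        # bool is a subclass of int in Python, so reject it explicitly.
        if value is not None and (
            isinstance(value, bool) or not isinstance(value, (int, float)) or value < 0
        ):
            errors.append(f"{field} must be a non-negative number")

    # Optional proxy settings.
    proxy = raw.get("proxyConfig", {})
    if not isinstance(proxy, dict) or not isinstance(proxy.get("useApifyProxy", False), bool):
        errors.append("proxyConfig must be an object with a boolean useApifyProxy")

    if errors:
        raise ValueError("; ".join(errors))
    return raw
```

Collecting all problems before raising means a single run of the check reports every invalid field at once rather than one at a time.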

📄 Example Output

The output of the Actor is saved to an Apify Dataset in the following structured format:

[
    {
        "url": "https://example.com/page1",
        "videoLinks": [
            "https://example.com/videos/video1.mp4",
            "https://youtube.com/watch?v=abc123"
        ]
    },
    {
        "url": "https://example.com/page2",
        "videoLinks": []
    }
]
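
Once the dataset is exported as JSON, the records can be post-processed with a few lines of code. As one possible example, the sketch below builds an index from each distinct video link to the first page it appeared on. The function name `index_video_links` is hypothetical, and the record shape is assumed to match the example output above.

```python
def index_video_links(items):
    """Map each distinct video link to the first page it was found on."""
    first_seen = {}
    for item in items:
        for link in item.get("videoLinks", []):
            # setdefault keeps the first page and ignores later duplicates.
            first_seen.setdefault(link, item["url"])
    return first_seen
```

Pages with an empty `videoLinks` array, such as `page2` in the example, simply contribute nothing to the index.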

ℹ️ Need Help?

If you have any questions, suggestions, or need assistance, feel free to:

💬 Open an issue in the Issues section.

📧 Contact us: infoweaver7@gmail.com

Frequently Asked Questions

Is it legal to scrape video links or other public data?

Yes, if you're scraping publicly available data for personal or internal use. Always review the website's Terms of Service before large-scale use or redistribution.

Do I need to code to use this scraper?

No. This is a no-code tool: just enter a start URL and glob patterns, then run the crawler directly from your dashboard or the Apify Actor page.

What data does it extract?

For each crawled page, it records the page URL and the video links found on that page. You can export all of it to Excel or JSON.

Can I crawl multiple pages or limit the crawl to part of a site?

Yes, the crawler follows links from the start URL across multiple pages. Use globs to restrict it to specific domains or paths, and maxCrawlDepth and maxRequestsPerCrawl to cap how far it goes.

How do I get started?

You can use the Try Now button on this page to go to the crawler. You'll be guided to enter a start URL and get structured results. No setup needed!