Extracts various media files, such as images, videos, audio, and other related media elements, from multiple websites. It then provides the corresponding descriptions or the alt="" content. You may need to use proxies to run this actor for some websites with bot blocking features.
The Web Media Scraping Tool is a powerful utility designed to extract various media files, including images, videos, audio, and other related elements, from multiple websites. It provides a convenient way to gather media content and retrieve associated descriptions or alt="" content. The tool supports output formats such as JSON or CSV and offers proxy support for websites with bot blocking features.
This documentation provides a comprehensive guide to the Web Media Scraper, a powerful tool designed to automate the process of collecting media files (images, videos, audio) and their corresponding descriptions from various websites.
Automated Extraction: Saves time by automatically collecting media from multiple websites.
Rich Media Support: Captures images, videos, audio, and other relevant media elements.
Description Retrieval: Extracts associated descriptions like "alt" text for accessibility.
Flexible Output Formats: Choose your preferred format for seamless integration - JSON or CSV.
Optional Proxy Support: Overcome website bot-blocking measures if necessary.
Increased Efficiency: Effortlessly gather media and descriptions from numerous websites in one go.
Time-Saving: Automate the data gathering process, freeing your time for analysis and creative work.
Streamlined Workflows: Easily integrate extracted media and descriptions into your projects.
Content Curation and Aggregation:
Market Research and Competitor Analysis:
Brand Monitoring and Reputation Management:
Identifying Influencer Partnerships:
Product and Image Recognition:
Research and Data Analysis:
Content Personalization:
Digital Rights Management:
E-commerce Optimization:
Event Tracking and Reporting:
Capture images related to specific events, campaigns, or product launches for comprehensive tracking and post-event analysis.
The actor requires only the website URLs from which to retrieve images and information on which proxies to utilize. You can specify multiple websites to obtain multiple results in a single run.
1{ 2 "startUrls": [ 3 { 4 "url": "https://apify.com", 5 }, 6 ] 7}
The actor saves its outcomes in the default dataset linked with the actor's operation. Subsequently, it offers the flexibility to export the data into different formats, including JSON, XML, CSV, or Excel.
Each website within the dataset is represented as a distinct object following this structure (illustrated in JSON format below):
1[{ 2 "URL": "https://crawlee.dev/docs/guides/configuration", 3 "total_media": 5, 4 "media_elements": [], 5 "images": [ 6 { 7 "id": "s6OotqTrMLa", 8 "url": "https://crawlee.dev/docs/guides/configuration", 9 "src": "/img/crawlee-light.svg", 10 "alt": "", 11 "type": "image" 12 }, 13 14 { 15 "id": "ZESUvm5A47e", 16 "src": "/img/crawlee-dark.svg", 17 "alt": "", 18 "url": "https://crawlee.dev/docs/guides/configuration", 19 "type": "image" 20 } 21 ], 22 23 "svg": [ 24 { 25 "id": "JwdyTS8P6Kt", 26 "url": "https://crawlee.dev/docs/guides/configuration", 27 "type": "svg" 28 }, 29 30 { 31 "id": "0r4WQSDIyNV", 32 "url": "https://crawlee.dev/docs/guides/configuration", 33 "type": "svg", 34 } 35 ], 36 "videos": [], 37 "audios": [], 38 "embed": [], 39 "object": [], 40 "canvas": [], 41},
Important: If you require customization or wish to request additional features, please feel free to contact us via email . We aim to respond to all inquiries within one business day, ensuring prompt assistance and addressing your needs effectively.
Yes, if you're scraping publicly available data for personal or internal use. Always review Websute's Terms of Service before large-scale use or redistribution.
No. This is a no-code tool — just enter a job title, location, and run the scraper directly from your dashboard or Apify actor page.
It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.
Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.
You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!