This actor scrapes blog posts from one or more WordPress websites, cleans the HTML content, and pushes flattened JSON containing every field it can find for each post. It uses Selenium to handle pages that require JavaScript rendering.
The WordPress Scraper Actor lets you scrape content from one or more WordPress websites, including blog posts, articles, author details, categories, comments, and media. It uses the WordPress REST API and the Requests library, plus Selenium where necessary, for accurate data extraction. It only works on WordPress sites that accept REST API calls.
The actor takes one or more website URLs as input, queries each site's REST API to gather data, and uses Selenium to handle JavaScript-rendered pages. The scraped data is cleaned and formatted as structured JSON.
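The REST API step described above can be sketched roughly as follows. This is a minimal illustration, not the actor's actual code: it assumes the standard WordPress core route `/wp-json/wp/v2/posts` with its `per_page` and `page` query parameters, and the names `posts_endpoint`, `collect_posts`, and `fetch_page` are hypothetical. The HTTP call is passed in as a function so the paging logic can be tried without a network connection.

```python
from typing import Callable, Dict, List, Optional, Union
from urllib.parse import urlencode

PER_PAGE = 100  # WordPress caps per_page at 100 items per request


def posts_endpoint(site_url: str, page: int, per_page: int = PER_PAGE) -> str:
    """Build the URL for one page of posts (standard core route, assumed)."""
    base = site_url.rstrip("/")
    query = urlencode({"per_page": per_page, "page": page})
    return f"{base}/wp-json/wp/v2/posts?{query}"


def collect_posts(
    site_url: str,
    fetch_page: Callable[[str], List[Dict]],
    max_results: Union[int, str] = "all",
) -> List[Dict]:
    """Walk the paginated endpoint until it runs dry or max_results is hit.

    fetch_page is a hypothetical callable that returns the decoded JSON list
    for a URL, or an empty list once the page number is past the last page.
    """
    limit: Optional[int] = None if max_results == "all" else int(max_results)
    posts: List[Dict] = []
    page = 1
    while limit is None or len(posts) < limit:
        batch = fetch_page(posts_endpoint(site_url, page))
        if not batch:
            break
        posts.extend(batch)
        if len(batch) < PER_PAGE:
            break  # a short page means there is nothing after it
        page += 1
    return posts if limit is None else posts[:limit]
```

With Requests, `fetch_page` could wrap `requests.get(url, timeout=30)`. A real fetcher would also need to turn the error response that WordPress typically sends for an out-of-range page number into an empty list, since that response body is a JSON object rather than a list.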
start_urls (required): List of website URLs to scrape (company1.com, company2.com, etc.).
max_results (optional): Maximum number of posts to retrieve per site. Set to 'all' for all posts.
scrape_mode (required, default is 'posts'): Choose the data you wish to scrape: 'posts', 'media', 'categories', or 'comments'.
The actor outputs (cleaned) JSON data for each post, including:
start_urls and optionally max_results.
robots.txt policies.
policies.1{ 2 "start_urls": [ 3 { "url": "https://example.com" }, 4 { "url": "https://another-example.com" } 5 ], 6 "max_results": "all" 7}
{
  "title": "Sample Blog Post",
  "cleaned_content": "This is the content of the blog post, without HTML tags.",
  "date_published": "2023-10-01"
}
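A record like this could be derived from a raw REST API post roughly as follows. This is only a sketch: it assumes the standard `title.rendered`, `content.rendered`, and `date` fields of the WordPress posts response and uses Python's built-in `html.parser` to strip tags. The actor's own cleaning may differ, and `strip_html` and `to_record` are hypothetical names.

```python
import re
from html.parser import HTMLParser
from typing import Dict, List

# Tags that should separate words when removed.
BLOCK_TAGS = {"p", "div", "br", "li", "tr", "blockquote",
              "h1", "h2", "h3", "h4", "h5", "h6"}


class _TextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self) -> None:
        super().__init__(convert_charrefs=True)  # decodes entities like &amp;
        self._chunks: List[str] = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag in BLOCK_TAGS:
            self._chunks.append(" ")

    def handle_endtag(self, tag):
        if tag in ("script", "style"):
            self._skip_depth = max(0, self._skip_depth - 1)
        elif tag in BLOCK_TAGS:
            self._chunks.append(" ")

    def handle_data(self, data):
        if self._skip_depth == 0:
            self._chunks.append(data)

    def text(self) -> str:
        # Collapse runs of whitespace left behind by removed tags.
        return re.sub(r"\s+", " ", "".join(self._chunks)).strip()


def strip_html(html: str) -> str:
    """Return the visible text of an HTML fragment."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()  # flush any buffered trailing text
    return parser.text()


def to_record(post: Dict) -> Dict[str, str]:
    """Map a raw REST API post (assumed shape) to the flat output record."""
    return {
        "title": strip_html(post["title"]["rendered"]),
        "cleaned_content": strip_html(post["content"]["rendered"]),
        "date_published": post["date"][:10],  # keep the YYYY-MM-DD part
    }
```

The standard library parser keeps the sketch dependency-free. A production cleaner might prefer a dedicated HTML library for malformed markup.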
Yes, if you're scraping publicly available data for personal or internal use. Always review the website's Terms of Service before large-scale use or redistribution.
No. This is a no-code tool — just enter a job title, location, and run the scraper directly from your dashboard or Apify actor page.
It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.
Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.
You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!