A powerful Apify Actor that finds sitemap URLs for any website. This Actor helps you discover XML sitemaps by checking common locations, robots.txt files, and analyzing HTML content for sitemap links.
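As a rough illustration of the first two discovery steps (common locations and robots.txt), here is a minimal sketch. It is not the Actor's actual implementation: the helper names, the `COMMON_PATHS` list, and the choice to default to `https://` are assumptions made for the example, and no network requests are performed.

```python
from urllib.parse import urljoin

# Hypothetical list of well-known paths; the Actor's real list may differ.
COMMON_PATHS = ["/sitemap.xml", "/sitemap-index.xml"]


def normalize_base(url: str) -> str:
    """Add a scheme if the input is a bare domain such as 'example.com'."""
    url = url.strip()
    if not url.startswith(("http://", "https://")):
        url = "https://" + url  # assumption: default to HTTPS
    return url.rstrip("/") + "/"


def candidate_urls(url: str) -> list[str]:
    """Build the common-location candidates that could be probed for a site."""
    base = normalize_base(url)
    return [urljoin(base, path) for path in COMMON_PATHS]


def sitemaps_from_robots(robots_txt: str) -> list[str]:
    """Extract the values of 'Sitemap:' directives from robots.txt text."""
    found = []
    for line in robots_txt.splitlines():
        # Split on the first colon only, so the URL's own colons are preserved.
        key, sep, value = line.partition(":")
        if sep and key.strip().lower() == "sitemap" and value.strip():
            found.append(value.strip())
    return found
```

A real implementation would then request each candidate and, unless verification is skipped, check that the response is valid sitemap XML.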
Common locations checked include /sitemap.xml, /sitemap-index.xml, etc.
Parameter | Type | Description |
---|---|---|
url | String | The URL of the website to search for sitemaps (required) |
findAll | Boolean | If true, find all available sitemaps. If false, find only the primary sitemap (default: true) |
noVerify | Boolean | If true, skip verification of sitemap XML format (default: false) |
timeout | Integer | Timeout in seconds for HTTP requests (default: 5) |
verbose | Boolean | Enable detailed logging of the sitemap discovery process (default: false) |
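For callers who build the input programmatically, a small helper can apply the defaults from the table and catch obvious mistakes before a run is started. The sketch below is only an illustration: `build_run_input` is a hypothetical helper, not part of the Actor or the Apify client, and the validation rules are assumptions.

```python
def build_run_input(url, find_all=True, no_verify=False, timeout=5, verbose=False):
    """Return an input dict using the parameter names and defaults from the table."""
    if not isinstance(url, str) or not url.strip():
        raise ValueError("url is required")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(timeout, bool) or not isinstance(timeout, int) or timeout <= 0:
        raise ValueError("timeout must be a positive integer (seconds)")
    return {
        "url": url.strip(),
        "findAll": bool(find_all),
        "noVerify": bool(no_verify),
        "timeout": timeout,
        "verbose": bool(verbose),
    }
```

For example, `build_run_input("example.com", find_all=False)` yields an input that asks for the primary sitemap only.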
The Actor saves results to the default dataset with the following structure:
When findAll is true:

```json
{
  "url": "example.com",
  "sitemaps": [
    "https://example.com/sitemap.xml",
    "https://example.com/post-sitemap.xml"
  ],
  "count": 2
}
```

When findAll is false:

```json
{
  "url": "example.com",
  "sitemap": "https://example.com/sitemap.xml"
}
```
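Because the two result shapes use different keys, downstream code may want to normalize them. The following is a minimal sketch, assuming dataset items look exactly like the two examples above; `extract_sitemaps` is a hypothetical helper name.

```python
def extract_sitemaps(item: dict) -> list[str]:
    """Return the sitemap URLs from a dataset item, whichever shape it has."""
    if "sitemaps" in item:
        # Multi-sitemap shape: a list under "sitemaps".
        return list(item["sitemaps"])
    if item.get("sitemap"):
        # Single-sitemap shape: one URL under "sitemap".
        return [item["sitemap"]]
    # Neither key present (or empty value): treat as no sitemap found.
    return []
```

With this, callers can iterate over `extract_sitemaps(item)` without checking which mode the run used.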
You can run the Actor on the Apify Platform or integrate it into your own applications using the Apify API.
```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your API token
client = ApifyClient(token="YOUR_API_TOKEN")

# Prepare the Actor input
run_input = {
    "url": "example.com",
    "findAll": True,
    "verbose": True,
}

# Run the Actor and wait for it to finish
run = client.actor("YOUR_ACTOR_ID").call(run_input=run_input)

# Fetch and print Actor results from the default dataset
results = client.dataset(run["defaultDatasetId"]).list_items().items
print(results)
```
Yes, if you're scraping publicly available data for personal or internal use. Always review the website's Terms of Service before large-scale use or redistribution.
No. This is a no-code tool: just enter a job title and location, then run the scraper directly from your dashboard or the Apify Actor page.
It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.
Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.
You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!