Take one dataset on Apify platform, compare it to another, and output the missing ones. This can also be used to output only changed items, using a compound key
Take one dataset on Apify platform, compare to another, and output the missing ones. This can also be used to output only changed items, using a compound key.
Supports using whole nested objects as value, they are JSON.stringify
'd before being turned
into a small non-cryptographic space efficient hash
1await Apify.call('pocesar/diff-datasets', { 2 baseDatasetId: 'LdNAlaOY1aKGhwAah', // place the datasets here. The order of "base" and "other" matters 3 otherDatasetId: 'Bzu1pgOjenN43VhPY', // existing items in "base" are not output from "other" 4 uniqueFields: [ 5 // simple primitive field value, like string, number, boolean 6 "pageUrl", 7 8 // you can use lodash.get notation to get nested items, 9 // in this case `sub.fields.0` works like `sub.fields[0]` and the object looks like 10 // { 11 // pageUrl: "https//pageurl", 12 // sub: { 13 // fields: [ 14 // {...}, 15 // {...} 16 // ] 17 // } 18 // } 19 "sub.fields.0", 20 21 // you can also use .length to count arrays or string characters, as in 22 "sub.fields.length", 23 "pageUrl.length" 24 ], 25});
base
dataset, more items more memory needed.Set
with too many itemsApache 2.0
Yes, if you're scraping publicly available data for personal or internal use. Always review Websute's Terms of Service before large-scale use or redistribution.
No. This is a no-code tool — just enter a job title, location, and run the scraper directly from your dashboard or Apify actor page.
It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.
Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.
You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!