Net-A-Porter Scraper is designed to extract product data such as prices, designers, colors, and codes from Net-A-Porter. It efficiently handles pagination and proxies.
This Apify Actor scrapes product data from Net-A-Porter.
It navigates through listing pages (with pagination) and then visits each product’s detail page to extract key information such as:
pagination_max
parameter.products_max
parameter.Parameter | Type | Description | Default Value |
---|---|---|---|
startUrls | array | Start URLs for scraping. | ["https://www.net-a-porter.com/fr-fr/shop/chaussures"] |
pagination_max | integer | Maximum number of pages to scrape. | 5 |
products_max | integer | Maximum number of products to scrape. | 50 |
proxyConfiguration | object | Proxy configuration for scraping (using Apify proxies is recommended). | { "useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"] } |
The Actor generates a JSON dataset containing, for each product, an object similar to:
{ "url": "https://www.net-a-porter.com/fr-fr/shop/product/gianvito-rossi/...", "designer": "GIANVITO ROSSI", "productName": "Flat shoes with pointed toe in suede", "price": "690", "color": "Black", "productCode": "1647597357846081" }
This project is intended for educational and research purposes only. Please ensure you comply with the Net-A-Porter website’s terms of use and applicable scraping laws.
Pagination and Limitation:
The routes.js file checks on each listing page if the page number (defined via the pageNumber
parameter in the URL) is less than pagination_max
and adds the next page. Additionally, before adding a product detail page for processing, it verifies that the total number of scraped products does not exceed products_max
.
Page Differentiation:
The logic in handleRequest determines whether the URL corresponds to a listing page or a detail page. On listing pages, product detail links are extracted using the selector .ProductList0__productItemContainer a
.
Proxy Usage:
The proxy configuration is managed via the proxyConfiguration
parameter in the input schema and is applied when launching the crawler.
You can customize and extend this template according to your specific needs or changes in the target site's HTML structure.
Yes, if you're scraping publicly available data for personal or internal use. Always review Websute's Terms of Service before large-scale use or redistribution.
No. This is a no-code tool — just enter a job title, location, and run the scraper directly from your dashboard or Apify actor page.
It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.
Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.
You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!