Net-A-Porter Scraper

Net-A-Porter Scraper is designed to extract product data such as prices, designers, colors, and codes from Net-A-Porter. It efficiently handles pagination and proxies.

ECOMMERCEFOR_CREATORSAUTOMATIONApify

Try Now →Read Guide →

This Apify Actor scrapes product data from Net-A-Porter.
It navigates through listing pages (with pagination) and then visits each product’s detail page to extract key information such as:

Designer
Product Name
Price
Color
Product Code

Features

Scraping of both listing pages and product detail pages.
Automatic pagination based on the pagination_max parameter.
Limitation on the total number of scraped products via the products_max parameter.
Use of proxies (Apify) to reduce the risk of blocking.

Input Parameters

Parameter	Type	Description	Default Value
`startUrls`	array	Start URLs for scraping.	["https://www.net-a-porter.com/fr-fr/shop/chaussures"]
`pagination_max`	integer	Maximum number of pages to scrape.	5
`products_max`	integer	Maximum number of products to scrape.	50
`proxyConfiguration`	object	Proxy configuration for scraping (using Apify proxies is recommended).	{ "useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"] }

Output

The Actor generates a JSON dataset containing, for each product, an object similar to:

{ "url": "https://www.net-a-porter.com/fr-fr/shop/product/gianvito-rossi/...", "designer": "GIANVITO ROSSI", "productName": "Flat shoes with pointed toe in suede", "price": "690", "color": "Black", "productCode": "1647597357846081" }

Legal Disclaimer

This project is intended for educational and research purposes only. Please ensure you comply with the Net-A-Porter website’s terms of use and applicable scraping laws.

Additional Explanations

Pagination and Limitation:
The routes.js file checks on each listing page if the page number (defined via the pageNumber parameter in the URL) is less than pagination_max and adds the next page. Additionally, before adding a product detail page for processing, it verifies that the total number of scraped products does not exceed products_max.
Page Differentiation:
The logic in handleRequest determines whether the URL corresponds to a listing page or a detail page. On listing pages, product detail links are extracted using the selector .ProductList0__productItemContainer a.
Proxy Usage:
The proxy configuration is managed via the proxyConfiguration parameter in the input schema and is applied when launching the crawler.

You can customize and extend this template according to your specific needs or changes in the target site's HTML structure.

Frequently Asked Questions

Is it legal to scrape job listings or public data?

Yes, if you're scraping publicly available data for personal or internal use. Always review Websute's Terms of Service before large-scale use or redistribution.

Do I need to code to use this scraper?

No. This is a no-code tool — just enter a job title, location, and run the scraper directly from your dashboard or Apify actor page.

What data does it extract?

It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.

Can I scrape multiple pages or filter by location?

Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.

How do I get started?

You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!