Net-A-Porter Scraper

Net-A-Porter Scraper extracts product data such as prices, designers, colors, and product codes from Net-A-Porter. It follows pagination on listing pages and supports proxies.

This Apify Actor scrapes product data from Net-A-Porter.
It navigates through listing pages (with pagination) and then visits each product’s detail page to extract key information such as:

  • Designer
  • Product Name
  • Price
  • Color
  • Product Code

Features

  • Scraping of both listing pages and product detail pages.
  • Automatic pagination based on the pagination_max parameter.
  • Cap on the total number of scraped products via the products_max parameter.
  • Use of proxies (Apify) to reduce the risk of blocking.

Input Parameters

| Parameter | Type | Description | Default Value |
| --- | --- | --- | --- |
| startUrls | array | Start URLs for scraping. | ["https://www.net-a-porter.com/fr-fr/shop/chaussures"] |
| pagination_max | integer | Maximum number of pages to scrape. | 5 |
| products_max | integer | Maximum number of products to scrape. | 50 |
| proxyConfiguration | object | Proxy configuration for scraping (using Apify proxies is recommended). | { "useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"] } |
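
To illustrate how these parameters fit together, here is a minimal sketch that merges a partial input over the default values listed above and rejects invalid limits. The helper name resolveInput and the validation rules are assumptions made for illustration; the Actor's actual input handling may differ.

```javascript
// Defaults copied from the parameter list above.
const DEFAULT_INPUT = {
    startUrls: ['https://www.net-a-porter.com/fr-fr/shop/chaussures'],
    pagination_max: 5,
    products_max: 50,
    proxyConfiguration: { useApifyProxy: true, apifyProxyGroups: ['RESIDENTIAL'] },
};

// Hypothetical helper (not part of the Actor): merge user input over the
// defaults and check that both limits are positive integers.
function resolveInput(input = {}) {
    const resolved = { ...DEFAULT_INPUT, ...(input ?? {}) };
    for (const key of ['pagination_max', 'products_max']) {
        if (!Number.isInteger(resolved[key]) || resolved[key] < 1) {
            throw new TypeError(`${key} must be a positive integer`);
        }
    }
    if (!Array.isArray(resolved.startUrls) || resolved.startUrls.length === 0) {
        throw new TypeError('startUrls must be a non-empty array');
    }
    return resolved;
}

// Only products_max is overridden; everything else falls back to the defaults.
const input = resolveInput({ products_max: 10 });
console.log(input.pagination_max, input.products_max); // 5 10
```

Keeping the defaults in one object makes it easy to see which values a run actually used when only some parameters are supplied.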

Output

The Actor generates a JSON dataset containing, for each product, an object similar to:

{
  "url": "https://www.net-a-porter.com/fr-fr/shop/product/gianvito-rossi/...",
  "designer": "GIANVITO ROSSI",
  "productName": "Flat shoes with pointed toe in suede",
  "price": "690",
  "color": "Black",
  "productCode": "1647597357846081"
}
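
Because price is returned as a string, downstream code usually has to convert it before doing arithmetic. The sketch below shows one possible way to post-process dataset items after export. The helpers parsePrice and averagePriceByDesigner are hypothetical, the second sample record is invented for the example, and the code assumes prices are plain digit strings like the one shown above.

```javascript
// Hypothetical helper: turn the string price into a number, or null if it
// is missing or not a plain numeric string (an assumption about the format).
function parsePrice(price) {
    if (typeof price !== 'string' || price.trim() === '') return null;
    const value = Number(price);
    return Number.isFinite(value) ? value : null;
}

// Hypothetical helper: average price per designer, skipping unusable records.
function averagePriceByDesigner(items) {
    const totals = new Map();
    for (const { designer, price } of items) {
        const value = parsePrice(price);
        if (!designer || value === null) continue;
        const entry = totals.get(designer) ?? { sum: 0, count: 0 };
        entry.sum += value;
        entry.count += 1;
        totals.set(designer, entry);
    }
    return Object.fromEntries(
        [...totals].map(([designer, { sum, count }]) => [designer, sum / count]),
    );
}

const items = [
    // Fields taken from the sample output above.
    { designer: 'GIANVITO ROSSI', productName: 'Flat shoes with pointed toe in suede', price: '690' },
    // Invented record, for illustration only.
    { designer: 'GIANVITO ROSSI', productName: 'Example product', price: '710' },
];
console.log(averagePriceByDesigner(items)); // { 'GIANVITO ROSSI': 700 }
```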

This project is intended for educational and research purposes only. Please ensure you comply with the Net-A-Porter website’s terms of use and applicable scraping laws.


Additional Explanations

  • Pagination and Limitation:
    On each listing page, routes.js reads the page number from the pageNumber parameter in the URL; if it is less than pagination_max, the next page is added to the queue. Before adding a product detail page for processing, it also verifies that the total number of scraped products does not exceed products_max.

  • Page Differentiation:
    The logic in handleRequest determines whether the URL corresponds to a listing page or a detail page. On listing pages, product detail links are extracted using the selector .ProductList0__productItemContainer a.

  • Proxy Usage:
    The proxy configuration is managed via the proxyConfiguration parameter in the input schema and is applied when launching the crawler.
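
The pagination, limit, and page-differentiation rules described above can be sketched as three small functions. This is an illustration, not the Actor's actual routes.js: the function names are hypothetical, the "/shop/product/" check is inferred from the sample output URL, and treating a missing pageNumber as page 1 is an assumption.

```javascript
// Assumption: detail pages contain "/shop/product/" in their path,
// as in the sample output URL; anything else is treated as a listing page.
function isDetailPage(url) {
    return new URL(url).pathname.includes('/shop/product/');
}

// Returns the URL of the next listing page, or null once pagination_max
// is reached. A missing or invalid pageNumber is treated as page 1.
function nextListingUrl(url, paginationMax) {
    const next = new URL(url);
    const current = Number.parseInt(next.searchParams.get('pageNumber') ?? '1', 10) || 1;
    if (current >= paginationMax) return null;
    next.searchParams.set('pageNumber', String(current + 1));
    return next.href;
}

// Trims the detail links found on a listing page (for example, those matched
// by ".ProductList0__productItemContainer a") so that the number of queued
// products never exceeds products_max.
function selectDetailLinks(links, alreadyQueued, productsMax) {
    const remaining = Math.max(0, productsMax - alreadyQueued);
    return links.slice(0, remaining);
}

const listing = 'https://www.net-a-porter.com/fr-fr/shop/chaussures';
console.log(isDetailPage(listing)); // false
console.log(nextListingUrl(listing, 5)); // https://www.net-a-porter.com/fr-fr/shop/chaussures?pageNumber=2
console.log(selectDetailLinks(['a', 'b', 'c'], 49, 50)); // [ 'a' ]
```

Keeping these checks as pure functions, separate from the crawler's request handler, makes them easy to adjust if the site changes its URL scheme.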

You can customize and extend this template according to your specific needs or changes in the target site's HTML structure.

Frequently Asked Questions

Is it legal to scrape product listings or public data?

Yes, if you're scraping publicly available data for personal or internal use. Always review the website's Terms of Service before large-scale use or redistribution.

Do I need to code to use this scraper?

No. This is a no-code tool: just enter the start URLs and the limits (pagination_max and products_max), then run the scraper directly from your dashboard or the Apify Actor page.

What data does it extract?

It extracts the product URL, designer, product name, price, color, and product code. You can export all of it to Excel or JSON.

Can I scrape multiple pages or limit the results?

Yes. Set pagination_max to control how many listing pages are crawled and products_max to cap the number of products. The listing pages you provide in startUrls determine which products are scraped.

How do I get started?

You can use the Try Now button on this page to go to the scraper. You'll be guided to enter the start URLs and limits, and you'll get structured results. No setup needed!