Unlock valuable insights from any article! Get clean text, publication data, keywords, summaries, and more. Ideal for research, content marketing, and competitive analysis. Fast, reliable, and easy to use.
Extract clean, structured data from news articles and blog posts with this powerful Apify Actor. Get article text, metadata, keywords, summaries, and more – perfect for content analysis, market research, news aggregation, and SEO monitoring. No coding required!
This Actor is your one-stop solution for extracting valuable data from online articles. Whether you're a marketer tracking brand mentions, a researcher collecting data for analysis, or a developer building a news aggregation app, this tool saves you time and effort.
The Actor returns a JSON dataset with the following fields for each article:
Field | Description |
---|---|
articleURL | The URL of the scraped article |
sourceURL | The base URL of the website |
articleLanguage | The language of the article (e.g., "en", "es") |
articleTitle | The title of the article |
articleAuthors | A comma-separated list of the article's authors |
articlePublishDate | The publication date of the article (ISO 8601 format) |
articleText | The full text content of the article |
articleTopImage | The URL of the main image of the article |
articleAllImages | A comma-separated list of URLs for all images found |
articleVideos | A comma-separated list of URLs for embedded videos |
articleKeywords | A comma-separated list of keywords extracted |
articleSummary | A concise summary of the article |
scrapedAt | The timestamp of when the article was scraped |
scrapeSuccess | Boolean indicating scraping success |
articleMetaDescription | The meta description of the article |
articleMetaKeywords | A comma-separated list of the meta keywords |
scrapeErrorMessage | An error message if scrapeSuccess is false |
1[ 2 { 3 "articleURL": "https://www.example.com/news/article1", 4 "sourceURL": "https://www.example.com", 5 "articleLanguage": "en", 6 "articleTitle": "Example News Article", 7 "articleAuthors": "John Doe, Jane Smith", 8 "articlePublishDate": "2024-07-27T10:00:00Z", 9 "articleText": "This is the full text of the example news article...", 10 "articleTopImage": "https://www.example.com/images/article1.jpg", 11 "articleAllImages": "https://www.example.com/images/article1.jpg,https://www.example.com/images/article2.png", 12 "articleVideos": "", 13 "articleKeywords": "news, example, article", 14 "articleSummary": "A brief summary of the example news article.", 15 "scrapedAt": "2024-07-27T12:34:56Z", 16 "scrapeSuccess": true, 17 "articleMetaDescription": "An example article for demonstration.", 18 "articleMetaKeywords": "example, article, news, demo" 19 } 20]
Find the "Article Scraper & News Content Extractor" in the Apify Store
Configure the input:
startUrls
: An array of URLs to scrapelanguage
: (Optional) The expected language of the articles (default: "en")requestTimeout
: (Optional) The timeout for each request (default: 7 seconds)fetchImages
: (Optional) Whether to fetch images (default: true)proxyConfiguration
: Select a proxy configurationbrowserUserAgent
: (Optional) Custom User-AgentRun the Actor
Access results in JSON, CSV, Excel, or other formats
Optional: Schedule automatic runs, set up webhooks, or integrate with other Apify Actors
Start extracting valuable data from articles today! ➡️
Yes, if you're scraping publicly available data for personal or internal use. Always review Websute's Terms of Service before large-scale use or redistribution.
No. This is a no-code tool — just enter a job title, location, and run the scraper directly from your dashboard or Apify actor page.
It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.
Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.
You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!