Reddit Comments Scraper

Extract Reddit comments with their complete thread structure, including nested replies, user information, and engagement metrics. Well suited to discussion analysis, sentiment analysis, and tracking community engagement on Reddit posts.

Reddit Comments Scraper 💬

Extract complete comment threads from Reddit posts with full conversation context and rich metadata.

🌟 Features

  • Extract complete comment threads with nested replies
  • Capture user information and avatars
  • Track comment engagement metrics
  • Support for various content types (text, images)
  • Duplicate comment detection

📊 Output Data Structure

Each comment includes:

  • Comment ID and metadata
  • Author information and avatar
  • Parent-child relationship tracking
  • Creation timestamp (ISO format)
  • Content and content type
  • Engagement metrics (upvotes)
  • Moderation status
  • Nested replies array
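The creation timestamp can be turned into a timezone-aware value for sorting or filtering. The following is a minimal sketch, assuming the `created_time` field always has the shape seen in the output sample in this document (microseconds followed by a `+0000` offset); `parse_created_time` is a hypothetical helper name, not part of the scraper.

```python
from datetime import datetime, timezone

# Format observed in the sample output: microseconds plus a "+0000"-style offset.
# This is an assumption; other runs may emit a different shape.
CREATED_TIME_FORMAT = "%Y-%m-%dT%H:%M:%S.%f%z"


def parse_created_time(value: str) -> datetime:
    """Parse a created_time string into a timezone-aware datetime."""
    return datetime.strptime(value, CREATED_TIME_FORMAT)


created = parse_created_time("2024-08-11T07:12:09.272000+0000")
# Normalize to UTC before comparing timestamps from different comments.
print(created.astimezone(timezone.utc).isoformat())
```

`strptime` with `%z` is used here rather than `datetime.fromisoformat`, because older Python versions do not accept an offset written without a colon.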

💡 Use Cases

  • Sentiment analysis
  • Discussion tracking
  • User engagement analysis
  • Content moderation monitoring
  • Community feedback analysis
  • Market research
  • Academic research
  • Social media monitoring

🔧 Input Parameters

Parameter | Type | Description
postUrl | String | URL of the Reddit post to scrape
maxItems | Integer | Maximum number of comments to scrape
proxyConfiguration | Object | Optional proxy settings

Input Example

An example input in JSON:

{
    "postUrl": "https://www.reddit.com/r/ChatGPT/comments/1epeshq/these_are_all_ai/",
    "maxItems": 100
}
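If you assemble the input programmatically, a small helper can catch malformed values before a run starts. This is a sketch under stated assumptions: `build_run_input` and `POST_URL_PATTERN` are hypothetical names, the URL check is inferred only from the example above, and treating `maxItems` as optional is a guess rather than documented behavior.

```python
import re
from typing import Any, Dict, Optional

# Assumed URL shape, based on the example input:
# https://www.reddit.com/r/<subreddit>/comments/<post_id>/...
POST_URL_PATTERN = re.compile(
    r"^https://(?:www\.)?reddit\.com/r/[^/]+/comments/[A-Za-z0-9]+(?:/.*)?$"
)


def build_run_input(
    post_url: str,
    max_items: Optional[int] = None,
    proxy_configuration: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Build an input object using the parameter names from the table above."""
    if not POST_URL_PATTERN.match(post_url):
        raise ValueError(f"Not a Reddit post URL: {post_url!r}")
    if max_items is not None and max_items < 1:
        raise ValueError("max_items must be a positive integer")

    run_input: Dict[str, Any] = {"postUrl": post_url}
    # Only include optional keys that were actually provided.
    if max_items is not None:
        run_input["maxItems"] = max_items
    if proxy_configuration is not None:
        run_input["proxyConfiguration"] = proxy_configuration
    return run_input


example_input = build_run_input(
    "https://www.reddit.com/r/ChatGPT/comments/1epeshq/these_are_all_ai/",
    max_items=100,
)
print(example_input)
```

The resulting dictionary has the same keys as the JSON example and can be serialized with `json.dumps` before being pasted into the input form.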

Output sample

The results are stored in a dataset, which you can always find in the Storage tab. Below is an excerpt, in JSON, of the data you'd get with the input parameters above. You can choose the format in which to download your data: JSON, JSONL, Excel spreadsheet, HTML table, CSV, or XML.

[
    {
        "comment_id": "t1_lhk1f7n",
        "post_id": "t3_1epeshq",
        "author": "AutoModerator",
        "permalink": "https://www.reddit.com/r/ChatGPT/comments/1epeshq/comment/lhk1f7n/",
        "upvotes": 1,
        "content_type": "text",
        "parent_id": null,
        "author_avatar": "https://styles.redditmedia.com/t5_1yz875/styles/profileIcon_klqlly9fc4l41.png?width=64&height=64&frame=1&auto=webp&crop=64:64,smart&s=d2d22c53cc8b2756dc0cfabd790128c50c94a74d",
        "userUrl": "https://www.reddit.com/user/AutoModerator",
        "contentText": "Moderator Announcement\n          Read More »\n        \n      \n          \n    \n    Hey u/Maxie445!\n  \n    If your post is a screenshot of a ChatGPT conversation, please reply to this message with the conversation link or prompt.\n  \n    If your post is a DALL-E 3 image post, please reply with the prompt used to make this image.\n  \n    Consider joining our public discord server! We have free bots with GPT-4 (with vision), image generators, and more!\n  \n    🤖\n  \n    Note: For any ChatGPT-related concerns, email support@openai.com\n  \n    I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.",
        "created_time": "2024-08-11T07:12:09.272000+0000"
    },
    {
        "comment_id": "t1_lhkeis2",
        "post_id": "t3_1epeshq",
        "author": "Alternative_Lynx_155",
        "permalink": "https://www.reddit.com/r/ChatGPT/comments/1epeshq/comment/lhkeis2/",
        "upvotes": 1434,
        "content_type": "text",
        "parent_id": null,
        "author_avatar": "https://styles.redditmedia.com/t5_4g5czk/styles/profileIcon_729ggyg69ivd1.jpeg?width=64&height=64&frame=1&auto=webp&crop=64:64,smart&s=3efec1c537ca85a1aeacaa9a0c7a58b052181ef3",
        "userUrl": "https://www.reddit.com/user/Alternative_Lynx_155",
        "contentText": "That is crazy. When I was younger I thought thispersondoesnotexist.com was scary, but this is something completly different.",
        "created_time": "2024-08-11T09:39:54.843000+0000",
        "replies": [
            {
                "comment_id": "t1_lhmhxjf",
                "post_id": "t3_1epeshq",
                "author": "who_am_i_to_say_so",
                "permalink": "https://www.reddit.com/r/ChatGPT/comments/1epeshq/comment/lhmhxjf/",
                "upvotes": 279,
                "content_type": "text",
                "parent_id": "t1_lhkeis2",
                "author_avatar": "https://styles.redditmedia.com/t5_9nppax/styles/profileIcon_kmdpkdhclabd1.png?width=64&height=64&frame=1&auto=webp&crop=64:64,smart&s=a0b6857e2d06bc9b7e7df01e5cea98fc4eacb31e",
                "userUrl": "https://www.reddit.com/user/who_am_i_to_say_so",
                "contentText": "I just spent 30 mins f5'ing that page. It's so addicting!",
                "created_time": "2024-08-11T18:22:16.497000+0000"
            }
        ]
    },
    ...
]
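Because replies are nested inside their parent comment, analysis tools that expect one row per comment usually need the tree flattened first. The sketch below walks the `replies` arrays depth-first and records each comment's nesting depth. `iter_comments` is a hypothetical helper, and the `sample` list is a trimmed copy of the excerpt above, not additional scraper output.

```python
from typing import Any, Dict, Iterator, List, Tuple

Comment = Dict[str, Any]


def iter_comments(
    comments: List[Comment], depth: int = 0
) -> Iterator[Tuple[int, Comment]]:
    """Yield (depth, comment) pairs in depth-first order."""
    for comment in comments:
        yield depth, comment
        # "replies" is absent on leaf comments in the sample, so fall back to [].
        yield from iter_comments(comment.get("replies") or [], depth + 1)


# Trimmed version of the output excerpt, keeping only a few fields.
sample = [
    {"comment_id": "t1_lhk1f7n", "parent_id": None, "upvotes": 1},
    {
        "comment_id": "t1_lhkeis2",
        "parent_id": None,
        "upvotes": 1434,
        "replies": [
            {"comment_id": "t1_lhmhxjf", "parent_id": "t1_lhkeis2", "upvotes": 279}
        ],
    },
]

flat = [(depth, c["comment_id"], c["upvotes"]) for depth, c in iter_comments(sample)]
for depth, comment_id, upvotes in flat:
    # Indent by depth so the thread structure stays visible.
    print("  " * depth + f"{comment_id} ({upvotes} upvotes)")
```

A generator keeps memory use low on long threads, and the `parent_id` field remains available on every yielded comment if you prefer to rebuild the tree later.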

Frequently Asked Questions

Is it legal to scrape Reddit comments or other public data?

Yes, if you're scraping publicly available data for personal or internal use. Always review Reddit's Terms of Service before large-scale use or redistribution.

Do I need to code to use this scraper?

No. This is a no-code tool: just enter the URL of a Reddit post and run the scraper directly from your dashboard or the Apify actor page.

What data does it extract?

It extracts each comment's ID, author, avatar, permalink, upvotes, content type, text, creation timestamp, parent ID, and nested replies. You can export all of it to Excel or JSON.

Can I scrape multiple posts or limit the number of comments?

The input parameters listed above take a single postUrl per run, so to cover several posts, run the scraper once for each post URL. Use maxItems to cap the number of comments scraped.

How do I get started?

You can use the Try Now button on this page to go to the scraper. You'll be guided to enter a post URL and get structured results. No setup needed!