Professional YouTube transcript extraction tool. Features multi-language support, timestamped chunks in JSON format, and complete transcript text. Ideal for content analysis, research, and creating searchable video archives.
A powerful Apify actor for extracting transcripts/captions from YouTube videos. The actor supports multiple languages, batch processing, and provides both structured JSON with timestamps and plain text formats.
youtube.com/watch?v=...
)youtu.be/...
)youtube.com/live/...
)Parameter | Type | Required | Description |
---|---|---|---|
youtube_urls | array | Yes | List of YouTube video URLs to process |
preferredLanguage | string | No | Preferred language code (default: 'en') |
Supported languages include: English (en), Spanish (es), French (fr), German (de), Italian (it), Portuguese (pt), Russian (ru), Japanese (ja), Korean (ko), Chinese (zh), Polish (pl), Czech (cs), Slovak (sk), Swedish (sv), Dutch (nl), Hindi (hi), Norwegian (no), Arabic (ar)
The actor saves results to its dataset with the following structure:
1{ 2 "url": "YouTube video URL", 3 "video_id": "Video identifier", 4 "language_used": "Language code of the extracted transcript", 5 "available_languages": ["array", "of", "available", "language", "codes"], 6 "transcript_json": { 7 "text": "Complete transcript text", 8 "chunks": [ 9 { 10 "timestamp": [start_time, end_time], 11 "text": "Segment text" 12 } 13 ] 14 }, 15 "transcript_text": "Plain text transcript", 16 "status": "success/error", 17 "error": "Error message if any" 18}
1{ 2 "youtube_urls": [ 3 "https://www.youtube.com/watch?v=v0sNkmiKsBU" 4 ] 5}
1{ 2 "url": "https://www.youtube.com/watch?v=v0sNkmiKsBU", 3 "video_id": "v0sNkmiKsBU", 4 "language_used": "en", 5 "available_languages": [ 6 "en" 7 ], 8 "transcript_json": { 9 "text": "some people think that people who are so good at math are just nerds and they're whizzes and that's all they know how to do and zack has never been into that category at all...", 10 "chunks": [ 11 { 12 "timestamp": [2.639, 5.359], 13 "text": "some people think that" 14 }, 15 { 16 "timestamp": [3.76, 7.44], 17 "text": "people who are so good at math are just" 18 } 19 // ... more chunks 20 ] 21 }, 22 "transcript_text": "some people think that people who are so good at math are just nerds and they're whizzes and that's all they know how to do...", 23 "status": "success" 24}
Note: The output example shows a truncated version of the transcript for brevity. Actual output will contain the complete transcript with all timestamp chunks.
The actor provides clear error messages when:
For bug reports, feature requests, or questions:
Yes, if you're scraping publicly available data for personal or internal use. Always review Websute's Terms of Service before large-scale use or redistribution.
No. This is a no-code tool — just enter a job title, location, and run the scraper directly from your dashboard or Apify actor page.
It extracts job titles, companies, salaries (if available), descriptions, locations, and post dates. You can export all of it to Excel or JSON.
Yes, you can scrape multiple pages and refine by job title, location, keyword, or more depending on the input settings you use.
You can use the Try Now button on this page to go to the scraper. You’ll be guided to input a search term and get structured results. No setup needed!