Transcribe
Convert media content into accurate transcriptions
Overview
The Transcribe API extracts text transcriptions from video and audio content across major social media platforms and podcast services. Simply provide a media URL, and the API will return a transcription of the audio.
Features
- Markdown Format: Receive transcriptions in both raw text and formatted markdown, ideal for LLM applications.
- Time-stamped Text: Get precise timing for each sentence in the transcription
- Paragraph Organization: Transcriptions are automatically structured into logical paragraphs
- Platform Metadata: Optionally retrieve media metrics such as upload time, like count, comment count,and thumbnail URL.
- Webhook Support: Get real-time notifications when a transcription is complete.
Basic Usage
To analyze content, simply add the media URL to the post_url
parameter. E.g. https://www.youtube.com/watch?v=example
.
You can optionally include a query
to ask specific questions about the content and a callback_url
to receive notifications when the analysis is complete. E.g. https://your-domain.com/webhook
.
Checking Transcription Status (Async Only)
If you’re using a callback_url
, the API response will include a Location
header containing the operation polling URL, which you can use to check the status of the transcription.
Example Operation Polling URL: https://api.scribesocial.ai/v1/transcribe-result/{operation-id}
When the callback request is sent to your endpoint, it will include a Scribe-Verification-Token
header.
This token corresponds to the API key ID used for the original request, allowing you to verify that the callback is coming from dScribe AI and is legitimate.
Example Response:
For more details about request parameters, response fields, and status codes, check out our Transcribe API Reference.
Common Use Cases
- LLM Applications: Enrich LLMs with high-quality transcriptions to improve contextual understanding, retrieval, and summarization.
- Social Listening & Brand Monitoring: Convert video content into text to track brand mentions, sentiment, and trends across media platforms.
- Content Analysis & Intelligence: Transcribe spoken content to structure and categorize information for easier insight extraction.
- Content Indexing & Searchability: Make video and audio content searchable by generating accurate transcriptions.
- Compliance & Moderation: Support content compliance by transcribing video/audio for policy checks and copyright monitoring.