Built by Metorial, the integration platform for agentic AI.
Provider Summary
transcribe audio files
real-time speech-to-text
speaker diarization
translate transcripts
generate summaries
sentiment analysis
named entity recognition
generate subtitles
extract structured data
custom prompt responses
Transcribe audio and video files to text in asynchronous or real-time streaming mode. Over 100 languages are supported, with automatic language detection and code-switching. Perform speaker diarization to identify the different speakers in a recording. Translate transcripts into multiple target languages. Generate summaries and chapter segmentation from audio, and run sentiment analysis and named entity recognition on it. Extract structured data and produce subtitles in SRT or VTT format. Apply custom vocabulary, spelling corrections, content moderation, and name consistency. Send custom prompts to generate LLM-powered responses from transcripts. Receive results via polling, callback URLs, or account-level webhooks.
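To illustrate the subtitle output mentioned above, here is a minimal sketch of how timed transcript segments map onto the SRT format. The `Segment` class and its field names (`start`, `end`, `text`, `speaker`) are assumptions made for this example and do not reflect the provider's actual response schema; only the SRT layout itself (a counter, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` time line, the text, and a blank separator line) is standard.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Segment:
    """Hypothetical transcript segment; field names are illustrative only."""
    start: float                   # start time in seconds
    end: float                     # end time in seconds
    text: str                      # transcribed text
    speaker: Optional[int] = None  # diarization label, if one is available


def srt_timestamp(seconds: float) -> str:
    """Format seconds as HH:MM:SS,mmm (SRT puts a comma before the milliseconds)."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: List[Segment]) -> str:
    """Render segments as SRT cues, numbered from 1 and separated by blank lines."""
    if not segments:
        return ""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        # Prefix the cue with the speaker label when diarization data is present.
        prefix = f"[Speaker {seg.speaker}] " if seg.speaker is not None else ""
        time_line = f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}"
        blocks.append(f"{index}\n{time_line}\n{prefix}{seg.text.strip()}")
    return "\n\n".join(blocks) + "\n"
```

Calling `to_srt([Segment(0.0, 1.25, "Hello"), Segment(1.25, 3.0, "world", speaker=1)])` yields two numbered cues. A VTT variant would differ mainly in starting with a `WEBVTT` header line and using a period instead of a comma in timestamps.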
This integration is licensed under the AGPL-3.0 License.