Built by Metorial, the integration platform for agentic AI.

Learn More

    Provider Summary

    • transcribe audio files

    • real-time speech-to-text

    • speaker diarization

    • translate transcripts

    • generate summaries

    • sentiment analysis

    • named entity recognition

    • generate subtitles

    • extract structured data

    • custom prompt responses

Gladia

Transcribe audio and video files to text using asynchronous or real-time streaming modes. Supports 100+ languages with automatic language detection and code-switching. Perform speaker diarization to identify different speakers. Translate transcripts into multiple target languages. Generate summaries, sentiment analysis, named entity recognition, and chapter segmentation from audio. Extract structured data and produce subtitles in SRT/VTT formats. Apply custom vocabulary and spelling corrections, content moderation, and name consistency. Send custom prompts to generate LLM-powered responses from transcripts. Receive results via polling, callback URLs, or account-level webhooks.

License

This integration is licensed under the AGPL-3.0 License.

Built with ❤️ by Metorial