Gladia

Transcribe audio and video files to text using asynchronous or real-time streaming modes. Supports 100+ languages with automatic language detection and code-switching. Perform speaker diarization to identify different speakers. Translate transcripts into multiple target languages. Generate summaries, sentiment analysis, named entity recognition, and chapter segmentation from audio. Extract structured data and produce subtitles in SRT/VTT formats. Apply custom vocabulary and spelling corrections, content moderation, and name consistency. Send custom prompts to generate LLM-powered responses from transcripts. Receive results via polling, callback URLs, or account-level webhooks.
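Of the three delivery options above, polling is the one a caller has to implement. The following is a minimal sketch of such a loop, not this integration's actual code. The helper name `wait_for_transcript`, the injected `fetch_job` callable, and the status strings `"done"` and `"error"` are assumptions made for illustration; check the Gladia API reference for the real response shape.

```python
import time
from typing import Any, Callable, Dict


def wait_for_transcript(
    fetch_job: Callable[[], Dict[str, Any]],
    interval_s: float = 2.0,
    max_attempts: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Poll a transcription job until it finishes.

    fetch_job is a hypothetical callable that performs one status request
    and returns the parsed JSON body as a dict. The "status" key and its
    values ("done", "error") are assumed, not taken from this README.
    """
    for attempt in range(max_attempts):
        job = fetch_job()
        status = job.get("status")
        if status == "done":
            return job
        if status == "error":
            raise RuntimeError(f"transcription failed: {job}")
        # Any other status (for example queued or processing): wait and retry,
        # but do not sleep after the final attempt.
        if attempt < max_attempts - 1:
            sleep(interval_s)
    raise TimeoutError(f"job not finished after {max_attempts} attempts")
```

In practice `fetch_job` would wrap an authenticated HTTP GET against the job's result URL. Passing it in as a callable keeps the loop independent of any HTTP client and easy to test. For long recordings, a callback URL or webhook avoids the repeated requests altogether.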

License

This integration is licensed under the AGPL-3.0 License.

Built with ❤️ by Metorial