AACFlow

Speech-to-Text

Convert speech to text using AI

Usage Instructions

Transcribe audio and video files to text using leading AI providers. Supports multiple languages, timestamps, and speaker diarization.

Tools

stt_whisper

Input

ParameterTypeRequiredDescription
providerstringYesNo description
apiKeystringYesNo description
modelstringNoNo description
audioFilefileNoNo description
audioFileReferencefileNoNo description
audioUrlstringNoNo description
languagestringNoLanguage code (e.g., "en", "es", "fr") or "auto" for auto-detection
timestampsstringNoNo description
translateToEnglishbooleanNoNo description
promptstringNoOptional text to guide the model's style or continue a previous audio segment. Helps with proper nouns and context.
temperaturenumberNoSampling temperature between 0 and 1. Higher values make output more random, lower values more focused and deterministic.
responseFormatstringNoOutput format for the transcription (e.g., "json", "text", "srt", "verbose_json", "vtt")

Output

This tool does not produce any outputs.

stt_deepgram

Input

ParameterTypeRequiredDescription
providerstringYesNo description
apiKeystringYesNo description
modelstringNoNo description
audioFilefileNoNo description
audioFileReferencefileNoNo description
audioUrlstringNoNo description
languagestringNoLanguage code (e.g., "en", "es", "fr") or "auto" for auto-detection
timestampsstringNoNo description
diarizationbooleanNoNo description

Output

This tool does not produce any outputs.

stt_elevenlabs

Input

ParameterTypeRequiredDescription
providerstringYesNo description
apiKeystringYesNo description
modelstringNoNo description
audioFilefileNoNo description
audioFileReferencefileNoNo description
audioUrlstringNoNo description
languagestringNoLanguage code (e.g., "en", "es", "fr") or "auto" for auto-detection
timestampsstringNoNo description

Output

This tool does not produce any outputs.

stt_assemblyai

Input

ParameterTypeRequiredDescription
providerstringYesNo description
apiKeystringYesNo description
modelstringNoNo description
audioFilefileNoNo description
audioFileReferencefileNoNo description
audioUrlstringNoNo description
languagestringNoLanguage code (e.g., "en", "es", "fr") or "auto" for auto-detection
timestampsstringNoNo description
diarizationbooleanNoNo description
sentimentbooleanNoNo description
entityDetectionbooleanNoNo description
piiRedactionbooleanNoNo description
summarizationbooleanNoNo description

Output

This tool does not produce any outputs.

stt_gemini

Input

ParameterTypeRequiredDescription
providerstringYesNo description
apiKeystringYesNo description
modelstringNoNo description
audioFilefileNoNo description
audioFileReferencefileNoNo description
audioUrlstringNoNo description
languagestringNoLanguage code (e.g., "en", "es", "fr") or "auto" for auto-detection
timestampsstringNoNo description

Output

This tool does not produce any outputs.

On this page

Start building today
Trusted by over 100,000 builders.
The SaaS platform to build AI agents and run your agentic workforce.
Get started