AACFlow

Yandex SpeechKit

Synthesize speech and recognize audio with Yandex SpeechKit

Yandex SpeechKit is Yandex Cloud's speech technology platform offering text-to-speech (TTS) synthesis and speech-to-text (STT) recognition for Russian and other languages.

With the Yandex SpeechKit integration in AACFlow, you can:

  • Text-to-Speech (Synthesize): Convert text to audio in various voices and formats
  • Speech-to-Text (Recognize Short): Transcribe short audio files (under 1 minute) to text

This integration enables automated voice response systems, audio content generation, and speech-driven workflow triggers.

Usage Instructions

Add Yandex SpeechKit to a workflow to synthesize speech or transcribe audio. Both tools require a Yandex Cloud IAM token, which you can obtain with the Yandex Cloud IAM block in your workflow.

Tools

yandex_speechkit_tts

Text-to-speech synthesis

Input

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| iamToken | string | Yes | Yandex Cloud IAM token |
| text | string | Yes | Text to convert to speech |
| voice | string | No | Voice name (e.g., oksana, alena, filipp) |
| speed | number | No | Speech speed (0.1–3.0, default 1.0) |
| format | string | No | Audio format: ogg_opus, lpcm, mp3 |
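
The parameter constraints above can be checked before the tool runs. The following is a minimal sketch, assuming you assemble the input as a plain dictionary keyed by the parameter names in the table. The helper `build_tts_input` is hypothetical and not part of AACFlow or Yandex SpeechKit. How the dictionary is passed to the tool depends on your workflow setup and is not shown here.

```python
from typing import Any, Dict, Optional

# Format names taken from the parameter table above.
ALLOWED_FORMATS = {"ogg_opus", "lpcm", "mp3"}


def build_tts_input(
    iam_token: str,
    text: str,
    voice: Optional[str] = None,
    speed: Optional[float] = None,
    audio_format: Optional[str] = None,
) -> Dict[str, Any]:
    """Assemble and validate an input dict for yandex_speechkit_tts.

    Hypothetical helper: only the key names and value ranges come from
    the documentation; everything else is an illustration.
    """
    if not iam_token:
        raise ValueError("iamToken is required")
    if not text:
        raise ValueError("text is required")

    payload: Dict[str, Any] = {"iamToken": iam_token, "text": text}

    # Optional parameters are only included when explicitly set,
    # so the tool's own defaults apply otherwise.
    if voice is not None:
        payload["voice"] = voice
    if speed is not None:
        if not 0.1 <= speed <= 3.0:
            raise ValueError("speed must be between 0.1 and 3.0")
        payload["speed"] = speed
    if audio_format is not None:
        if audio_format not in ALLOWED_FORMATS:
            raise ValueError(f"format must be one of {sorted(ALLOWED_FORMATS)}")
        payload["format"] = audio_format
    return payload
```

Rejecting an out-of-range speed or an unknown format locally avoids spending a tool call on a request that the parameter table already rules out.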

Output

| Parameter | Type | Description |
| --- | --- | --- |
| audioData | string | Base64-encoded audio data |
| mimeType | string | Audio MIME type |
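
Because `audioData` is Base64-encoded, it has to be decoded before it can be played or stored. The sketch below shows one way to do that, assuming the tool output is available as a dictionary with the two keys listed above. The helper name `decode_tts_output` and the MIME-to-extension mapping are assumptions; the exact `mimeType` strings the tool returns are not documented here, so unknown values fall back to `.bin`.

```python
import base64
from typing import Tuple

# Assumed mapping from MIME type to file extension (not from the docs).
EXTENSIONS = {
    "audio/ogg": ".ogg",
    "audio/mpeg": ".mp3",
    "audio/wav": ".wav",
}


def decode_tts_output(output: dict) -> Tuple[bytes, str]:
    """Return the raw audio bytes and a suggested file extension.

    Hypothetical helper for the yandex_speechkit_tts output.
    """
    audio_bytes = base64.b64decode(output["audioData"])

    # Drop any parameters such as "; codecs=opus" before the lookup.
    mime = (output.get("mimeType") or "").split(";")[0].strip().lower()
    extension = EXTENSIONS.get(mime, ".bin")
    return audio_bytes, extension
```

A caller could then write the result to disk with `pathlib.Path("speech" + extension).write_bytes(audio_bytes)`.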

yandex_speechkit_stt

Speech-to-text recognition (short audio)

Input

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| iamToken | string | Yes | Yandex Cloud IAM token |
| audioData | string | Yes | Base64-encoded audio data |
| language | string | No | Language code (ru-RU, en-US) |
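
The `audioData` parameter expects Base64 text rather than raw bytes. As a minimal sketch, assuming the audio is already in memory as bytes, the input could be assembled as follows. The helper `build_stt_input` is hypothetical, and the sketch does not check the audio length or encoding.

```python
import base64
from typing import Dict, Optional


def build_stt_input(
    iam_token: str,
    audio_bytes: bytes,
    language: Optional[str] = None,
) -> Dict[str, str]:
    """Assemble an input dict for yandex_speechkit_stt (hypothetical helper)."""
    if not iam_token:
        raise ValueError("iamToken is required")
    if not audio_bytes:
        raise ValueError("audioData is required")

    payload = {
        "iamToken": iam_token,
        # b64encode returns bytes; decode to get a plain ASCII string.
        "audioData": base64.b64encode(audio_bytes).decode("ascii"),
    }
    if language is not None:
        payload["language"] = language
    return payload
```

To send a file, read it first with `pathlib.Path("clip.ogg").read_bytes()` (the file name is a placeholder) and pass the result as `audio_bytes`.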

Output

| Parameter | Type | Description |
| --- | --- | --- |
| text | string | Recognized text |
| confidence | number | Recognition confidence score |
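
A workflow that acts on transcripts may want to ignore low-confidence results. The sketch below illustrates one possible gate. It assumes `confidence` is a number between 0 and 1, which this page does not state, and the helper name `accept_transcript` and the default threshold of 0.8 are illustrative choices rather than documented values.

```python
from typing import Optional


def accept_transcript(output: dict, min_confidence: float = 0.8) -> Optional[str]:
    """Return the recognized text, or None if it is empty or low-confidence.

    Hypothetical helper; assumes confidence is on a 0 to 1 scale.
    """
    text = (output.get("text") or "").strip()
    if not text:
        return None

    confidence = output.get("confidence")
    # A missing confidence value is treated as acceptable here.
    if confidence is not None and confidence < min_confidence:
        return None
    return text
```

Returning `None` instead of raising lets the calling step decide whether to retry, ask the user to repeat, or skip the trigger.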
