Overview
FalSTTService
provides speech-to-text capabilities using Fal’s Wizper API with Voice Activity Detection (VAD) to process only speech segments, optimizing API usage and improving response time.
API Reference
Complete API documentation and method details
Fal Docs
Official Fal Wizper documentation and features
Example Code
Working example with VAD integration
Installation
To use Fal services, install the required dependency:FAL_KEY
.
Get your API key from the Fal platform.
Frames
Input
InputAudioRawFrame
- Raw PCM audio data (16-bit, mono)UserStartedSpeakingFrame
- VAD detection of speech startUserStoppedSpeakingFrame
- VAD detection of speech end (triggers processing)STTUpdateSettingsFrame
- Runtime transcription configuration updatesSTTMuteFrame
- Mute audio input for transcription
Output
TranscriptionFrame
- Final transcription results after speech segment endsErrorFrame
- API or processing errors
Models
Fal offers the Wizper model with version options:Model | Version | Description |
---|---|---|
wizper | 3 | Latest Wizper model (default) |
wizper | 2 | Previous version for compatibility |
VAD-Based Processing
FalSTTService
extends SegmentedSTTService
, which uses Voice Activity Detection to process complete speech segments:
- Segment Processing: Only processes complete utterances, not continuous audio
- Audio Buffering: Maintains a 1-second buffer to capture speech before VAD detection
- VAD Requirement: Requires a VAD component like
SileroVADAnalyzer
in your transport
Language Support
View All Supported Languages (90+)
View All Supported Languages (90+)
Language Code | Description | Wizper Code |
---|---|---|
Language.AF | Afrikaans | af |
Language.AM | Amharic | am |
Language.AR | Arabic | ar |
Language.AS | Assamese | as |
Language.AZ | Azerbaijani | az |
Language.BA | Bashkir | ba |
Language.BE | Belarusian | be |
Language.BG | Bulgarian | bg |
Language.BN | Bengali | bn |
Language.BO | Tibetan | bo |
Language.BR | Breton | br |
Language.BS | Bosnian | bs |
Language.CA | Catalan | ca |
Language.CS | Czech | cs |
Language.CY | Welsh | cy |
Language.DA | Danish | da |
Language.DE | German | de |
Language.EL | Greek | el |
Language.EN | English | en |
Language.ES | Spanish | es |
Language.ET | Estonian | et |
Language.EU | Basque | eu |
Language.FA | Persian | fa |
Language.FI | Finnish | fi |
Language.FO | Faroese | fo |
Language.FR | French | fr |
Language.GL | Galician | gl |
Language.GU | Gujarati | gu |
Language.HA | Hausa | ha |
Language.HE | Hebrew | he |
Language.HI | Hindi | hi |
Language.HR | Croatian | hr |
Language.HT | Haitian Creole | ht |
Language.HU | Hungarian | hu |
Language.HY | Armenian | hy |
Language.ID | Indonesian | id |
Language.IS | Icelandic | is |
Language.IT | Italian | it |
Language.JA | Japanese | ja |
Language.JW | Javanese | jw |
Language.KA | Georgian | ka |
Language.KK | Kazakh | kk |
Language.KM | Khmer | km |
Language.KN | Kannada | kn |
Language.KO | Korean | ko |
Language.LA | Latin | la |
Language.LB | Luxembourgish | lb |
Language.LN | Lingala | ln |
Language.LO | Lao | lo |
Language.LT | Lithuanian | lt |
Language.LV | Latvian | lv |
Language.MG | Malagasy | mg |
Language.MI | Maori | mi |
Language.MK | Macedonian | mk |
Language.ML | Malayalam | ml |
Language.MN | Mongolian | mn |
Language.MR | Marathi | mr |
Language.MS | Malay | ms |
Language.MT | Maltese | mt |
Language.MY | Burmese | my |
Language.NE | Nepali | ne |
Language.NL | Dutch | nl |
Language.NN | Norwegian Nynorsk | nn |
Language.NO | Norwegian | no |
Language.OC | Occitan | oc |
Language.PA | Punjabi | pa |
Language.PL | Polish | pl |
Language.PS | Pashto | ps |
Language.PT | Portuguese | pt |
Language.RO | Romanian | ro |
Language.RU | Russian | ru |
Language.SA | Sanskrit | sa |
Language.SD | Sindhi | sd |
Language.SI | Sinhala | si |
Language.SK | Slovak | sk |
Language.SL | Slovenian | sl |
Language.SN | Shona | sn |
Language.SO | Somali | so |
Language.SQ | Albanian | sq |
Language.SR | Serbian | sr |
Language.SU | Sundanese | su |
Language.SV | Swedish | sv |
Language.SW | Swahili | sw |
Language.TA | Tamil | ta |
Language.TE | Telugu | te |
Language.TG | Tajik | tg |
Language.TH | Thai | th |
Language.TK | Turkmen | tk |
Language.TL | Tagalog | tl |
Language.TR | Turkish | tr |
Language.TT | Tatar | tt |
Language.UK | Ukrainian | uk |
Language.UR | Urdu | ur |
Language.UZ | Uzbek | uz |
Language.VI | Vietnamese | vi |
Language.YI | Yiddish | yi |
Language.YO | Yoruba | yo |
Language.ZH | Chinese | zh |
Language.EN
- English -en
Language.ES
- Spanish -es
Language.FR
- French -fr
Language.DE
- German -de
Language.IT
- Italian -it
Language.JA
- Japanese -ja
Usage Example
Basic Configuration
Initialize theFalSTTService
and use it in a pipeline:
Dynamic Configuration
Make settings updates by pushing anSTTUpdateSettingsFrame
for the FalSTTService
:
Metrics
The service provides performance metrics:- Time to First Byte (TTFB) - Latency from audio input to first transcription
- Processing Duration - Total time spent processing audio
Learn how to enable Metrics in your Pipeline.
Additional Notes
- VAD Dependency: Requires a VAD component in your transport for speech segment detection
- Segment Processing: Processes complete utterances rather than streaming audio
- Translation Support: Can translate foreign speech directly to English when using
translate
task - Error Handling: Comprehensive error handling for API failures and network issues