Inputs
| Parameter | Description | Data Type | Required | Range |
|---|---|---|---|---|
audio | Audio to transcribe. | AUDIO | Yes | - |
model | Model to use for transcription. Selecting this model reveals additional parameters. | COMBO | Yes | "scribe_v2" |
tag_audio_events | Annotate sounds like (laughter), (music), etc. in transcript. This parameter is revealed when the "scribe_v2" model is selected. (default: False) | BOOLEAN | No | - |
diarize | Annotate which speaker is talking. This parameter is revealed when the "scribe_v2" model is selected. (default: False) | BOOLEAN | No | - |
diarization_threshold | Speaker separation sensitivity. Lower values are more sensitive to speaker changes. This parameter is revealed when the "scribe_v2" model is selected and diarize is enabled. (default: 0.22) | FLOAT | No | 0.1 - 0.4 |
temperature | Randomness control. 0.0 uses model default. Higher values increase randomness. This parameter is revealed when the "scribe_v2" model is selected. (default: 0.0) | FLOAT | No | 0.0 - 2.0 |
timestamps_granularity | Timing precision for transcript words. This parameter is revealed when the "scribe_v2" model is selected. (default: “word”) | COMBO | No | "word""character""none" |
language_code | ISO-639-1 or ISO-639-3 language code (e.g., ‘en’, ‘es’, ‘fra’). Leave empty for automatic detection. (default: "") | STRING | No | - |
num_speakers | Maximum number of speakers to predict. Set to 0 for automatic detection. (default: 0) | INT | No | 0 - 32 |
seed | Seed for reproducibility (determinism not guaranteed). (default: 1) | INT | No | 0 - 2147483647 |
num_speakers parameter cannot be set to a value greater than 0 when the diarize option is enabled. You must either disable diarize or set num_speakers to 0.
Outputs
| Output Name | Description | Data Type |
|---|---|---|
text | The transcribed text from the audio. | STRING |
language_code | The detected language code of the audio. | STRING |
words_json | A JSON-formatted string containing detailed word-level information, including timestamps and speaker labels if enabled. | STRING |
This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! Edit on GitHub
Source fingerprint (SHA-256):
1541ae5542b83d80cb96ab0c8694c25b2bd9d4f10c1030902064d05319b2a520