Skip to content

Providers and Models

OSTT can use cloud providers or local Whisper-compatible models. Run ostt model to choose the active model. Run ostt auth login first when you want to use a cloud provider.

Transcription request params are configured under [provider.params] or [provider."model".params], or passed per invocation with --param key=value. Use ostt model params PROVIDER/MODEL to list supported params for a cloud or local model.

For practical recommendations by use case, see Choosing a Transcription Model or Provider.

Local

Local models run on your machine and do not require an API key. Open the local model manager with:

bash
ostt model

Choose Local models to download curated models, activate downloaded models, delete model files, inspect metadata, or add a custom Hugging Face/direct model URL.

The curated model list is maintained in the kristoferlund/ostt-models repository. Open a pull request there to suggest a model for inclusion in the default list shown to all users.

Local transcription supports GPU acceleration through Metal on macOS and CUDA or Vulkan builds on Linux.

Local transcription requires WAV signed 16-bit PCM, 16 kHz, mono audio:

toml
[audio]
output_format = "pcm_s16le -ar 16000"

Local Whisper params use the same provider/model format as cloud providers, for example [whisper.params], [whisper.turbo.params], or --param language=sv. Use model IDs such as whisper/turbo. See Local Models for setup details and local param descriptions.

Cloud Provider Guides

ProviderModelsGuide
OpenAIopenai/gpt-4o-transcribe, openai/gpt-4o-mini-transcribe, openai/gpt-4o-transcribe-diarize, openai/whisper-1OpenAI
Deepgramdeepgram/nova-3, deepgram/nova-2Deepgram
Groqgroq/whisper-large-v3, groq/whisper-large-v3-turboGroq
DeepInfraWhisper, Whisper Large V3 Turbo, and Voxtral speech-recognition modelsDeepInfra
AssemblyAIassemblyai/universal-3-proAssemblyAI
Bergetberget/KBLab/kb-whisper-large, berget/NbAiLab/nb-whisper-large, berget/openai/whisper-large-v3Berget
ElevenLabselevenlabs/scribe_v2, elevenlabs/scribe_v1ElevenLabs
Mistralmistral/voxtral-mini-latest, mistral/voxtral-mini-2602Mistral

Landing Pages

These pages are broader introductions and workflow overviews: