Transcriptions (Speech-to-Text)

Transcribes audio into text. Compatible with the OpenAI Whisper API.

POST https://api.wrouter.io/v1/audio/transcriptions

The request body is multipart/form-data.

Form fields

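| Field | Type | Required | Description |
| --- | --- | --- | --- |
| file | file | Yes | Audio file. Supported formats include mp3, mp4, m4a, wav, webm, and flac. Maximum size 25 MB. |
| model | string | Yes | `whisper-1`, `gpt-4o-transcribe`, `paraformer-v2` |
| language | string | No | ISO-639-1 code, such as `zh` or `en`. Auto-detected if omitted. |
| prompt | string | No | Prompt text. Can be used to hint at style or proper nouns. |
| response_format | string | No | `json` (default), `text`, `srt`, `verbose_json`, `vtt` |
| temperature | number | No | Between 0 and 1. Default 0. |
| timestamp_granularities | array | No | `["word"]` or `["segment"]`. Requires `response_format=verbose_json`. |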
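
The constraints in the field table can be checked on the client before uploading. The following is a minimal sketch, not part of the API: `build_transcription_fields` is a hypothetical helper, and reading "25 MB" as 25 × 1024 × 1024 bytes is an assumption. It validates the inputs and returns the non-file form fields.

```python
from typing import Optional, Sequence

MAX_FILE_BYTES = 25 * 1024 * 1024  # assumption: "25 MB" interpreted as 25 MiB
RESPONSE_FORMATS = {"json", "text", "srt", "verbose_json", "vtt"}
GRANULARITIES = {"word", "segment"}


def build_transcription_fields(
    file_size: int,
    model: str,
    language: Optional[str] = None,
    prompt: Optional[str] = None,
    response_format: str = "json",
    temperature: float = 0.0,
    timestamp_granularities: Optional[Sequence[str]] = None,
) -> dict:
    """Hypothetical helper: validate inputs against the field table and
    return the non-file form fields as a dict."""
    if file_size > MAX_FILE_BYTES:
        raise ValueError("audio file exceeds the 25 MB limit")
    if response_format not in RESPONSE_FORMATS:
        raise ValueError(f"unsupported response_format: {response_format!r}")
    if not 0.0 <= temperature <= 1.0:
        raise ValueError("temperature must be between 0 and 1")

    fields = {
        "model": model,
        "response_format": response_format,
        "temperature": temperature,
    }
    # Optional fields are only sent when the caller sets them.
    if language is not None:
        fields["language"] = language
    if prompt is not None:
        fields["prompt"] = prompt
    if timestamp_granularities:
        # The table states that timestamps need the verbose_json format.
        if response_format != "verbose_json":
            raise ValueError(
                "timestamp_granularities requires response_format=verbose_json"
            )
        unknown = set(timestamp_granularities) - GRANULARITIES
        if unknown:
            raise ValueError(f"unknown granularities: {sorted(unknown)}")
        fields["timestamp_granularities"] = list(timestamp_granularities)
    return fields
```

Failing early on the client avoids uploading a large file only to have the request rejected for an invalid field combination.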

Examples

```bash
curl https://api.wrouter.io/v1/audio/transcriptions \
  -H "Authorization: Bearer $WROUTER_API_KEY" \
  -F file=@./meeting.mp3 \
  -F model=whisper-1 \
  -F language=zh \
  -F response_format=srt
```

```python
from openai import OpenAI

client = OpenAI(api_key="sk-...", base_url="https://api.wrouter.io/v1")
with open("audio.mp3", "rb") as f:
    text = client.audio.transcriptions.create(
        model="whisper-1",
        file=f,
        language="zh",
    ).text
```
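
The curl example requests `response_format=srt`, so the response body is subtitle text rather than JSON. As a rough sketch of how that text might be consumed, the snippet below parses standard SubRip cues into start and end times in seconds. `Cue` and `parse_srt` are hypothetical names, and the code assumes the service returns conventional SRT (an index line, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` timing line, then the text).

```python
import re
from typing import List, NamedTuple


class Cue(NamedTuple):
    start: float  # seconds from the beginning of the audio
    end: float
    text: str


_TIME = re.compile(r"(\d+):(\d{2}):(\d{2})[,.](\d{3})")


def _seconds(stamp: str) -> float:
    """Convert an SRT timestamp such as 00:01:05,250 to seconds."""
    match = _TIME.fullmatch(stamp.strip())
    if match is None:
        raise ValueError(f"bad SRT timestamp: {stamp!r}")
    hours, minutes, secs, millis = (int(g) for g in match.groups())
    return hours * 3600 + minutes * 60 + secs + millis / 1000


def parse_srt(srt: str) -> List[Cue]:
    """Hypothetical helper: split SRT text into cues."""
    cues = []
    # Cues are separated by one or more blank lines.
    for block in re.split(r"\n\s*\n", srt.replace("\r\n", "\n").strip()):
        lines = block.split("\n")
        # Locate the timing line; skip blocks that do not have one.
        timing = next((i for i, line in enumerate(lines) if "-->" in line), None)
        if timing is None:
            continue
        start, end = lines[timing].split("-->")
        text = "\n".join(lines[timing + 1:]).strip()
        cues.append(Cue(_seconds(start), _seconds(end), text))
    return cues
```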

Translations endpoint

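POST /v1/audio/translations translates audio in any language into English. It accepts the same parameters as the transcriptions endpoint, except that the `language` field is ignored.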
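
A call to this endpoint through the OpenAI Python SDK might look like the sketch below. `translate_to_english` is a hypothetical wrapper. It assumes `client` is an `OpenAI` instance whose `base_url` is `https://api.wrouter.io/v1`, as in the Python example above, and that `whisper-1` is accepted by the translations endpoint, which this page does not state explicitly.

```python
def translate_to_english(client, audio_path: str, model: str = "whisper-1") -> str:
    """Hypothetical wrapper: upload an audio file to the translations
    endpoint and return the English text.

    `client` is expected to be an OpenAI-SDK-style client configured with
    base_url="https://api.wrouter.io/v1".
    """
    with open(audio_path, "rb") as audio:
        # No `language` argument is passed, since the endpoint ignores it.
        result = client.audio.translations.create(model=model, file=audio)
    return result.text
```

Taking the client as a parameter keeps the API key and base URL in one place and lets the wrapper be exercised with a stub client in tests.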