Transcriptions (Speech-to-Text)
将音频转写为文本,兼容 OpenAI Whisper API。
POST https://api.wrouter.io/v1/audio/transcriptions请求为 multipart/form-data。
表单字段
| 字段 | 类型 | 必填 | 说明 |
|---|---|---|---|
file | file | ✓ | 音频文件,支持 mp3 mp4 m4a wav webm flac 等,<= 25MB |
model | string | ✓ | 如 whisper-1、gpt-4o-transcribe、paraformer-v2 |
language | string | ISO-639-1 代码,如 zh、en;不填则自动检测 | |
prompt | string | 提示文本,可用于风格 / 专有名词提示 | |
response_format | string | json(默认)、text、srt、verbose_json、vtt | |
temperature | number | 0~1,默认 0 | |
timestamp_granularities | array | ["word"] 或 ["segment"],需配合 response_format=verbose_json |
示例
bash
curl https://api.wrouter.io/v1/audio/transcriptions \
-H "Authorization: Bearer $WROUTER_API_KEY" \
-F file=@./meeting.mp3 \
-F model=whisper-1 \
-F language=zh \
-F response_format=srtpython
from openai import OpenAI
client = OpenAI(api_key="sk-...", base_url="https://api.wrouter.io/v1")
with open("audio.mp3","rb") as f:
text = client.audio.transcriptions.create(
model="whisper-1",
file=f,
language="zh",
).textTranslations 端点
POST /v1/audio/translations 将任意语言音频翻译为英文,参数同上,但 language 字段会被忽略。