Transcribe
Turn audio into text via the OpenAI-compatible endpoint. Three models:
gpt-4o-transcribe (most accurate), gpt-4o-mini-transcribe (cheaper), and
whisper (billed per minute).
POST https://api.nexotao.com/v1/audio/transcriptions (multipart/form-data)

Example
curl

    curl https://api.nexotao.com/v1/audio/transcriptions \
      -H "Authorization: Bearer sk-nexo-..." \
      -F file=@meeting.mp3 \
      -F model=gpt-4o-transcribe \
      -F language=id

Python

    from openai import OpenAI

    # Point the OpenAI SDK at the OpenAI-compatible endpoint.
    client = OpenAI(base_url="https://api.nexotao.com/v1", api_key="sk-nexo-...")

    # Open the audio in binary mode; the SDK uploads it as multipart/form-data.
    with open("meeting.mp3", "rb") as f:
        t = client.audio.transcriptions.create(
            model="gpt-4o-transcribe",
            file=f,
            language="id",
        )
    print(t.text)

Node.js

    import OpenAI from "openai"
    import fs from "fs"

    // Point the OpenAI SDK at the OpenAI-compatible endpoint.
    const client = new OpenAI({ baseURL: "https://api.nexotao.com/v1", apiKey: "sk-nexo-..." })

    // Stream the audio file from disk as the upload body.
    const t = await client.audio.transcriptions.create({
      model: "gpt-4o-transcribe",
      file: fs.createReadStream("meeting.mp3"),
      language: "id",
    })
    console.log(t.text)

Parameters & file limits

| Parameter | Required | Description |
|---|---|---|
| file | yes | audio file (mp3, wav, m4a, webm, mp4, mpeg, mpga, ogg, flac) |
| model | yes | gpt-4o-transcribe, gpt-4o-mini-transcribe, or whisper |
| language | no | ISO language code, e.g. id for Indonesian; supplying it speeds up transcription and improves accuracy |
Maximum file size: 25 MB.
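
Checking a file locally before uploading can save a failed request. The sketch below is not part of the API: `check_audio_file` is a hypothetical helper, the extension list is copied from the table above, and reading "25 MB" as 25 × 1024 × 1024 bytes is an assumption (the endpoint may count megabytes differently). It also inspects only the file extension, not the actual audio encoding.

```python
from pathlib import Path

# Extensions copied from the parameter table above.
SUPPORTED_EXTENSIONS = {"mp3", "wav", "m4a", "webm", "mp4", "mpeg", "mpga", "ogg", "flac"}

# Assumption: "25 MB" is interpreted as 25 MiB; the endpoint may differ.
MAX_FILE_BYTES = 25 * 1024 * 1024


def check_audio_file(path: str) -> list[str]:
    """Return a list of problems; an empty list means the file looks uploadable."""
    p = Path(path)
    problems = []

    # Compare the extension case-insensitively, without the leading dot.
    ext = p.suffix.lower().lstrip(".")
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported extension: {ext or '(none)'}")

    if not p.is_file():
        problems.append("file not found")
    elif p.stat().st_size > MAX_FILE_BYTES:
        problems.append(
            f"file is {p.stat().st_size} bytes, over the {MAX_FILE_BYTES}-byte limit"
        )
    return problems
```

Calling `check_audio_file("meeting.mp3")` before `client.audio.transcriptions.create(...)` lets the caller split or re-encode an oversized recording instead of waiting for the upload to be rejected.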
Billing
gpt-4o-transcribe and gpt-4o-mini-transcribe: per audio token plus output text tokens.
whisper: per minute of audio duration.
Exact rates are in Models & Pricing. The balance is deducted after the transcript is complete.
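
For whisper, the per-minute rule makes a rough cost estimate possible before a file is sent. The following is only a sketch: `estimate_whisper_cost` is a hypothetical helper, the rate must be supplied by the caller from Models & Pricing, and it assumes partial minutes are prorated by the second, which this page does not state. The token-billed models are not covered, since their cost depends on token counts that are not known in advance.

```python
from decimal import Decimal


def estimate_whisper_cost(duration_seconds: float, rate_per_minute: Decimal) -> Decimal:
    """Estimate the charge for a whisper transcription.

    Assumption: billing is prorated by the second; the actual rounding
    rule is not documented here and may differ (e.g. rounding up).
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    # Convert via str() so float inputs do not carry binary rounding noise.
    minutes = Decimal(str(duration_seconds)) / Decimal(60)
    return minutes * rate_per_minute
```

For example, a 90-second clip is 1.5 minutes, so the estimate is 1.5 times whatever per-minute rate is passed in.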