ai-gateway

      Create transcription Copy

      POST
      /v1/audio/transcriptions
      Original reference: https://platform.openai.com/docs/api-reference/audio/createTranscription

      Request parameters

      Authorization
      Add an Authorization parameter to the request header. Its value is the token appended after "Bearer ".
      Example:
      Authorization: Bearer ********************
      Body parameters (multipart/form-data)
      model
      string
      Required
      ID of the model to use. Only whisper-1 (which is powered by OpenAI's open source Whisper V2 model) is currently available.
      Example value:
      whisper-1
      file
      file
      Required
      The audio file object (not the file name) to transcribe, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.
      Example value:
      file:///Users/huxiao/Downloads/2024-02-22 14-47-35.mp3
      language
      string
      Optional
      The language of the input audio. Supplying the input language in ISO-639-1 format will improve accuracy and latency.
      prompt
      string
      Optional
      Optional text to guide the model's style or to continue a previous audio segment. The prompt should match the audio language.
      response_format
      string
      Optional
      The format of the transcript output, one of: json, text, srt, verbose_json, or vtt.
      Example value:
      json
      temperature
      string
      Optional
      The sampling temperature, between 0 and 1. Higher values such as 0.8 make the output more random, while lower values such as 0.2 make it more focused and deterministic. If set to 0, the model uses log probability to automatically increase the temperature until certain thresholds are hit.
      Example value:
      0
      timestamp_granularities
      array[string]
      Optional
      The timestamp granularities to populate for this transcription. response_format must be set to verbose_json to use timestamp granularities. Either or both of these options are supported: word and segment. Note: segment timestamps add no latency, but generating word timestamps incurs additional latency.
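      The constraints in the parameter list above can be checked on the client before a request is sent. The following is a minimal sketch, not part of the gateway's documentation: the helper name build_form_fields is hypothetical, and sending array values under the form key timestamp_granularities[] is an assumption about how the gateway expects arrays in multipart/form-data.

```python
# Hypothetical helper: collects and validates the text fields described above.
# The file part is handled separately; only the text parameters are checked here.

ALLOWED_FORMATS = {"json", "text", "srt", "verbose_json", "vtt"}
ALLOWED_GRANULARITIES = {"word", "segment"}


def build_form_fields(model="whisper-1", language=None, prompt=None,
                      response_format=None, temperature=None,
                      timestamp_granularities=None):
    """Return a list of (name, value) pairs for the multipart body."""
    fields = [("model", model)]
    if language is not None:
        fields.append(("language", language))
    if prompt is not None:
        fields.append(("prompt", prompt))
    if response_format is not None:
        if response_format not in ALLOWED_FORMATS:
            raise ValueError(f"unsupported response_format: {response_format!r}")
        fields.append(("response_format", response_format))
    if temperature is not None:
        if not 0 <= float(temperature) <= 1:
            raise ValueError("temperature must be between 0 and 1")
        fields.append(("temperature", str(temperature)))
    if timestamp_granularities:
        # The documentation requires verbose_json for timestamp granularities.
        if response_format != "verbose_json":
            raise ValueError(
                "timestamp_granularities requires response_format='verbose_json'"
            )
        for granularity in timestamp_granularities:
            if granularity not in ALLOWED_GRANULARITIES:
                raise ValueError(f"unsupported granularity: {granularity!r}")
            # Assumption: array values are sent as repeated "name[]" form keys.
            fields.append(("timestamp_granularities[]", granularity))
    return fields
```

      For example, build_form_fields(response_format="verbose_json", timestamp_granularities=["word"]) returns the model, response_format, and one timestamp_granularities[] entry, while omitting response_format in the same call raises ValueError.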

      Sample code

      Request example (Shell)
      curl --location --request POST 'https://api.aigateway.work/v1/audio/transcriptions' \
      --header 'Authorization: Bearer ********************' \
      --form 'model="whisper-1"' \
      --form 'file=@"/Users/huxiao/Downloads/2024-02-22 14-47-35.mp3"'
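      The same request can be assembled in Python using only the standard library. This is a sketch under stated assumptions: the helper names encode_multipart and build_request are hypothetical, the token is a placeholder, and the Authorization header follows the "Bearer" scheme described in the request parameters. The code only builds the request object; the network call is left as a comment.

```python
import urllib.request
import uuid

API_URL = "https://api.aigateway.work/v1/audio/transcriptions"


def encode_multipart(fields, filename, file_bytes):
    """Encode text fields plus one file part as multipart/form-data.

    Assumes names and the filename contain no double quotes or line breaks.
    """
    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields:
        parts.append(
            (f"--{boundary}\r\n"
             f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
             f"{value}\r\n").encode("utf-8")
        )
    # The audio goes in the part named "file", as in the curl example.
    parts.append(
        (f"--{boundary}\r\n"
         f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
         "Content-Type: application/octet-stream\r\n\r\n").encode("utf-8")
    )
    parts.append(file_bytes)
    parts.append(f"\r\n--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def build_request(token, fields, filename, file_bytes):
    """Return a POST request for the transcription endpoint (not yet sent)."""
    body, content_type = encode_multipart(fields, filename, file_bytes)
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": content_type,
        },
    )


# To send it (requires a valid token and network access):
# with open("audio.mp3", "rb") as f:
#     req = build_request("YOUR_TOKEN", [("model", "whisper-1")], "audio.mp3", f.read())
# with urllib.request.urlopen(req) as resp:
#     print(resp.read().decode("utf-8"))
```

      Building the body by hand keeps the sketch dependency-free; a third-party HTTP client would usually handle the multipart encoding itself.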

      Response

      🟢 200 Success
      application/json
      Body
      text
      string
      Required
      The transcribed text.
      Example
      {
        "text": "你好,我是Chad GPT,有什么可以帮您?"
      }
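      A client can read the transcript from the JSON body shown above. The sketch below assumes the response uses the json format with the required text field; the helper name extract_text is hypothetical, and other response_format values such as text, srt, or vtt are presumably not JSON and would need different handling.

```python
import json


def extract_text(response_body):
    """Return the transcribed text from a JSON response body (str or bytes)."""
    payload = json.loads(response_body)
    if not isinstance(payload, dict) or not isinstance(payload.get("text"), str):
        raise ValueError("response is missing the required 'text' field")
    return payload["text"]


sample = '{"text": "hello, how can I help you?"}'
print(extract_text(sample))
```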
      Last modified: 2025-07-17 09:17:39