Speech to Text

curl --request POST \
  --url https://api.skiesoft.com/v1/audio/transcriptions \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form file='@example-file' \
  --form model=thiannu-v1

{
  "task": "transcribe",
  "duration": 6.5,
  "text": "你好，歡迎使用聽有 AI。這是一個測試錄音。",
  "segments": [
    {
      "id": 0,
      "start": 0,
      "end": 6.5,
      "text": "你好，歡迎使用聽有 AI。這是一個測試錄音。",
      "speaker": 1
    }
  ],
  "num_speakers": 1
}

POST /audio/transcriptions


Authorizations

Authorization (string, header, required)

Authenticate with your API key. Format: Bearer YOUR_API_KEY
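As a minimal sketch of the header format described above, the following Python helper builds the Authorization header from an API key. The function names and the SKIESOFT_API_KEY environment variable are hypothetical, chosen only for this example; the documentation specifies just the "Bearer YOUR_API_KEY" format.

```python
import os


def build_auth_header(api_key: str) -> dict[str, str]:
    """Return the header in the documented 'Bearer YOUR_API_KEY' format."""
    key = api_key.strip()
    if not key:
        raise ValueError("API key is empty")
    return {"Authorization": f"Bearer {key}"}


def auth_header_from_env(var_name: str = "SKIESOFT_API_KEY") -> dict[str, str]:
    """Read the key from an environment variable.

    The variable name is an assumption made for this sketch, not part of the API.
    """
    return build_auth_header(os.environ.get(var_name, ""))
```

Reading the key from the environment keeps it out of source control; any other secret store would work the same way.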

Body

multipart/form-data

The audio file and recognition parameters.

file (required)

The audio file object to transcribe (not the file name). Supported formats: mp3, wav, webm.
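A client may want to reject unsupported files before uploading them. The sketch below checks the file extension against the three formats listed above. It assumes the extension reflects the actual encoding; the server may apply its own checks, and the helper name is hypothetical.

```python
from pathlib import Path

# Formats listed in the description of the `file` parameter.
SUPPORTED_FORMATS = {"mp3", "wav", "webm"}


def check_audio_path(path: str) -> Path:
    """Return the path if its extension is a documented format, else raise."""
    p = Path(path)
    ext = p.suffix.lower().lstrip(".")
    if ext not in SUPPORTED_FORMATS:
        raise ValueError(
            f"unsupported format {ext!r}; expected one of {sorted(SUPPORTED_FORMATS)}"
        )
    return p
```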

model (enum<string>, required)

The ID of the model to use.

Available options: thiannu-v1

Example: "thiannu-v1"

speaker_diarization (boolean, default: false)

Enables speaker diarization, which identifies and distinguishes between different speakers. This feature is exclusive to 軟雲聽有 AI.

Example: true
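The body parameters above can be combined into a single multipart/form-data request. The following is a sketch using only the Python standard library; it builds the request without sending it. Two details are assumptions rather than documented behavior: the boolean is encoded as the literal string "true" or "false", and the file part is labeled application/octet-stream. The helper names are hypothetical.

```python
import urllib.request
import uuid

API_URL = "https://api.skiesoft.com/v1/audio/transcriptions"


def encode_multipart(fields: dict[str, str], filename: str, audio: bytes) -> tuple[bytes, str]:
    """Encode text fields plus one `file` part; return (body, content_type)."""
    boundary = uuid.uuid4().hex
    parts: list[bytes] = []
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\nContent-Disposition: form-data; name="{name}"\r\n\r\n'
            f"{value}\r\n".encode()
        )
    # Assumption: a generic binary content type is acceptable for the audio part.
    parts.append(
        f'--{boundary}\r\nContent-Disposition: form-data; name="file"; '
        f'filename="{filename}"\r\nContent-Type: application/octet-stream\r\n\r\n'.encode()
        + audio
        + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode())
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def build_request(api_key: str, filename: str, audio: bytes,
                  speaker_diarization: bool = False) -> urllib.request.Request:
    """Build (but do not send) a transcription request."""
    fields = {
        "model": "thiannu-v1",
        # Assumption: the boolean is sent as the string "true" or "false".
        "speaker_diarization": "true" if speaker_diarization else "false",
    }
    body, content_type = encode_multipart(fields, filename, audio)
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={"Authorization": f"Bearer {api_key}", "Content-Type": content_type},
    )
```

To send the request, pass the returned object to `urllib.request.urlopen` and decode the JSON body of the response. Note that the Content-Type header carries the boundary, which is why it is generated together with the body rather than hard-coded.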

Response

Transcription succeeded.

task (enum<string>, required)

The type of task that was performed.

Available options: transcribe

Example: "transcribe"

duration (number, required)

The duration of the audio, in seconds.

Example: 6.5

text (string, required)

The full transcribed text.

Example: "你好，歡迎使用聽有 AI。這是一個測試錄音。"

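segments (object[], required)

Segments extracted from the audio, including timestamps, labels, and other metadata. Included only when the timestamp_granularities request parameter contains segment.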


num_speakers (number)

The number of speakers detected.
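To illustrate how the response fields fit together, the sketch below turns a decoded response into one timestamped line per segment and counts the distinct speaker labels that appear in the segments. The helper names are hypothetical, and the handling of a missing "speaker" key is an assumption about responses produced with diarization disabled. The sample reuses the segment from the example response on this page.

```python
import json


def format_timestamp(seconds: float) -> str:
    """Render seconds as MM:SS.s, rounding to tenths before splitting."""
    minutes, tenths = divmod(round(seconds * 10), 600)
    return f"{minutes:02d}:{tenths / 10:04.1f}"


def transcript_lines(response: dict) -> list[str]:
    """Return one '[start-end] speaker N: text' line per segment."""
    lines = []
    for seg in response["segments"]:
        # Assumption: "speaker" may be absent when diarization is disabled.
        speaker = seg.get("speaker")
        label = f"speaker {speaker}" if speaker is not None else "speaker ?"
        span = f"{format_timestamp(seg['start'])}-{format_timestamp(seg['end'])}"
        lines.append(f"[{span}] {label}: {seg['text']}")
    return lines


def count_speakers(response: dict) -> int:
    """Count distinct speaker labels that actually appear in the segments."""
    return len({seg["speaker"] for seg in response["segments"] if "speaker" in seg})


SAMPLE = json.loads("""
{
  "task": "transcribe",
  "duration": 6.5,
  "text": "你好，歡迎使用聽有 AI。這是一個測試錄音。",
  "segments": [
    {"id": 0, "start": 0, "end": 6.5,
     "text": "你好，歡迎使用聽有 AI。這是一個測試錄音。", "speaker": 1}
  ]
}
""")
```

Counting speakers from the segments gives a client-side value that can be compared against num_speakers when that field is present.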
