openai-whisper-api

Transcribe audio with the OpenAI Audio Transcriptions API via curl, using gpt-4o-transcribe, gpt-4o-mini-transcribe, gpt-4o-transcribe-diarize, or whisper-1.

Source: GitHub

Updated: 2026-06-06

// Security assessment: low risk

Prompt only; does not execute code
Open source and auditable

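// Install

Copy the install instructions below and let your AI assistant complete the setup automatically (recommended for beginners).

Please help me install the "openai-whisper-api" skill from askskill:
1. Download https://raw.githubusercontent.com/openclaw/openclaw/main/skills/openai-whisper-api/SKILL.md
2. Save it as ~/.claude/skills/openai-whisper-api/SKILL.md
3. Once it is installed, reload the skills and tell me it is ready to use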

// Documentation

OpenAI transcriptions API

Transcribe audio through /v1/audio/transcriptions. Set OPENAI_BASE_URL for an OpenAI-compatible proxy or local gateway.
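As a rough illustration of the request behind that endpoint, here is a minimal sketch of a direct curl call. The function names transcription_url and transcribe_once are hypothetical, and the sketch assumes OPENAI_BASE_URL already ends in /v1 (the OpenAI SDK convention); the bundled script may build its URL differently.

```shell
#!/usr/bin/env bash
# Sketch only: assumes OPENAI_BASE_URL, when set, already includes the /v1 suffix.

# Print the transcription endpoint for the configured base URL.
transcription_url() {
  local base="${OPENAI_BASE_URL:-https://api.openai.com/v1}"
  base="${base%/}"   # drop one trailing slash, if present
  printf '%s/audio/transcriptions\n' "$base"
}

# Upload one audio file and print the response body (JSON by default).
# Usage: transcribe_once /path/to/audio.m4a [model]
transcribe_once() {
  local file="$1" model="${2:-gpt-4o-transcribe}"
  curl -sS "$(transcription_url)" \
    -H "Authorization: Bearer ${OPENAI_API_KEY:?OPENAI_API_KEY is not set}" \
    -F "file=@${file}" \
    -F "model=${model}"
}
```

With a proxy or local gateway, something like OPENAI_BASE_URL=http://localhost:8080/v1 would redirect the same request without other changes.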

Quick start

{baseDir}/scripts/transcribe.sh /path/to/audio.m4a

Defaults:

Model: gpt-4o-transcribe
Output: <input>.txt

Useful flags

{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model gpt-4o-transcribe --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model gpt-4o-mini-transcribe
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model gpt-4o-transcribe-diarize --json
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
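The flags above plausibly correspond to multipart form fields of the transcriptions API (model, language, prompt, response_format). The sketch below shows one possible mapping; form_fields is a hypothetical helper, and the choice of "text" as the default format with "json" selected by --json is an assumption, not the script's confirmed behavior.

```shell
#!/usr/bin/env bash
# Sketch only: a hypothetical flag-to-field mapping; the real script may differ.
# Prints one form field per line, suitable for passing to curl as -F "<line>".
# Usage: form_fields [MODEL] [LANGUAGE] [PROMPT] [FORMAT]
form_fields() {
  local model="${1:-gpt-4o-transcribe}" language="${2:-}" prompt="${3:-}"
  local format="${4:-text}"   # assumed: --json would select "json"
  printf 'model=%s\n' "$model"
  # Optional fields are emitted only when a value was supplied.
  if [ -n "$language" ]; then printf 'language=%s\n' "$language"; fi
  if [ -n "$prompt" ]; then printf 'prompt=%s\n' "$prompt"; fi
  printf 'response_format=%s\n' "$format"
}
```

For example, form_fields whisper-1 en "" json prints the three lines model=whisper-1, language=en, and response_format=json.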

Notes:

Supported upload formats include mp3, mp4, mpeg, mpga, m4a, wav, webm.
25 MB upload limit on the hosted API.
Use --model gpt-4o-transcribe-diarize for speaker labels; with that model the script sends chunking_strategy=auto and rejects --prompt.
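The size limit and the diarization rule in these notes can be checked before uploading. The following is a minimal sketch with hypothetical helper names (check_upload, diarize_fields); it assumes "25 MB" means 25 × 1024 × 1024 bytes, and it does not check the file extension because the format list above is not stated to be exhaustive.

```shell
#!/usr/bin/env bash
# Sketch only: hypothetical pre-flight checks derived from the notes above.
MAX_UPLOAD_BYTES=$((25 * 1024 * 1024))   # assumption: 25 MB counted as 25 MiB

# Succeed if the file exists and fits within the hosted upload limit.
check_upload() {
  local file="$1" size
  if [ ! -f "$file" ]; then
    echo "no such file: $file" >&2; return 1
  fi
  size="$(wc -c < "$file" | tr -d ' ')"   # tr strips padding some wc builds add
  if [ "$size" -gt "$MAX_UPLOAD_BYTES" ]; then
    echo "file exceeds the 25 MB limit: $file" >&2; return 1
  fi
}

# Print the extra form field for diarize models; reject a prompt for them.
# Usage: diarize_fields MODEL [PROMPT]
diarize_fields() {
  local model="$1" prompt="${2:-}"
  case "$model" in
    *-diarize)
      if [ -n "$prompt" ]; then
        echo "--prompt is not supported with $model" >&2; return 1
      fi
      printf 'chunking_strategy=auto\n' ;;
  esac
}
```

Running both checks before the upload gives a clear local error rather than a failed request.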

API key

Set OPENAI_API_KEY, or configure the key in the active OpenClaw config file ($OPENCLAW_CONFIG_PATH, default ~/.openclaw/openclaw.json) using the apiKey entry shown here. Setting OPENAI_BASE_URL is optional.

{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}
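To make the lookup order concrete, here is a small sketch of how a wrapper might decide where the key comes from. The helper names are hypothetical, the precedence of the environment variable over the config file is an assumption, and the sketch only checks that the config file exists; it does not parse it.

```shell
#!/usr/bin/env bash
# Sketch only: reports a likely key source; it never reads the key from the config.

# Print the active config path, honoring $OPENCLAW_CONFIG_PATH when set.
openclaw_config_path() {
  printf '%s\n' "${OPENCLAW_CONFIG_PATH:-$HOME/.openclaw/openclaw.json}"
}

# Print "env" or "config", or fail with a hint when neither is available.
api_key_source() {
  if [ -n "${OPENAI_API_KEY:-}" ]; then
    echo env      # assumed to take precedence over the config file
  elif [ -f "$(openclaw_config_path)" ]; then
    echo config   # the file exists; whether it holds apiKey is not verified
  else
    echo "Set OPENAI_API_KEY or add apiKey to $(openclaw_config_path)" >&2
    return 1
  fi
}
```

Calling api_key_source before a batch of transcriptions surfaces a missing key once, up front.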
