将文本批量转换为语音旁白与无障碍朗读,支持音频提示生成。
复制安装指令,让 AI 自动完成配置 · 推荐新手
请帮我安装 askskill 上的 "speech" 技能: 1. 下载 https://raw.githubusercontent.com/openai/skills/main/skills/.curated/speech/SKILL.md 2. 保存为 ~/.claude/skills/speech/SKILL.md 3. 装好后重载技能,告诉我可以用了
Generate spoken audio for the current project (narration, product demo voiceover, IVR prompts, accessibility reads). Defaults to gpt-4o-mini-tts-2025-12-15 and built-in voices, and prefers the bundled CLI for deterministic, reproducible runs.
scripts/text_to_speech.py) with sensible defaults (see references/cli.md).tmp/speech/ for intermediate files (for example JSONL batches); delete when done.output/speech/ when working in this repo.--out or --out-dir to control output paths; keep filenames stable and descriptive.Prefer uv for dependency management.
Python packages:
uv pip install openai
If uv is unavailable:
python3 -m pip install openai
OPENAI_API_KEY must be set for live API calls.If the key is missing, give the user these steps:
OPENAI_API_KEY as an environment variable in their system.If installation isn't possible in this environment, tell the user which dependency is missing and how to install it locally.
gpt-4o-mini-tts-2025-12-15 unless the user requests another model.cedar. If the user wants a brighter tone, prefer marin.instructions are supported for GPT-4o mini TTS models, but not for tts-1 or tts-1-hd.--rpm at 50.OPENAI_API_KEY before any live API call.openai package) for all API calls; do not use raw HTTP.scripts/text_to_speech.py) over writing new one-off scripts.scripts/text_to_speech.py. If something is missing, ask the user before doing anything else.Reformat user direction into a short, labeled spec. Only make implicit details explicit; do not invent new requirements.
Quick clarification (augmentation vs invention):
Template (include only relevant lines):
Voice Affect: <overall character and texture of the voice>
Tone: <attitude, formality, warmth>
Pacing: <slow, steady, brisk>
Emotion: <key emotions to convey>
Pronunciation: <words to enunciate or emphasize>
Pauses: <where to add intentional pauses>
Emphasis: <key words or phrases to stress>
…
用于构建、脚手架生成、重构并排查 ChatGPT Apps SDK 应用问题。
帮助你基于接口文档或脚本快速生成可组合的命令行工具
帮助你将应用与基础设施部署、发布并托管到 Cloudflare 平台。
帮助用户把模糊想法转成具体、可衡量的目标与成功标准。
将 Figma 设计组件与代码组件建立映射,便于设计与实现保持一致。
帮助你构建、评审与重构 ASP.NET Core Web 应用及架构方案
将音频或视频中的语音转成文字,并可区分说话人与整理访谈记录。
通过 MCP 封装 OpenAI 图像与音频生成接口,便于快速集成调用
统一浏览并运行多种AI应用,生成图像、视频、音频等结果
提供 OpenAI 产品与 API 官方文档、模型选型及升级指引。
帮助用户制作适配 Slack 的动画 GIF,并检查规格与动画效果。
用于创建、编辑与优化AI技能,并评测其效果与触发准确性。