From 922a3b2bb75ef4b904838d172cfaa0a3244e6af5 Mon Sep 17 00:00:00 2001 From: Naiyuan Qing <145280634+NevilleQingNY@users.noreply.github.com> Date: Mon, 9 Feb 2026 09:44:49 +0800 Subject: [PATCH] feat(skills): add whisper audio transcription skill Bundled skill that enables the agent to transcribe audio files using OpenAI Whisper CLI. Uses anyBins requirement so the skill is only visible when whisper is installed. Includes brew and uv install specs. Co-Authored-By: Claude Opus 4.6 --- skills/whisper/SKILL.md | 54 +++++++++++++++++++++++++++++++++++++++++ 1 file changed, 54 insertions(+) create mode 100644 skills/whisper/SKILL.md diff --git a/skills/whisper/SKILL.md b/skills/whisper/SKILL.md new file mode 100644 index 00000000..7edbb772 --- /dev/null +++ b/skills/whisper/SKILL.md @@ -0,0 +1,54 @@ +--- +name: Audio Transcription +description: Transcribe audio files using OpenAI Whisper CLI +version: 1.0.0 +metadata: + emoji: "🎙️" + always: true + requires: + anyBins: + - whisper + - whisper-cli + install: + - id: brew-whisper + kind: brew + formula: openai-whisper + bins: [whisper] + label: "Install OpenAI Whisper via Homebrew" + os: [darwin] + - id: pip-whisper + kind: uv + package: openai-whisper + bins: [whisper] + label: "Install OpenAI Whisper via pip/uv" + tags: + - audio + - transcription + - media +userInvocable: false +disableModelInvocation: false +--- + +## Audio Transcription + +When you receive a message indicating an audio or voice message file (e.g., `[audio message received]` with a `File:` path), you should transcribe it. + +### How to Transcribe + +Run the following command using the `exec` tool: + +``` +whisper "" --model turbo --output_format txt --output_dir /tmp +``` + +Then read the resulting `.txt` file (same name as input, in `/tmp/`) to get the transcript. + +### Response Format + +After transcription, respond naturally based on the transcribed content. If the user said something in the voice message, respond to it as if they had typed it. + +If transcription fails, let the user know and suggest they check their Whisper installation. + +### Supported Formats + +Whisper supports: mp3, mp4, mpeg, mpga, m4a, wav, webm, ogg, oga, flac