openai-whisper-api

by Unknown v1.0.0

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

What It Does

Transcribes audio files to text using the OpenAI Whisper API. It supports various audio formats and allows customization of the transcription process.

When To Use

When you need to convert audio recordings into text for documentation, analysis, or accessibility purposes.

Inputs

Audio file (e.g., .m4a, .ogg), OpenAI API Key, optional parameters for model, output path, language, and prompt.

Outputs

Text transcription of the audio file, in plain text or JSON format.

Limitations

Requires an OpenAI API key and internet access. Transcription accuracy depends on the quality of the audio and the chosen Whisper model. Rate limits and usage costs apply based on OpenAI's API pricing.

Installation

1. Install `curl` on your system. 2. Set the `OPENAI_API_KEY` environment variable in your shell environment. 3. Download the skill's scripts and make them executable (e.g., `chmod +x transcribe.sh`).

View Cline documentation

1. Ensure you have `curl` installed. 2. Set the `OPENAI_API_KEY` environment variable in your Cursor IDE environment. 3. Copy the skill's scripts directory to a location accessible by Cursor.

View Cursor IDE documentation

1. Install `curl` on your system. 2. Set the `OPENAI_API_KEY` environment variable in your shell environment. 3. Download the skill's scripts and make them executable (e.g., `chmod +x transcribe.sh`).

View Aider documentation

Have a Skill to Share?

Join the community and help AI agents learn new capabilities. Submit your skill and reach thousands of developers.