Skip to main content
Glama

Audio Transcriber MCP Server

by Ichigo3766

OpenAI Speech-to-Text transcriptions MCP Server

A MCP server that provides audio transcription capabilities using OpenAI's API.

Installation

Setup

  1. Clone the repository:
git clone https://github.com/Ichigo3766/audio-transcriber-mcp.git cd audio-transcriber-mcp
  1. Install dependencies:
npm install
  1. Build the server:
npm run build
  1. Set up your OpenAI API key in your environment variables.
  2. Add the server configuration to your environment:
{ "mcpServers": { "audio-transcriber": { "command": "node", "args": [ "/path/to/audio-transcriber-mcp/build/index.js" ], "env": { "OPENAI_API_KEY": "", "OPENAI_BASE_URL": "", // Optional "OPENAI_MODEL": "" // Optional } } } }

Replace /path/to/audio-transcriber-mcp with the actual path where you cloned the repository.

Features

Tools

  • transcribe_audio - Transcribe audio files using OpenAI's API
    • Takes filepath as a required parameter
    • Optional parameters:
      • save_to_file: Boolean to save transcription to a file
      • language: ISO-639-1 language code (e.g., "en", "es")

License

This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.

Install Server
A
security – no known vulnerabilities
A
license - permissive license
A
quality - confirmed to work

remote-capable server

The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.

A MCP server that enables transcription of audio files using OpenAI's Speech-to-Text API, with support for multiple languages and file saving options.

  1. Installation
    1. Setup
  2. Features
    1. Tools
  3. License

    Related MCP Servers

    • -
      security
      A
      license
      -
      quality
      An MCP server that enables LLMs to generate spoken audio from text using OpenAI's Text-to-Speech API, supporting various voices, models, and audio formats.
      Last updated -
      2
      1
      JavaScript
      MIT License
    • -
      security
      F
      license
      -
      quality
      An MCP server that downloads videos/extracts audio from various platforms like YouTube, Bilibili, and TikTok, then transcribes them to text using OpenAI's Whisper model.
      Last updated -
      5
      Python
      • Linux
      • Apple
    • -
      security
      A
      license
      -
      quality
      An MCP server that enables AI assistants to search, analyze, and retrieve information about audio samples from Freesound.org through their API.
      Last updated -
      JavaScript
      MIT License
      • Apple
    • -
      security
      A
      license
      -
      quality
      An MCP server that provides deep knowledge about OpenAI APIs and SDKs, enabling users to query technical information through various MCP clients including ChatGPT Deep Research, Cursor, and OpenAI Responses API.
      Last updated -
      9
      TypeScript
      MIT License

    View all related MCP servers

    MCP directory API

    We provide all the information about MCP servers via our MCP API.

    curl -X GET 'https://glama.ai/api/mcp/v1/servers/Ichigo3766/audio-transcriber-mcp'

    If you have feedback or need assistance with the MCP directory API, please join our Discord server