MS-루시디아-음성-게이트웨이-MCP

Windows 기본 음성 서비스를 사용하여 텍스트-음성 변환 및 음성-텍스트 변환 기능을 제공하는 모델 컨텍스트 프로토콜(MCP) 서버입니다. 이 서버는 PowerShell 명령을 통해 기본 Windows 음성 API(SAPI)를 활용하므로 외부 API나 서비스가 필요하지 않습니다.

특징

Windows SAPI 음성을 사용한 텍스트 음성 변환(TTS)
Windows 음성 인식을 사용한 음성-텍스트 변환(STT)
테스트를 위한 간단한 웹 인터페이스
외부 API 종속성 없음
기본 Windows 기능을 사용합니다

필수 조건

음성 인식이 활성화된 Windows 10/11
노드.js 16+
파워셸

설치

저장소를 복제합니다.

지엑스피1

종속성 설치:

npm install

프로젝트를 빌드하세요:

npm run build

용법

테스트 인터페이스

테스트 서버를 시작합니다.

npm run test

브라우저에서 http://localhost:3000 엽니다.
웹 인터페이스를 사용하여 TTS 및 STT 기능을 테스트하세요

사용 가능한 도구

텍스트 음성 변환

Windows SAPI를 사용하여 텍스트를 음성으로 변환합니다.

매개변수:

text (필수): 음성으로 변환할 텍스트
voice (선택 사항): 사용할 음성(예: "Microsoft David Desktop")
speed (선택 사항): 음성 속도 0.5~2.0(기본값: 1.0)

예:

fetch('http://localhost:3000/tts', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({
    text: "Hello, this is a test",
    voice: "Microsoft David Desktop",
    speed: 1.0
  })
});

음성을 텍스트로 변환

Windows 음성 인식을 사용하여 오디오를 녹음하고 텍스트로 변환합니다.

매개변수:

duration (선택 사항): 녹음 지속 시간(초)(기본값: 5, 최대: 60)

예:

fetch('http://localhost:3000/stt', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({
    duration: 5
  })
}).then(response => response.json())
  .then(data => console.log(data.text));

문제 해결

Windows 음성 인식이 활성화되어 있는지 확인하세요.
- Windows 설정 열기
- 시간 및 언어 > 음성으로 이동하세요
- 음성 인식 활성화
사용 가능한 음성을 확인하세요:
- PowerShell을 열고 GXP7을 실행합니다.
음성 인식 테스트:
- Windows 설정에서 음성 인식 열기
- 아직 설정 마법사를 실행하지 않았다면 실행하세요.
- Windows가 사용자의 음성을 인식하는지 테스트합니다.

기여하다

저장소를 포크하세요
기능 브랜치를 생성하세요
변경 사항을 커밋하세요
지점으로 밀어 넣기
새로운 풀 리퀘스트 만들기

특허

MIT

This server cannot be installed

security - not tested

license - not found

quality - not tested

How are these scores calculated?

local-only server

The server can only run on the client's local machine because it depends on local resources.

외부 종속성 없이 Windows의 기본 음성 서비스를 사용하여 텍스트-음성 변환 및 음성-텍스트 변환 기능을 제공하는 서버입니다.

Related Resources

Reddit Discussion about this server

Related MCP Servers

Kokoro TTS MCP Server
giannisanni
-
security
F
license
-
quality
Provides text-to-speech capabilities through the Model Context Protocol, allowing applications to easily integrate speech synthesis with customizable voices, adjustable speech speed, and cross-platform audio playback support.
Last updated -
7
Python
Gladia MCPofficial
gladiaio
-
security
A
license
-
quality
Official Model Context Protocol server that enables interaction with powerful Speech-to-Text and Audio Intelligence APIs, allowing clients like Claude Desktop to transcribe audio, analyze speech, translate content, and more.
Last updated -
2
Python
MIT License
Edge-TTS MCP Server
yuiseki
-
security
F
license
-
quality
A Model Context Protocol server that provides text-to-speech functionality for AI agents using Microsoft Edge's text-to-speech technology, supporting multiple voices, languages, and voice customization.
Last updated -
4
Python
VOICEVOX MCP Server
Yuki10Kobayashi
A
security
A
license
A
quality
A Model Context Protocol server that integrates with VOICEVOX engine to provide text-to-speech synthesis and speaker information retrieval, allowing users to generate and play voice audio from text.
Last updated -
2
TypeScript
MIT License

View all related MCP servers

MS-Lucidia-Voice-Gateway-MCP

MS-루시디아-음성-게이트웨이-MCP

특징

필수 조건

설치

용법

테스트 인터페이스

사용 가능한 도구

텍스트 음성 변환

음성을 텍스트로 변환

문제 해결

기여하다

특허

Related Resources

Related MCP Servers

Kokoro TTS MCP Server

Gladia MCPofficial

Edge-TTS MCP Server

VOICEVOX MCP Server

Appeared in Searches

New MCP Servers

MCP directory API