npm

Search results

357 packages found

Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

published version 1.2.0, 3 years ago3 dependents licensed under $MIT
1,773,245

n8n node for integrating Palatine Speech API into workflow

published version 1.1.1, 16 days ago0 dependents licensed under $MIT
347,704

Add real-time speech to text functionality into your website with no effort

published version 2.0.0, a month ago8 dependents licensed under $MIT
73,283

Polyfill Web Speech API with Cognitive Services Speech-to-Text service

published version 8.1.4, 6 months ago13 dependents licensed under $MIT
83,395

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

published version 1.13.2, a month ago0 dependents licensed under $Apache-2.0
75,572

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

published version 1.13.2, a month ago24 dependents licensed under $Apache-2.0
72,010

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

published version 1.13.2, a month ago10 dependents licensed under $Apache-2.0
60,819

Voice pipeline for Cloudflare Agents — STT, TTS, VAD, streaming, and SFU utilities

published version 0.3.0, 20 hours ago2 dependents licensed under $MIT
67,268

Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.

published version 1.143.3, 5 months ago0 dependents licensed under $MIT
60,990

Kaldi in-browser speech recognition based on a WASM build of the Vosk library

published version 0.0.8, 3 years ago6 dependents licensed under $Apache-2.0
35,934

Official Soniox SDK for Node

published version 2.1.0, 25 days ago1 dependents licensed under $MIT
40,713

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

published version 1.13.2, a month ago0 dependents licensed under $Apache-2.0
30,500

High-performance speech-to-text inference addon using NVIDIA Parakeet models for Bare runtime

published version 0.7.2, 2 days ago1 dependents licensed under $Apache-2.0
26,481

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

published version 1.13.2, a month ago1 dependents licensed under $Apache-2.0
24,144

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

published version 1.13.2, a month ago0 dependents licensed under $Apache-2.0
26,768

Universal AI Development Platform with working MCP integration, multi-provider support, voice (TTS/STT/realtime), and professional CLI. 58+ external MCP servers discoverable, multimodal file processing, RAG pipelines. Build, test, and deploy AI applicatio

published version 9.70.4, 7 minutes ago6 dependents licensed under $MIT
24,194

Speech-to-text recognition API for Tauri with multi-language support

published version 0.2.0, a month ago0 dependents licensed under $MIT
24,692

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

published version 1.13.2, a month ago0 dependents licensed under $Apache-2.0
19,019

**QVAC SDK** is the canonical entry point to develop AI applications with QVAC.

published version 0.12.2, 9 days ago3 dependents licensed under $Apache-2.0
19,246

Official Soniox SDK for client-side applications

published version 2.1.0, 25 days ago1 dependents licensed under $MIT
15,536