keywords:stt - npm search

@deepgram/captions

Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

deepgramai

published version 1.2.0, 3 years ago3 dependents licensed under $MIT

1,773,245

n8n-nodes-palatine-speech

n8n node for integrating Palatine Speech API into workflow

palatine_zealot

published version 1.1.1, 16 days ago0 dependents licensed under $MIT

347,704

speech-to-element

Add real-time speech to text functionality into your website with no effort

ovidijusparsiunas

published version 2.0.0, a month ago8 dependents licensed under $MIT

73,283

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Speech-to-Text service

GitHub Actions

published version 8.1.4, 6 months ago13 dependents licensed under $MIT

83,395

sherpa-onnx-linux-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

GitHub Actions

published version 1.13.2, a month ago0 dependents licensed under $Apache-2.0

75,572

sherpa-onnx-node

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

GitHub Actions

published version 1.13.2, a month ago24 dependents licensed under $Apache-2.0

72,010

sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

GitHub Actions

published version 1.13.2, a month ago10 dependents licensed under $Apache-2.0

60,819

@cloudflare/voice

Voice pipeline for Cloudflare Agents — STT, TTS, VAD, streaming, and SFU utilities

GitHub Actions

published version 0.3.0, 20 hours ago2 dependents licensed under $MIT

67,268

Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.

lobehubbot

published version 1.143.3, 5 months ago0 dependents licensed under $MIT

60,990

vosk-browser

Kaldi in-browser speech recognition based on a WASM build of the Vosk library

ccoreilly

published version 0.0.8, 3 years ago6 dependents licensed under $Apache-2.0

35,934

@soniox/node

Official Soniox SDK for Node

slava.soniox

published version 2.1.0, 25 days ago1 dependents licensed under $MIT

40,713

sherpa-onnx-darwin-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

GitHub Actions

published version 1.13.2, a month ago0 dependents licensed under $Apache-2.0

30,500

@qvac/transcription-parakeet

High-performance speech-to-text inference addon using NVIDIA Parakeet models for Bare runtime

GitHub Actions

published version 0.7.2, 2 days ago1 dependents licensed under $Apache-2.0

26,481

sherpa-onnx-linux-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

GitHub Actions

published version 1.13.2, a month ago1 dependents licensed under $Apache-2.0

24,144

sherpa-onnx-win-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

GitHub Actions

published version 1.13.2, a month ago0 dependents licensed under $Apache-2.0

26,768

@juspay/neurolink

Universal AI Development Platform with working MCP integration, multi-provider support, voice (TTS/STT/realtime), and professional CLI. 58+ external MCP servers discoverable, multimodal file processing, RAG pipelines. Build, test, and deploy AI applicatio