Search results
357 packages found
Sort by: Default
- Default
- Most downloaded this week
- Most downloaded this month
- Most dependents
- Recently published
Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
- audio
- javascript
- youtube
- typescript
- sdk
- ffmpeg
- speech
- subtitles
- srt
- webvtt
- speech-to-text
- transcription
- stt
- asr
- View more
n8n node for integrating Palatine Speech API into workflow
- n8n-community-node-package
- n8n
- palatine
- speech-to-text
- transcribe
- transcription
- stt
- audio
- ai
- automation
- voice-to-text
- speech-recognition
- audio-transcription
- audio2text
- View more
Add real-time speech to text functionality into your website with no effort
Polyfill Web Speech API with Cognitive Services Speech-to-Text service
- cognitive services
- dictation
- microphone
- polyfill
- react
- speak
- speech recognition
- speech synthesis
- speech to text
- speechsynthesis
- stt
- text to speech
- tts
- unified speech
- View more
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- embedded systems
- open source
- zipformer
- asr
- tts
- stt
- c++
- onnxruntime
- onnx
- View more
Voice pipeline for Cloudflare Agents — STT, TTS, VAD, streaming, and SFU utilities
Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.
Kaldi in-browser speech recognition based on a WASM build of the Vosk library
Official Soniox SDK for Node
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
High-performance speech-to-text inference addon using NVIDIA Parakeet models for Bare runtime
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Universal AI Development Platform with working MCP integration, multi-provider support, voice (TTS/STT/realtime), and professional CLI. 58+ external MCP servers discoverable, multimodal file processing, RAG pipelines. Build, test, and deploy AI applicatio
- ai
- llm
- mcp
- model-context-protocol
- lighthouse
- tool-orchestration
- ai-platform
- openai
- anthropic
- bedrock
- vertex
- azure
- mistral
- View more
Speech-to-text recognition API for Tauri with multi-language support
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
**QVAC SDK** is the canonical entry point to develop AI applications with QVAC.
- ai
- bare
- cross-platform
- deep-learning
- diffusion
- edge-ai
- embeddings
- expo
- gguf
- gpu
- holepunch
- hyperswarm
- image-generation
- inference
- View more
Official Soniox SDK for client-side applications