Search results
14 packages found
Sort by: Default
- Default
- Most downloaded this week
- Most downloaded this month
- Most dependents
- Recently published
Enforce real-time token budgets and spending limits for OpenAI, Anthropic Claude, and Google Gemini API calls in Node.js
Information on LLM models, context window token limit, output token limit, pricing and more
- llm
- language model
- model
- info
- eval
- prompt
- token
- context-window
- pricing
- token-limit
- provider
- gpt
- claude
- gemini
- View more
Your AI session died. Your work didn't. The checkpoint system for AI coding sessions — token limits, session swaps, tool switches, pick up where you left off.
Circuit breaker for AI agents — pick budget-guard or loop-killer mode and stop runaway token spend or stuck agents in one wrapper. Adapters for LangChain, OpenAI Agents SDK, and the Claude Agent SDK.
- langchain
- langchainjs
- langgraph
- langgraph-sdk
- openai
- openai-agents
- claude
- anthropic
- claude-agent-sdk
- vercel
- vercel-ai-sdk
- ai-sdk
- agent
- circuit-breaker
- View more
Runtime policy layer for LLM applications — enforce cost, privacy, and runtime guardrails on every model call
- llm
- ai
- sdk
- guardrails
- openai
- anthropic
- agent
- runtime
- loop-detection
- cost-control
- budget
- pii
- privacy
- fallback
- View more
Context compaction plugin for OpenClaw - works with local models (MLX, llama.cpp) that don't report token limits
Smart, persistent context memory for RAG applications. Stop managing chat history arrays manually.
High-performance streaming Markdown text splitter for LLM pipelines and RAG systems. Zero sequence loss for code blocks, tables, links, and videos. 5 built-in strategies + custom. Zero dependencies.
- markdown
- text-splitter
- llm
- rag
- chunking
- streaming
- zero-loss
- code-blocks
- semantic
- tokenizer
- langchain
- vector-database
- embeddings
- ai
- View more
Transparent MCP sidecar proxy. Intercepts JSON-RPC traffic, analyzes payloads in real-time, and shows you exactly what vurb.ts would fix — without changing your code.
- mcp
- model-context-protocol
- mcp-proxy
- mcp-server
- mcp-debug
- mcp-intercept
- mcp-sidecar
- mcp-diagnostics
- mcp-monitor
- mcp-tools
- mcp-payload
- mcp-pii
- mcp-security
- token-limit
- View more
Diagnostic interceptor for raw MCP servers. Detects architectural flaws in real-time and prescribes the fix.
- mcp
- model-context-protocol
- mcp-debug
- mcp-doctor
- mcp-diagnostics
- mcp-error
- mcp-crash
- mcp-logger
- mcp-monitor
- mcp-tools
- mcp-server
- mcp-troubleshoot
- mcp-payload
- token-limit
- View more
Smartly prune massive JSON/strings for LLM context optimization with cost estimation.
Policy & governance for AI & LLM usage
- llm
- llm-extension
- bantai
- policy-engine
- token-control
- token-quota
- token-limit
- token-limit-extension
- token-limit-policy
- token-limit-rule
- token-limit-context
- token-limit-input
- token-limit-output
- llm-governance
Budget limits and cost guardrails for AI agents. Prevents runaway API spend with hard budget enforcement, circuit breakers, and per-agent cost tracking.
Keep Pi runs controlled with automatic context-budget wrap-up and kill thresholds.