Search results
728 packages found
Tiny JavaScript tokenizer.
A tool set for CSS: fast detailed parser (CSS → AST), walker (AST traversal), generator (AST → CSS) and lexer (validation and matching) based on specs and browser implementations
A promise based streaming tokenizer
Small CommonMark-compliant Markdown parser with positional info and concrete tokens
Tokenize CSS
Tokenizes a string that represents a regular expression.
TypeScript definition for strtok3 token
Tokenized zip support
A micro-library of stream components for building custom JSON and JSONC processing pipelines with a minimal memory footprint — parse, filter, and transform JSON far larger than available memory with a SAX-inspired token API, on Node.js or Web Streams.
- json
- json-parser
- parser
- stream
- streaming
- streaming-json
- sax
- tokenizer
- pipeline
- filter
- jsonc
- web-streams
- large-files
- memory-efficient
Lexer / tokenizer
Chevrotain is a high-performance, fault-tolerant JavaScript parsing DSL for building recursive descent parsers
Fast 0-deps bash parser written in TypeScript
A tokenizer for Sass' SCSS syntax
A tool set for CSS: fast detailed parser (CSS → AST), walker (AST traversal), generator (AST → CSS) and lexer (validation and matching) based on specs and browser implementations
General natural language (tokenizing, stemming (English, Russian, Spanish), part-of-speech tagging, sentiment analysis, classification, inflection, phonetics, tfidf, WordNet, jaro-winkler, Levenshtein distance, Dice's Coefficient) facilities for node.
- natural language processing
- artificial intelligence
- statistics
- Porter stemmer
- Lancaster stemmer
- tokenizer
- bigram
- trigram
- quadgram
- ngram
- stemmer
- bayes
- classifier
- phonetic
Read/write stream of GLSL tokens
Trim the whitespace within an array of GLSL tokens
Parse parentheses from a string
A pure JavaScript implementation of a BPE tokenizer (Encoder/Decoder) for GPT-2 / GPT-3 / GPT-4 and other OpenAI models
- BPE
- encoder
- decoder
- tokenizer
- GPT
- GPT-2
- GPT-3
- GPT-3.5
- GPT-4
- GPT-4o
- NLP
- Natural Language Processing
- Text Generation
- OpenAI
Fast token estimation at 96% accuracy of a full tokenizer in a 2kB bundle