Optimized Combinators

Parserator provides optimized combinators that use specialized implementations for common parsing patterns. These combinators can be significantly faster than their generic equivalents, especially for repetitive operations.

When to Use Optimized Combinators

Optimized combinators are ideal for:

High-frequency operations: Parsing whitespace, digits, or identifiers that appear many times
Performance-critical parsers: When parsing speed matters for large inputs
Simple character classes: When you’re matching well-defined character sets

Performance Comparison

// Generic approach (slower)
const digits = many(digit).map(chars => chars.join(''))

// Optimized approach (faster)
const digits = manyDigit().map(chars => chars.join(''))

The optimized version uses direct string operations and avoids parser overhead, resulting in better performance for large inputs.

Character Repetition

manyChar

Matches zero or more occurrences of a specific character.

const manyChar: <T extends string>(ch: T) => Parser<T[]>

string

required

The character to match (must be a single character)

Example:

const spaces = manyChar(' ')
spaces.run('    hello') // Right([[' ', ' ', ' ', ' '], {...}])
spaces.run('hello') // Right([[], {...}])

Performance: Optimized for direct character comparison without parser overhead.

manyDigit

Matches zero or more digit characters (0-9).

const manyDigit: () => Parser<string[]>

Example:

const digits = manyDigit()
digits.run('12345abc') // Right([['1', '2', '3', '4', '5'], {...}])
digits.run('abc') // Right([[], {...}])

Usage:

// Parse an integer
const integer = manyDigit().map(digits => parseInt(digits.join(''), 10))

many1Digit

Matches one or more digit characters (0-9). Fails if no digits are found.

const many1Digit: () => Parser<string[]>

Example:

const digits = many1Digit()
digits.run('12345abc') // Right([['1', '2', '3', '4', '5'], {...}])
digits.run('abc') // Left(error: Expected at least one digit)

Usage:

// Parse a non-empty number
const number = many1Digit().map(digits => digits.join(''))

manyAlphabet

Matches zero or more alphabetic characters (a-z, A-Z).

const manyAlphabet: () => Parser<string[]>

Example:

const letters = manyAlphabet()
letters.run('Hello123') // Right([['H', 'e', 'l', 'l', 'o'], {...}])
letters.run('123') // Right([[], {...}])

Usage:

// Parse a word
const word = manyAlphabet().map(chars => chars.join(''))

many1Alphabet

Matches one or more alphabetic characters (a-z, A-Z). Fails if no letters are found.

const many1Alphabet: () => Parser<string[]>

Example:

const letters = many1Alphabet()
letters.run('Hello123') // Right([['H', 'e', 'l', 'l', 'o'], {...}])
letters.run('123') // Left(error: Expected at least one letter)

manyAlphanumeric

Matches zero or more alphanumeric characters (a-z, A-Z, 0-9).

const manyAlphanumeric: () => Parser<string[]>

Example:

const identifier = manyAlphanumeric()
identifier.run('var123_') // Right([['v', 'a', 'r', '1', '2', '3'], {...}])
identifier.run('_') // Right([[], {...}])

Usage:

// Parse an identifier (simplified)
const ident = parser(function* () {
  const first = yield* alphabet
  const rest = yield* manyAlphanumeric()
  return first + rest.join('')
})

many1Alphanumeric

Matches one or more alphanumeric characters (a-z, A-Z, 0-9). Fails if none are found.

const many1Alphanumeric: () => Parser<string[]>

Example:

const chars = many1Alphanumeric()
chars.run('abc123') // Right([['a', 'b', 'c', '1', '2', '3'], {...}])
chars.run('_') // Left(error: Expected at least one alphanumeric character)

Whitespace

manyWhitespace

Matches zero or more whitespace characters (space, tab, newline, carriage return).

const manyWhitespace: () => Parser<string[]>

Example:

const ws = manyWhitespace()
ws.run('  \t\nhello') // Right([[' ', ' ', '\t', '\n'], {...}])
ws.run('hello') // Right([[], {...}])

skipWhitespace

Skips zero or more whitespace characters without collecting results.

const skipWhitespace: () => Parser<void>

Example:

const parser = parser(function* () {
  yield* skipWhitespace()  // Skip leading whitespace
  const value = yield* number
  return value
})

parser.run('   42') // Right([42, {...}])

Performance: More efficient than manyWhitespace() when you don’t need the characters.

Character Sets

oneOfChars

Matches any single character from the given set.

const oneOfChars: (chars: string) => Parser<string>

chars

string

required

String of characters to match (must be at least one character)

Example:

const vowel = oneOfChars('aeiouAEIOU')
vowel.run('apple') // Right(['a', {...}])
vowel.run('xyz') // Left(error: Expected one of: aeiouAEIOU)

Usage:

// Parse hex digit
const hexDigit = oneOfChars('0123456789abcdefABCDEF')

anyOfStrings

Matches any of the given strings, trying longest matches first.

const anyOfStrings: (...strings: string[]) => Parser<string>

strings

string[]

required

Array of strings to match

Example:

const operator = anyOfStrings('++', '+', '--', '-')
operator.run('++x') // Right(['++', {...}])
operator.run('+x') // Right(['+', {...}])

Note: Automatically sorts strings by length (longest first) to ensure correct greedy matching.

Predicates

takeWhileChar

Takes characters while a predicate returns true.

const takeWhileChar: (predicate: (ch: string) => boolean) => Parser<string>

predicate

(ch: string) => boolean

required

Function that returns true for characters to consume

Example:

const identifier = takeWhileChar(ch => /[a-zA-Z0-9_]/.test(ch))
identifier.run('var_name123 = 5') // Right(['var_name123', {...}])

Usage:

// Take alphanumeric characters
const alphanum = takeWhileChar(ch => 
  (ch >= 'a' && ch <= 'z') || 
  (ch >= 'A' && ch <= 'Z') || 
  (ch >= '0' && ch <= '9')
)

// Take until whitespace
const word = takeWhileChar(ch => ch !== ' ' && ch !== '\t' && ch !== '\n')

takeUntilChar

Takes characters until a predicate returns true.

const takeUntilChar: (predicate: (ch: string) => boolean) => Parser<string>

predicate

(ch: string) => boolean

required

Function that returns true to stop consuming

Example:

const untilQuote = takeUntilChar(ch => ch === '"')
untilQuote.run('hello"world') // Right(['hello', {...}])

Note: This is equivalent to takeWhileChar(ch => !predicate(ch)).

Fixed Length

takeN

Takes exactly n characters from the input.

const takeN: (n: number) => Parser<string>

number

required

Number of characters to take

Example:

const threeChars = takeN(3)
threeChars.run('abcdef') // Right(['abc', {...}])
threeChars.run('ab') // Left(error: Expected 3 characters but only 2 remaining)

Usage:

// Parse a fixed-length code
const countryCode = takeN(2)  // 'US', 'UK', etc.
const zipCode = takeN(5)      // '12345'

Best Practices

Choose the Right Combinator

// ❌ Less efficient
const spaces = many(char(' '))

// ✅ More efficient
const spaces = manyChar(' ')

Combine with Generic Combinators

const identifier = parser(function* () {
  // Optimized for common case
  const first = yield* alphabet
  const rest = yield* manyAlphanumeric()
  
  // Generic combinators for specific logic
  yield* optional(char('!'))
  
  return first + rest.join('')
})

Use Predicates for Flexibility

// Custom character class with optimized predicate
const whitespaceOrComma = takeWhileChar(ch => 
  ch === ' ' || ch === '\t' || ch === '\n' || ch === ','
)

Performance Tips

Use skipWhitespace() instead of manyWhitespace() when you don’t need the characters
Prefer many1Digit() over many1(digit) for parsing numbers
Use takeWhileChar() with simple predicates instead of complex parser combinations
Choose anyOfStrings() for keyword matching instead of multiple or() combinators with string()
Use takeN() for fixed-length fields instead of counting with count()

Migration Guide

Replacing generic combinators with optimized versions:

// Before
const digits = many(digit).map(ds => ds.join(''))
const letters = many1(alphabet).map(ls => ls.join(''))
const ws = many(or(char(' '), char('\t'), char('\n')))

// After
const digits = manyDigit().map(ds => ds.join(''))
const letters = many1Alphabet().map(ls => ls.join(''))
const ws = manyWhitespace()

​When to Use Optimized Combinators

​Performance Comparison

​Character Repetition

​manyChar

​manyDigit

​many1Digit

​manyAlphabet

​many1Alphabet

​manyAlphanumeric

​many1Alphanumeric

​Whitespace

​manyWhitespace

​skipWhitespace

​Character Sets

​oneOfChars

​anyOfStrings

​Predicates

​takeWhileChar

​takeUntilChar

​Fixed Length

​takeN

​Best Practices

​Choose the Right Combinator

​Combine with Generic Combinators

​Use Predicates for Flexibility

​Performance Tips

​Migration Guide

When to Use Optimized Combinators

Performance Comparison

Character Repetition

manyChar

manyDigit

many1Digit

manyAlphabet

many1Alphabet

manyAlphanumeric

many1Alphanumeric

Whitespace

manyWhitespace

skipWhitespace

Character Sets

oneOfChars

anyOfStrings

Predicates

takeWhileChar

takeUntilChar

Fixed Length

takeN

Best Practices

Choose the Right Combinator

Combine with Generic Combinators

Use Predicates for Flexibility

Performance Tips

Migration Guide