
Sentence Boundary Detection (SBD).

Split text into sentences with a `vanilla` rule-based approach (i.e. one that works ~95% of the time).

  • Splits text on periods, question marks, and exclamation marks.
  • Skips (most) abbreviations (Mr., Mrs., PhD.).
  • Skips numbers and currency amounts.
  • Skips URLs, websites, email addresses, and phone numbers.
  • Counts an ellipsis and ?! as a single punctuation mark.
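
The rules above can be illustrated with a minimal sketch. This is not the library's own implementation; the function name `split_sentences`, the `ABBREVIATIONS` set, and the "next word must not start lowercase" check are assumptions made for illustration. It only looks at punctuation at the end of a whitespace-separated token, so periods inside numbers, URLs, and email addresses are ignored, and a run such as `...` or `?!` counts as one boundary.

```python
import re

# Hypothetical, deliberately short abbreviation list (an assumption for this
# sketch); a practical list would be much longer.
ABBREVIATIONS = {"mr", "mrs", "ms", "dr", "prof", "phd", "etc"}

# A token is a boundary candidate when it ends in one or more of . ! ?
# Runs such as "..." or "?!" therefore count as a single terminator.
TERMINATOR = re.compile(r"[.!?]+$")


def split_sentences(text):
    """Split text into sentences using simple token-level rules."""
    tokens = text.split()
    sentences, current = [], []
    for i, token in enumerate(tokens):
        current.append(token)
        # Punctuation inside a token ("$3.50", "example.com") is not at the
        # end of the token, so it never triggers a split.
        if not TERMINATOR.search(token):
            continue
        # Skip known abbreviations such as "Mr." or "PhD."
        if token.rstrip(".!?").lower() in ABBREVIATIONS:
            continue
        # Assumed heuristic: do not split if the next word starts lowercase.
        next_token = tokens[i + 1] if i + 1 < len(tokens) else ""
        if next_token[:1].islower():
            continue
        sentences.append(" ".join(current))
        current = []
    if current:
        # Keep trailing text that has no final terminator.
        sentences.append(" ".join(current))
    return sentences


if __name__ == "__main__":
    sample = ("Mr. Smith paid $3.50 at example.com. Was it worth it?! "
              "Wait... Mail bob@example.com.")
    for sentence in split_sentences(sample):
        print(sentence)
```

A sketch like this shares the limits noted above: an abbreviation missing from the list, or a sentence that starts with a lowercase word, will be split incorrectly. Closing quotes and brackets after the terminator are not handled.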