Related tools
Why use text statistics?
Check whether sentences run long, whether words are unusually stretched, or how varied your vocabulary looks—before you run heavier NLP or send copy to an editor.
Benefits
- Averages: mean word and sentence size at a glance.
- Extremes: spot outliers in length.
- Richness: simple unique-vs-total percentage.
- Live: metrics update while you type.
- Private: client-side only.
How it works
JavaScript splits and arithmetic—appropriate for English-like plaintext, not a substitute for linguistics tooling.
What it shows
- Words: `trim` then split on whitespace.
- Sentences: split on . ! ? with leftovers trimmed.
- Longest/shortest word: by raw token string length.
- Longest/shortest sentence: by word count inside each segment.
- Richness: `Set` size of normalized forms over token count.
When to use
Draft reviews, student papers, newsletters, and quick comparisons between two pasted versions.
Ideal use cases
- Editing: catch one mega-sentence.
- Style: see if averages drift between sections.
- Teaching: demonstrate naive metrics.
- QA: before specialized readability scores.
- Privacy: offline-first drafts.
Facts
Limitations.
Key points
- Mean sentence length uses the naive sentence count—empty splits are filtered.
- Longest sentence in the UI may be truncated for layout after 100 characters.
- Hyphenated words and contractions count as one token each.
- Non-Latin scripts may affect normalization in the richness set.
- Very large pastes may hit browser memory limits.
Best practices
Use with judgment.
Quality considerations
- Pair with a dedicated readability tool for graded scores.
- For legal or academic counts, follow the required definition of “word.”
- Clean pasted markup to plain text first for fair metrics.
- Compare drafts using the same paste hygiene.
- Don’t treat richness as SEO quality by itself.
When not to rely on it
- When sentence boundaries must match a publisher stylesheet.
- When you need syllable, lemma, or n-gram statistics.
- As a replacement for plagiarism or SEO suites.
Limitations and compatibility
Heuristic metrics; plain textarea input; requires JavaScript.
Text statistics run fully in your browser with no server upload; averages and language metrics refresh instantly as text changes.
Frequently asked questions
Is the text statistics tool free?
Yes. Analysis runs in your browser. No registration or upload.
What is vocabulary richness here?
Unique word forms (lowercased, non-word characters stripped for the uniqueness key) divided by total word count, times 100. It is a simple type–token ratio, not a linguistic type–token ratio (TTR) with full morphology.
How are sentences detected?
The text is split on ., !, and ?. Abbreviations and decimals can create extra sentence segments.
Why do word lengths include punctuation?
Tokens are taken as-is from whitespace splitting, so trailing commas or quotes increase character counts.
Is my text secure?
Yes. Processing stays on your device.
How is this different from a word counter?
A word counter totals words. Here you also get averages, extremes, and a uniqueness percentage.