Try the Word & Character Counter

Translation Text Expansion: Why German Runs Longer, Chinese Runs Shorter, and Your Layout Needs to Survive Both

A 1,000-word English document becomes roughly 1,300 words in German and roughly 700 words in Chinese β€” predictable averages that matter enormously for UI layout and print/subtitle budgeting, but can deviate wildly for short strings. Here's the commonly-cited expansion ratios by language, why pseudo-localization testing catches layout issues before translation even begins, and how expansion affects print page counts and video subtitling/dubbing differently than per-word translation costs.

By sadiqbd Β· June 16, 2026

Share:
Translation Text Expansion: Why German Runs Longer, Chinese Runs Shorter, and Your Layout Needs to Survive Both

A 1,000-word English document can become 1,300 words in German, 700 words in Chinese, and somewhere unpredictable in between for almost everything else β€” and if your design/budget assumes "translation = same length," every localized layout and every per-word translation invoice will be wrong

Text expansion and contraction ratios β€” how much longer or shorter a translated text becomes compared to its source β€” are a well-documented, predictable-on-average phenomenon in localization, with significant practical implications for UI design, translation budgeting, and content planning for multilingual sites.


Why translated text changes length

Structural differences between languages: some languages routinely require more words/characters to express the same meaning as English β€” German's compound words and grammatical case markers, Romance languages' (French, Spanish, Italian, Portuguese) tendency toward longer words and more frequent use of articles/prepositions, and various languages' verb conjugation systems that add length compared to English's relatively minimal inflection.

Other languages routinely compress: Chinese, Japanese, and Korean (CJK languages) often render the same content in significantly fewer characters than English β€” a single Chinese character can convey what requires multiple English words, and these languages don't use spaces between words the way English does, further reducing character counts compared to English's word-plus-space structure.


Commonly-cited expansion ratios (general guidance, not precise rules)

These are general, commonly-cited approximate ranges used in localization planning β€” actual ratios vary by specific content, translator, and language pair, but provide a starting point for planning purposes:

  • German, Finnish, and several other languages with compound-word-heavy structures: often cited around +20% to +35% longer than English
  • French, Spanish, Italian, Portuguese, and other Romance languages: often cited around +15% to +30% longer than English
  • Russian and other Slavic languages with extensive case systems: often in a similar +15-30% range, though specific figures vary by source
  • Chinese, Japanese, Korean: often cited as -30% to -50% shorter than English, in character count β€” though comparing "word count" across languages that don't use spaces the same way as English (CJK languages) is itself a somewhat apples-to-oranges comparison; character count is often the more meaningful metric for these languages, rather than "word" count

The critical caveat: these are broad averages, not guarantees for any specific sentence/document. A specific short phrase might expand far more (or less) than the "average" ratio for its language β€” particularly for short strings (UI button labels, navigation menu items), where the "average" ratio, computed over longer documents, may not hold for an individual, very short string, which could expand by a much larger percentage (a 2-word English label becoming a 5-word translated label is a much larger percentage increase than the "typical" document-level ratio would suggest, even if it's only a few additional characters in absolute terms).


UI/UX design implications: "pseudo-localization" testing

A practical technique used in software localization testing: "pseudo-localization" β€” before actual translation occurs, UI strings are programmatically expanded (e.g., by padding each string with extra characters to simulate a worst-case expansion ratio β€” sometimes specifically targeting +30-40% length increases, reflecting the higher end of commonly-cited expansion ranges for target languages) β€” and the UI is then tested with these artificially-expanded strings, before any actual translation work has occurred*.

What pseudo-localization catches:

  • Buttons/labels that truncate or overflow when text is longer than the original English β€” revealing layout elements that were sized assuming "English-length" text, without accommodation for longer translations
  • Fixed-width containers that don't wrap/resize gracefully β€” text that overflows its container, gets cut off, or breaks the layout (overlapping other elements) when longer than anticipated

Why test before translation, rather than after: catching layout issues during the pseudo-localization phase (which can be done by developers, without needing actual translators/translated content) is much cheaper than discovering the same issues after real translation has been completed β€” at which point, fixing a layout issue might require going back to translators to request shorter alternative phrasings (additional translation cost/cycles), rather than simply adjusting the UI layout itself (a development-only fix, achievable during pseudo-localization testing, before translation cost has even been incurred).


Translation budgeting: per-word costs and the expansion factor

Many translation services charge per source word (i.e., based on the English, or whatever the source language is, word count) β€” rather than per target-language word β€” meaning the expansion ratio doesn't directly affect the cost of the translation itself (a 1,000-source-word document costs the same, regardless of whether the translated output is 1,300 words or 700 words) β€” but it affects downstream costs/considerations:

  • Print/physical materials with fixed page counts β€” a German translation that's 30% longer than the English source might not fit within the same page count, requiring either smaller font sizes, additional pages (cost implications for printing), or editing for brevity (additional translation-review cost)
  • Subtitling/dubbing β€” for video content, translated dialogue that's significantly longer than the original (when spoken at a natural pace) creates timing mismatches β€” subtitles that can't be read in the time the original line is displayed, or dubbed audio that doesn't match the original speaker's mouth movements/scene timing β€” requiring transcreation (adapting, not just translating, content specifically for length/timing constraints) β€” a more involved, more costly process than straightforward translation

Estimating localized content length before translation (for layout/budget planning)

For a new multilingual project, before any translation exists: applying the general expansion-ratio figures (above) to your English (or source-language) content's word/character count β€” via this site's Word Counter tool β€” provides a rough estimate of what range of lengths to expect for specific target languages β€” useful for:

  • Estimating layout space requirements for UI elements, before committing to fixed-width designs that might prove too narrow
  • Estimating page-count impacts for print materials, before finalizing print budgets/page counts based only on source-language length

This remains a rough estimate, not a substitute for pseudo-localization testing (for UI) or consultation with translation/localization specialists (for print/video projects with specific, firm constraints) β€” but provides a useful starting point for initial planning discussions, before more detailed processes are engaged.


How to use the Word & Character Counter on sadiqbd.com

  1. Establish baseline counts for source-language content (English, or whichever language your content originates in) β€” word count and character count, both relevant depending on which expansion-ratio figures/considerations apply to your target languages
  2. Apply general expansion-ratio estimates to these baselines for rough planning purposes β€” recognizing the significant caveat that short strings can deviate substantially from "document-level" average ratios
  3. For UI strings specifically: rather than relying solely on ratio-based estimates for individual, short strings β€” pseudo-localization testing (expanding strings programmatically before translation, and testing the UI with these expanded placeholders) is the more reliable approach for catching layout issues specific to short, individual UI elements

Frequently Asked Questions

Do these expansion ratios apply equally to all types of content (technical documentation, marketing copy, UI strings)? No β€” the commonly-cited ratios are generally derived from/applicable to general prose (documents, articles) β€” UI strings (short labels, button text) can show much more variable, often more extreme (proportionally), expansion because of their short length (as discussed β€” a small absolute increase in character count represents a large percentage increase for a very short string) β€” and marketing/creative content, where transcreation (rather than direct translation) is often used, may not follow "expansion ratio" logic at all, since transcreation deliberately prioritizes impact/tone over length-matching to the source, potentially producing content of very different length than either "direct translation" or "expansion-ratio-based" estimates would suggest.

Is the Word & Character Counter free? Yes β€” completely free, no sign-up required.

Try the Word & Character Counter free at sadiqbd.com β€” count words, characters, and reading time for any text instantly.

Share:
Try the related tool:
Open Word & Character Counter

More Word & Character Counter articles