- Type of document:
- International standard
- ISO TC37/SC4
- ISO 24614-2, "Word segmentation of written texts -- Part
2: Word segmentation for Chinese, Japanese and Korean," specifies rules for
delineating WSUs for Chinese, Japanese and Korean. Some rules are common to all three
languages, though each language also has its own distinct rules for identifying WSUs. The
common features are discussed, then the distinct rules are laid out for Chinese, for Japanese
and for Korean.
- Workflow stage:
- Business value:
- The many applications and fields that need to segment texts into words include translation,
content management, speech technologies, computational linguistics and lexicography.
- Web site:
Send feedback or questions to: firstname.lastname@example.org