Improve Word Count Support For Non-Latin Characters (e.g. Asian characters) Potentially by Showing Character Count | World Anvil

Remove these ads. Join the Worldbuilders Guild

Improve Word Count Support For Non-Latin Characters (e.g. Asian characters) Potentially by Showing Character Count

Feature Upgrade · Articles & templates · Created by Harold.W
accepted
Article Word-Count

What functionality is missing? What is unsatisfying with the current situation?

  Article word count is currently incorrect for languages like Chinese (I don't know other non-Latin languages thus unable to provide more examples). The counted number is much lower than the actual one. After I've written over 5K Chinese characters/words, the world word count is only 1.5K.   For instance, article with content of "中文1 中文2 English1 English2" is counted as 2 words in WorldAnvil, while in MS Word, it has:
  • 8 words ("中文1 中文2" counts as 6 words)
  • 22 characters without space
  • 4 Asian characters ("中" x 2 and ”文")
It would be ideal if WorldAnvil is able to count 8 word out of it. But being able to show "4 Asian characters" is still going to be very helpful.

How does this feature request address the current situation?

  Word count is an extremely useful feature in WorldAnvil. Not being able to use it when writing articles in unsupported languages is a huge pain.    One potential mitigation is probably to also display or allow switching between word and character count. At least in my use case, being able to see character count is an acceptable state, since it pretty much gives the correct number when the whole article is in Chinese.

What are other uses for this feature request?

  I don't speak other languages that have the similar issue but I would guess at least Japanese and Korean are likely to face the same problem.

Follow up


Addressing a question in the votes:   > the MSWord result looks inflated? Shouldn't it output 4?   It doesn't. The common way of counting "words" of text in Chinese is counting "characters", since there's no space between Chinese "words". The "8 words" result from MSWord is by counting Chinese characters and English words separately, which is accurate.   That said, I don't know the complexity of reaching this level of accuracy. But being able to just see the character count in addition to the existing word count can be also very helpful (hopefully this is less difficult to implement).  

The Team's Response

Partially accepted! We won't add a wordcount for writing systems not currently supported at this point, but we will add a character counter that will support them. Thanks for the suggestion!
Current score

35/300 Votes · +5400 points

Votes Cast

  • +100

    by A Revolutionary Mlem
    on 2023-06-19 22:30
  • -300

    by A Uncontrollable Kobold
    on 2023-06-19 16:54
    Word count seems like such a non issue and a waste of dev time.
  • -1

    by A Rambunctious Velociraptor
    on 2023-06-18 21:46
    Putting any level of focus on word count feels kinda toxic to me...
  • +300

    by gcjones216
    on 2023-06-18 16:42
  • +300

    by Athevra
    on 2023-06-16 23:32
  • +300

    by morganarcher
    on 2023-06-14 01:51
  • +100

    by JRLoving
    on 2023-06-13 22:24
  • +300

    by InformationMagpie
    on 2023-06-13 16:09
  • +300

    by Mochimanoban
    on 2023-06-10 13:58
  • +300

    by A Roaring Unicorn
    on 2023-06-10 03:50
  • +300

    by EBJ
    on 2023-06-09 23:09
  • +300

    by cow2face
    on 2023-06-09 07:08
  • +100

    by LoreParmenter
    on 2023-06-08 18:51
  • +300

    by A Beloved Elf
    on 2023-06-08 17:36
  • -1

    by A Beloved Mimic
    on 2023-06-08 10:18
  • +300

    by barriesaxxy
    on 2023-06-08 06:22
    World Anvil is international. Makes sense to maximize support for non-English languages.
  • +300

    by illumiinae
    on 2023-06-07 20:59
  • +100

    by Rilameth
    on 2023-06-07 17:33
  • +100

    by Michael Chandra
    on 2023-06-06 05:20
    Since WorldEmber requires 51+ words and Summercamp 300+, I agree that counting entire sentences as a single word does sound limiting.
  • +1

    by storyauthor
    on 2023-06-06 04:48
  • +300

    by Dalf32
    on 2023-06-06 01:16
  • +100

    by FrostFortyTwo
    on 2023-06-05 14:04
  • -300

    by jorre998
    on 2023-06-05 11:44
  • +1

    by A Rambunctious Cthulhu
    on 2023-06-05 09:59
    Considering that word count is often used as part of challenges, making sure it works well for all languages/alphabets just makes sense. (plus it's overall just a useful feature)
  • +300

    by A Revolutionary Goblin
    on 2023-06-04 16:57
  • -100

    by A Cute Kitten
    on 2023-06-04 16:45
    Not sure why word count would be important to anyone. The exact number of words means literally nothing vs. quality or what you actually want to say. This kinda seems like a waste of dev time and effort. Sorry.
  • +100

    by Alishahr
    on 2023-06-04 12:14
  • +1

    by A Enfeebled T-Rex
    on 2023-06-04 09:50
  • +100

    by Willow H.R. Harper
    on 2023-06-04 08:21
  • +300

    by Angantyr
    on 2023-06-04 07:00
    Adding support to non-Latin-alphabet-based languages could be a step towards helping users, so I'm all hands for. I'm curious as to how the current system works — it doesn't seem like a simple split on whitespace characters.   Still, the MSWord result looks inflated? Shouldn't it output 4?
  • +300

    by Matunas
    on 2023-06-04 02:36
  • +100

    by UnknownWriter88
    on 2023-06-04 01:28
    this would be good for all language characters, not just Asian; ancient and modern alike.
  • +300

    by BasicDragon
    on 2023-06-04 00:22
  • -1

    by A Frightened Mlem
    on 2023-06-03 22:31
    couldn't care less even if i tried
  • +100

    by A Mischievous Goblin
    on 2023-06-03 22:23
    I believe WorldAnvil also has difficulty with Cyrillic.   This would be a great addition, just for the inclusivity of it. You are not the first to ask about this.
  • +300

    by Harold.W
    on 2023-06-03 22:13