Similar hanzi list and Anki deck

Similar Chinese characters cause problems for every learner of the language. Hanzi are a complex enough writing system as it is, without throwing in little hurdles like 日 and 曰 being totally different things.

With that in mind, I’ve compiled a pretty hefty list of similar hanzi (and similar hanzi components). It’s heavily sourced from here and here, but there’s quite a lot of other material in there too. I want to keep editing it to get it as useful as possible.

You can download the list here:

If you have any additions, corrections or objections, please share them in the comments!

I’ve tried to edit the Wiktionary list to remove unhelpful entries and add in missing ones. If an entry seems too obvious, bear in mind that they might appear as small components in characters and then be easier to confuse.

In other words, some pairs might be easy to mix up in some contexts for some people.You can easily delete them from your copy of the Anki deck if you don’t feel the need to study them.

Having said that, some of them may be useless, so please point out those you think can reasonably be removed from the list (taking into account complete beginners and very advanced learners of Chinese).

The list is arranged in pairs. This might seem an odd way to arrange it at first, as it doesn’t allow longer sequences of similar hanzi to be grouped together. Each pair is distinguished by a ‘label’, which is usually pinyin unless the pinyin is identical for a pair, in which case a single-word meaning is given for each item.

But there is a big advantage in the consistency of grouping by pairs. It makes the data much easier to organise, edit and work with, particularly when it comes to automating tasks with it. It also makes it easy to import into Anki, which I’ve already done for you :)

And here’s the list as a huge HTML table (↓ skip to end ↓):

Hanzi 1 Hanzi 1 label Hanzi 2 Hanzi 2 label
ài nuǎn
ài shòu
ài yuán
āi sāng
āi shuāi
āi zhōng
ang mǎo
ang luǎn
áng mǎo
áng yìn
ào yuè
yǒu
bái jiù
bái
bàn biàn
bàng pěng
bèi jiàn
bèi jiàn
bèi
bèi péi
tuán
便 biàn 使 shǐ
biàn xià
biàn cǎi
bīng shuǐ
bīng shuǐ
bīng yǒng
péi
cāng lún
céng huì
chá xiāng
chá xìng
chá yǎo
chá nài
chá
chà zháo
chǎ
chǎn yán
chān chán
chǎng 广 guǎng
chè
chē dōng
chē dōng
chén jiān
chén
chéng guāi
chéng
chéng
chéng shù
chéng róng
chéng jiè
chéng yuè
chéng (deputy) chéng (to bear)
chí
chì jīn
chì
chǒng páng
chōng lào
chóu děng
chōu
chuài chuǎi
chuán háng
chuāng xìn
chūn fèng
chūn quán
chūn quàn
jiǎn
shù
cōng
cōng
dài
dài lìn
dài shì
dǎi
dàn gèn
dān fán
dān tǎn
dān zhōu
dāo wàn
dēng guǐ
zhǐ
shì
diǎn
diào jīn
diào shì
diāo
diāo yún
diāo sháo
dōng guǒ
dōng
dōng shù
duàn jiǎ
dùn zhì
duǒ shū
è wēi
è zhī
ēn
yǒu
fán zhōu
fāng wàn
fāng
féng fèng
féng shèng
fèng quán
fèng quàn
fèng shèng
fēng shī
fēng shǒu
fǒu
巿 shì
jiāo
xià
zài
tiān
yāo
zhuān
fù (double) fù (to return)
gǎn huò
gān qiān
gān
gān 廿 niàn
gāng wǎng
gāng
gāng xiōng
jiān
gèn xuán
gěn liáng
gěn mín
gēn láng
gèng
gèng shǐ
gǒng
gǒng
gōng guān
gòu héng
gòu huáng
gōu kuàng
gōu
qiào
guǎ lìng
guǎ shào
guā zhuǎ
guài yāng
guài
guǎi zhāo
guàn guì
guǐ
guǒ
hài xuán
zhī
hēi xūn
hěn láng
hēng xiǎng
hóu hòu
hóu shū
hóu tiáo
hòu jùn
hòu shū
hòu tiáo
shī
píng
huà shū
huà zhōu
huà shū
huà zhòu
huài huán
huàn miǎn
huàn
huàn yāng
huàn yòu
huáng xīng
huáng máng
huí yīn
huí
huī wēi
huī zhēng
huò yuè
shòu
sǔn
xiào
sài
jiǎ shēn
jiǎ tián
jiǎ xiù
jiǎ yóu
jiā méng
jiā wǎng
jiā zhǒng
jiā zhù
jiā lái
jiā lái
jiǎn shù
jiān lín
jiān jǐn
jiàng
jiàng
jiǎo shuǎi
jiǎo tǒng
jiǎo yòng
jié jué
jié
jiè róng
jiè
jiè
jiè shù
jiè róng
jìn
jīn quán
jīn shì
jīn líng
jìng qíng
jǐng kāi
jīng shēn
jìng (compete) jìng (unexpected)
jiōng tóng
jiù
jiù yuē
jiǔ
jiū qióng
xún
juǎn xiàng
jué
jué
liào
kèn láng
shè
kùn yīn
kùn xián
lán (basket) lán (blue)
láng yín
làng yín
léi luǒ
léi yíng
lèi zhù
lěi
lěi wèi
qún
shǐ
xiào
lì (history) lì (severe)
liáng mín
liáng (beam) liáng (sorghum)
liáo qīng
liào luò
lìng shào
liù xué
shī
luǎn mǎo
lún shū
lún
lún
lún
lún qiǎng
máo zhé
máo
mǎo yìn
méi 沿 yán
méng zhǒng
miàn
qìn
mián tóu
mián yǎn
miǎn
miǎn yāng
miè zāi
miē
mǐn xuě
míng
wèi
nài zōng
nài
nài (how) nài (crab-apple)
nèi róu
nèi shǎn
nián shū
niǎo
niǎo
niú shǒu
niú
nóng
suī
zhǐ
yuán
qiān
qīn xìng
qīn xīn
qìng yàn
qiú yǒu
xiōng
quán quàn
quán zhì
què sǔn
què xiè
quē què
quē xiè
rǎn zài
shú
rén
rén wáng
yuē
róng
róng
róng shù
róng yuè
róu shǎn
ròu yuè
sāng shuāi
sāng zhōng
shé
shěn chén
shēn tián
shēn yóu
shèng téng
shí yòu
shí zhǒu
shì
shì
shǐ suì
shǐ xiān
shǐ yāo
shī shǐ
shī xiān
shī yāo
shī shuài
shī shuài
shòu sǔn
shòu yuán
shù
shù
shù
shù
shù yuè
shū zhòu
shū tiáo
shū zhòu
shuǎ yào
shuǎi yòng
shuāi zhōng
shuǐ yǒng
shùn
xīn
suǒ
suì zhú
suī zhǐ
xīn
xìng
zhái
tài xióng
xiū
tián yóu
tiān
tiān yāo
tíng yán
tíng yán
yāng
tuō (prop) tuō (pull)
wáng
wǎng
wǎng zhù
wāng yín
wāng yóu
wèi zhū
wēi zhēng
wēi zhī
zhǎo
yāo
yín
yuè
yún
西 yǒu
xián huò
xiàn yuè
xiāng xìng
xiāng yǎo
xiāng yāo
xīn xìng
xìng yǎo
yuè
zhuāng
yáng
zhì
yīn xián
yún sháo
zàn zhé
zhù zhuī


↑ skip to top ↑


If you found this useful, consider helping me out in return.

Print Friendly

13 comments to Similar hanzi list and Anki deck

  • Interesting… It feels like this list was partially compiled for those that aren’t really studying Chinese. (Example: does anyone really confuse 食 and 退 or 太 and 六?)

    The pair I always found most annoying was 拔/拨.

    Also, looks like you have a 冡 where you meant to put a 家.

    • Yeah I was thinking that as I went through it. Also seems like it was compiled by a Japanese learner and not a Chinese learner, as the original Wiktionary page has tons of stuff that isn’t used in Chinese, or is just Japanese variants.

      Thanks for the error fix, that’s two within five minutes of publishing it! I’m hoping to edit and expand the list a bit then recompile the Anki deck. Then on with the studying!

  • Thomas

    For me 蓝 and 篮 are very similar, too.

  • Hugh,
    I’m curious where you got the pronunciation for 礻 (listed as zhǐ) from, since it is the radical variant of 示 and pronounced as shì I believe.

    See Wenlin reference here: 礻 [shì] (radical 113, component, same as 示 shì; 示字旁 shìzìpáng).

    Here are some others off the top of my head:

    勿 and 匆
    母 and 毋
    依 and 侬
    should add 巳 to the 己/已 list

    maybe add 甲 to the 申 area?

    I wonder if the list wouldn’t be better focused on simplified/ traditional problems, since some issues such as 依 and 侬 would never occur to someone studying traditional, but I don’t know how feasible that might be.

    I’ve got a few more in my teaching characters and theory book back home and I’ll probably add a few more once I take a look.

    Cheers,
    Jake

    • I second those characters! Those are definitely the types that would be useful.

    • Sorry that’s a mistake, it should be shì like you say. Corrected it now.

      Thanks for the additions, excellent stuff! Keep them coming :)

      Already got the 甲 / 申 and 母 / 毋; the list is in alphabetical order (to help spot duplicate pairs) so they’re not always grouped together.

      I think the list is most useful if it includes all possible mix-ups. Maybe each pair should be tagged for simp / trad? I study both character sets, which opens up a whole new avenue of cross-set confusion, so personally I’d like the list to have everything in it.

  • A friend and me compiled a similar list. Maybe it’s worth comparing if there are more similar ones included.

    http://code.google.com/p/eclectus/source/browse/trunk/libeclectus/data/similarcharacters.csv

  • Adrià Isern

    Thanks for the post, very helpful!
    You could add 兔 vs 免 vs 奂 (vs 央) ;)

  • Kaiwen

    “班 bàn 辨 biàn”

    First one should be 辦 bàn as in (full-form) 辦事, 辦 辨 is definitely a tricky pair, but not a problem in simplified.

  • [...] of Cambridge, U.K. I especially enjoyed his posts about how to handle Anki decks and his list on similar characters. An example? 贝 (bèi) means shellfish and 见 (jiàn) means to [...]

Leave a Reply

  

  

  

You can use these HTML tags

<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>