Re: ordering Japanese text

From(投稿者):	mab2001@gmail.com
Newsgroups(投稿グループ):	fj.comp.lang.perl,comp.lang.perl.misc
Subject(見出し):	Re: ordering Japanese text
Date(投稿日時):	6 May 2006 06:40:45 -0700
Organization(所属):	http://groups.google.com
References(祖先記事, 一番最後が直親):	(G) <1146753170.333788.54210@y43g2000cwc.googlegroups.com>
Message-ID(記事識別符号):	(G) <1146922845.464591.177650@g10g2000cwb.googlegroups.com>
Followuped-by(子記事):	(G) <4c5gkqF145s9lU1@uni-berlin.de>

From(投稿者):

mab2001@gmail.com

Newsgroups(投稿グループ):

fj.comp.lang.perl,comp.lang.perl.misc

Subject(見出し):

Re: ordering Japanese text

Date(投稿日時):

6 May 2006 06:40:45 -0700

Organization(所属):

http://groups.google.com

References(祖先記事, 一番最後が直親):

(G) <1146753170.333788.54210@y43g2000cwc.googlegroups.com>

Message-ID(記事識別符号):

(G) <1146922845.464591.177650@g10g2000cwb.googlegroups.com>

Followuped-by(子記事):

(G) <4c5gkqF145s9lU1@uni-berlin.de>

記事全体へのコマンド

After a discussion of this on the perl-i18n mailing list, I've come to
understand the problem a bit more. In Japanese, text ordering is based
on phonetization. But as in english, there are multiple pronunciations
of a particular piece of text. Moreover, the "more correct"
pronunciation among the possibilities is influenced by the context of
the text. So in other words, the problem is intractable if all you have
is the text alone and inefficient even if you have more information
(because of the myriad factors that influence pronunciation).

The solution that I am using then is to store with each piece of
kana/kanji text, a kana-only phonetization of that text. I then rely on
the content editors to know the context of the text and supply an
accurate phonetization in kana. (In other words, I'm putting the
responsibility on someone else!) There does exist a determinate
ordering of the kana-only text and so this becomes a tractable problem.

Mike

Fnews-brouse 1.9(20180406) -- by Mizuno, MWE <mwe@ccsf.jp>
GnuPG Key ID = ECC8A735
GnuPG Key fingerprint = 9BE6 B9E9 55A5 A499 CD51 946E 9BDC 7870 ECC8 A735