Dohyun Kim wrote:
2009/3/2 Hans Hagen:
Yanrui Li wrote:
There are some Chinese punctuation marks, such as "," (fullwidth comma), "!" (fullwidth exclamation mark) and so on, which are considered to be "chinese" but not "full_width_close" by scrp-cjk.lua. Is this deliberate?

we have different vectors for chinese and korean and each of them can be optimized, or there can be several for each script
keep in mind that when korean uses chinese, it does not use the chinese rules
We Koreans seldom use those Chinese punctuation marks, nor do we normally use characters from the U+FF00 .. U+FFFF area. So it's OK to delete this line in scrp-ini.lua:
for i=0x0FF00,0x0FFEF do hash[i] = "chinese" end
as requested by Chinese people.
Or it would be better if the order were reversed, that is to say, if the line mentioned above and the other adjacent lines came before the hash table of individual characters.
then we'd need to do each assignment individually; so instead i now have

for i=0x0FF00,0x0FFEF do if not hash[i] then hash[i] = "chinese" end end

i.e. a test before an assignment so that the first table takes precedence.

-----------------------------------------------------------------
Hans Hagen | PRAGMA ADE
Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
tel: 038 477 53 69 | fax: 038 477 53 74 | www.pragma-ade.com
| www.pragma-pod.nl
-----------------------------------------------------------------
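
For readers following the thread, here is a minimal standalone sketch of why the guarded loop lets the individual assignments win. It is not the actual ConTeXt source: the `build` helper and the two sample entries are hypothetical, and it assumes the per-character table has already been filled before the range loop runs. Only the category names "chinese" and "full_width_close" and the range 0xFF00 .. 0xFFEF come from the messages above.

```lua
-- Hypothetical helper: builds a small script hash either with the
-- unconditional range fill or with the guarded one discussed above.
local function build(guarded)
    local hash = { }

    -- Individual assignments made first (sample entries, assumed for
    -- illustration only).
    hash[0xFF0C] = "full_width_close" -- fullwidth comma
    hash[0xFF01] = "full_width_close" -- fullwidth exclamation mark

    -- Range fill over 0xFF00 .. 0xFFEF. When guarded, slots that already
    -- hold a value are left alone; otherwise they are overwritten.
    for i = 0xFF00, 0xFFEF do
        if not guarded or not hash[i] then
            hash[i] = "chinese"
        end
    end

    return hash
end

local plain   = build(false)
local guarded = build(true)

print(plain[0xFF0C])   -- overwritten by the unconditional loop
print(guarded[0xFF0C]) -- kept from the individual assignment
print(guarded[0xFF21]) -- fullwidth Latin capital A, filled by the range
```

With the unconditional loop the fullwidth comma ends up as "chinese"; with the test in place it keeps "full_width_close", while code points that have no individual entry still fall back to "chinese".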