Normalize Unihan Z-variants

Discussion in 'Perl Misc' started by Dan Jacobson, Nov 30, 2006.

  1. Dan Jacobson

    Dan Jacobson Guest

    How am I to "normalize" a document full of the latter into the former?
    U+4E32 kZVariant U+F905
    U+4E86 kZVariant U+F9BA
    U+516D kZVariant U+F9D1
    U+5317 kZVariant U+F963
    U+53C3 kZVariant U+F96B...

    Unicode::Normalize apparently is talking about a different kind of
    normalization, not these CJK compatibility ideographs.

    From reading {perluniintro perlunicode perlre Encode::Unicode
    Unicode::UCD} one would think one needs to make a regular expression to
    replace any character in the CJK compatibility ideographs block with a
    lookup of its kZVariant base character.

    Certainly there is some Debian package I can just download?
    Dan Jacobson, Nov 30, 2006
    #1
    1. Advertising

Want to reply to this thread or ask your own question?

It takes just 2 minutes to sign up (and it's free!). Just click the sign up button to choose a username and then you can ask your own questions on the forum.
Similar Threads
  1. arnold
    Replies:
    1
    Views:
    589
    arnold
    Mar 5, 2006
  2. Christos TZOTZIOY Georgiou

    unicodedata . normalize (NFD - NFC) inconsistency

    Christos TZOTZIOY Georgiou, Nov 8, 2004, in forum: Python
    Replies:
    3
    Views:
    872
    Christos TZOTZIOY Georgiou
    Nov 10, 2004
  3. AndyL
    Replies:
    6
    Views:
    414
    John Machin
    May 25, 2006
  4. =?iso-8859-1?B?TWF0dGlhcyBCcuRuZHN0cvZt?=

    Vector, matrix, normalize, rotate. What package?

    =?iso-8859-1?B?TWF0dGlhcyBCcuRuZHN0cvZt?=, Feb 27, 2007, in forum: Python
    Replies:
    5
    Views:
    6,288
  5. Mike
    Replies:
    0
    Views:
    394
Loading...

Share This Page