How to detect text file encoding in Perl

Discussion in 'Perl Misc' started by chaojen.chen, May 20, 2006.

  1. There's an ANSI standard that authoritatively refutes that claim; an
    IANA assignment (citing
    RFC1345) which codifies the meaning of the term as it's to be used on
    the Internet; and, in more practical terms, there's a whole body of
    IETF standards-track RFCs whose meaning would be destroyed if "ASCII"
    did not mean what it means: a formally-defined 7-bit encoding
    *standard*, ANSI X3.4, and its ISO 646 counterpart.

    Other (mis)usages of the term by non-specialists are widespread, I
    know, but they're still authoritatively wrong, whatever you or I might
    happen to think personally.
    In this case I'd agree with it; but that's hardly the world's most
    authoritative source of information.

    I thought that needed to be placed on the record, but now I'll try to
    resist any further trolling attempts. :-{
    Alan J. Flavell, May 22, 2006
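
The strict definition in the post above (ASCII as the 7-bit standard ANSI X3.4) gives a simple first test when guessing a file's encoding: data containing any byte above 0x7F cannot be ASCII. A minimal Perl sketch follows; the helper name `is_7bit_ascii` is made up for illustration and is not from the thread.

```perl
use strict;
use warnings;

# Hypothetical helper: true when every byte of $bytes lies in the
# 7-bit range 0x00-0x7F, i.e. the data could be ASCII in the strict sense.
sub is_7bit_ascii {
    my ($bytes) = @_;
    return $bytes !~ /[^\x00-\x7F]/ ? 1 : 0;
}

print is_7bit_ascii("plain text\n") ? "7-bit\n" : "8-bit\n";
print is_7bit_ascii("caf\xE9\n")    ? "7-bit\n" : "8-bit\n";
```

Passing this test only shows that the data is compatible with ASCII; failing it says nothing about which 8-bit or multi-byte encoding is actually in use.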
  2. Ilya Zakharevich wrote:

    In Microsoft Windows, the phrase "ANSI" refers to the Windows ANSI code
    pages. Most of these are fixed width, though there are some
    variable-width ones for ideographic languages. Some of these are very
    close to the ISO-8859 series, leading many to falsely assume that they are
    Dr.Ruud, May 22, 2006
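
To see how close, and yet not identical, a Windows code page can be to its ISO-8859 relative, the same byte can be decoded under both and the resulting code points compared. This is a sketch using the core `Encode` module; the helper name `code_point` and the choice of cp1252 versus ISO-8859-1 as the pair to compare are assumptions made for illustration.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical helper: decode a single byte under the named encoding
# and return the Unicode code point it maps to.
sub code_point {
    my ($encoding, $byte) = @_;
    return ord(decode($encoding, chr($byte)));
}

for my $byte (0x80, 0x93, 0xE9) {
    printf "byte 0x%02X: ISO-8859-1 U+%04X, cp1252 U+%04X\n",
        $byte,
        code_point('iso-8859-1', $byte),
        code_point('cp1252',     $byte);
}
```

In this sketch the bytes 0x80 and 0x93 decode to C1 control characters under ISO-8859-1 but to printable characters under cp1252, while 0xE9 maps to the same character in both, which is why the two are so easily confused.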
  3. Not enough. When I was trying to find out why a friend's Mac "won't
    work", I found that the OS X docs also use "ASCII" to mean something
    unfathomable... [BTW, the solution was that OS X file systems just
    do not accept non-UTF-8 file names, but I did not find that
    documented anywhere when it mattered.]

    So it is not only M$ that muddies the water...

    Ilya Zakharevich, May 22, 2006
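
If a file system does reject names that are not valid UTF-8, as the post above reports for OS X, a name can be checked before it is used. This is a minimal sketch using strict decoding from the core `Encode` module; the helper name `is_valid_utf8` is hypothetical, and the file-system behaviour itself is taken from the post rather than verified here.

```perl
use strict;
use warnings;
use Encode qw(decode FB_CROAK LEAVE_SRC);

# Hypothetical helper: true when $bytes is a well-formed UTF-8 byte sequence.
# FB_CROAK makes decode() die on malformed input; LEAVE_SRC keeps $bytes intact.
sub is_valid_utf8 {
    my ($bytes) = @_;
    my $ok = eval { decode('UTF-8', $bytes, FB_CROAK | LEAVE_SRC); 1 };
    return $ok ? 1 : 0;
}

for my $name ("caf\xC3\xA9.txt", "caf\xE9.txt") {
    printf "%s\n", is_valid_utf8($name) ? "valid UTF-8" : "not valid UTF-8";
}
```

The first name encodes "é" as the two-byte UTF-8 sequence C3 A9; the second uses the single ISO-8859-1 byte E9, which is not a complete UTF-8 sequence on its own.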
