Trying to use "locale" with German and codepage 437 (DOS)

  • Thread starter mathias wündisch
mathias wündisch

Hi,

I am looking for information about using the Perl pragma "locale". The
reason: I have old input data encoded in codepage 437, and I want
to use the usual regex escapes \b (word boundary) and \w
(word character) on it.
I found a lot of sites about POSIX locales under Linux, but I work under
Windows XP. Am I the first with this problem?
I want to tell Perl which bytes between 32 and 255 are alphabetic
characters (\w) and which are not (\W).
Thanks for any ideas.

bye, mathias
 
jm

What about using the Encode module? Its decode function turns your
IBM CP437 bytes into Perl character strings (UTF-8 internally), and from
there Perl's regexes apply Unicode rules to \w and \b transparently.
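
A minimal sketch of that suggestion, assuming the data arrives as raw CP437 bytes. The helper name cp437_words and the sample string are made up for illustration; decode, encode and the encoding name 'cp437' come from the standard Encode module.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Hypothetical helper: decode raw CP437 bytes and return the words found.
sub cp437_words {
    my ($bytes) = @_;

    # 'cp437' is Encode's name for the IBM/DOS code page 437.
    my $text = decode('cp437', $bytes);

    # On the decoded character string, \w follows Unicode rules, so
    # umlauts and sharp s count as word characters.
    return $text =~ /(\w+)/g;
}

# Sample input (an assumption): a German phrase as CP437 bytes,
# where 0x81 is u-umlaut, 0xE1 is sharp s and 0x94 is o-umlaut.
my $raw   = "Gr\x81\xE1e aus K\x94ln";
my @words = cp437_words($raw);

# Encode again before printing; UTF-8 here, but use whatever the
# console or output file expects.
print encode('UTF-8', join('|', @words)), "\n";
```

When reading from a file, the same decoding can be done with an I/O layer instead, for example by opening the file with the mode '<:encoding(cp437)', so every line is already a character string when it reaches the regex.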
 
mathias wündisch

Dear jm, OK, this is a good idea! I will try it. Thanks for the
suggestion. M.
 
