parse newline

ela · Feb 8, 2009

It's always a nightmare for me to parse newline characters. No matter \n,
10, 13, I just don't know why some newlines are printed. As the file is
generated by another program, I cannot see the source code, and manual
inspection does not discover any abnormalty.

using chop, chomp or to use replace cannot help. Does anybody have
experience in handling this?

Martijn Lievaart · Feb 8, 2009

It's always a nightmare for me to parse newline characters. No matter
\n, 10, 13, I just don't know why some newlines are printed. As the file
is generated by another program, I cannot see the source code, and
manual inspection does not discover any abnormalty.

using chop, chomp or to use replace cannot help. Does anybody have
experience in handling this?

Something like:

chomp;
s/\x0d$//;

Should always do the trick. If not, you have something very strange going
on.

M4

Jürgen Exner · Feb 8, 2009

ela said:
It's always a nightmare for me to parse newline characters. No matter \n,
10, 13, I just don't know why some newlines are printed. As the file is
generated by another program, I cannot see the source code, and manual
inspection does not discover any abnormalty.

using chop, chomp or to use replace cannot help. Does anybody have
experience in handling this?

Your program is missing a semicolon on line 42.

jue

ela · Feb 8, 2009

Something like:

chomp;
s/\x0d$//;

Should always do the trick. If not, you have something very strange going
on.

Have already tried that before and it doesn't solve...

ela · Feb 8, 2009

Jürgen Exner said:
Your program is missing a semicolon on line 42.

jue

line 42????????

Well I've tried :

chop $identiy;
chomp ($identiy);
chop $identiy;
$identity =~ s/\x0d$//;

All fail.

Jürgen Exner · Feb 8, 2009

What character set are you using? None of the common ASCII-based
character sets (WIndows-1252, ISO-Latin-xxx, Unicode, ...) has a newline
character. See also below.

No matter \n,
10, 13, I just don't know why some newlines are printed. As the file is
generated by another program, I cannot see the source code, and manual
inspection does not discover any abnormalty.

Click to expand...

[...]
Your program is missing a semicolon on line 42.

Click to expand...

line 42????????

Apparently you've never read The Hitchhikers Guide through the Galaxy.

Long form: how do you propose us to fix your code without seeing it?
Have you seen the posting guidelines that are posted here twice a week?

Well I've tried :

chop $identiy;

This will remove the last character of $identiy.
How do you know that last character is actually the newline?

chomp ($identiy);

This will remove a trailing $/ from $identiy, whatever $/ may be set to
on your system (usually "\n").
Did you check that $/ matches the tail of $identiy?

chop $identiy;

This looks identical to the first line?

$identity =~ s/\x0d$//;

This is working on $identity instead of $identiy. Is that what you meant
to do?

One common problem are format incompatibilities between Windows, Mac,
and Unix. They use different characters/character combinations to denote
a line break. Therefore you should be very explicit about if you are
talking about a line feed character(LF), a carriage return
character(CR), or a logical newline entity of your OS.

Aside of that I suspect that you are looking at the wrong spot and your
real problem is somewhere else, like e.g. a misspelled variable name as
above

As strongly suggested in the posting guidelines please post a
self-contained, minimal program that demonstrates your problem, in your
case including some sample input data, preferable as a _DATA_ section.

jue

Cosmic Cruizer · Feb 8, 2009

It's always a nightmare for me to parse newline characters. No matter
\n, 10, 13, I just don't know why some newlines are printed. As the
file is generated by another program, I cannot see the source code,
and manual inspection does not discover any abnormalty.

using chop, chomp or to use replace cannot help. Does anybody have
experience in handling this?

You might be encountering NUL characters. I run into the NUL character
problem when working with Windows event logs. Try to use the following on
your data: =~ s/\0/\t/g; You can change "\t" to whatever you need.

Example:
# Get event data
my $streaingTest = $Event{Strings};

# Change NUL to tab for event data
$streaingTest =~ s/\0/\t/g;

....Cos

Martijn Lievaart · Feb 9, 2009

Have already tried that before and it doesn't solve...

Funny, did you try hexdumping the files?

M4

Martijn Lievaart · Feb 9, 2009

Have already tried that before and it doesn't solve...

BTW, I did mean both statements and in that order.

M4

Ted Zlatanov · Feb 9, 2009

e> Well I've tried :

e> chop $identiy;
e> chomp ($identiy);
e> chop $identiy;
e> $identity =~ s/\x0d$//;

e> All fail.

Assuming your text is in FILE.TXT:

od -t u1 -t a FILE.txt

What does your text show for the lines that are not correctly processed
by your program?

Ted

Tad J McClellan · Feb 10, 2009

Ted Zlatanov said:
Assuming your text is in FILE.TXT: ^^^
^^^
od -t u1 -t a FILE.txt

s/txt/TXT/;

How to ignore newline in Parse::RecDescent	10	Apr 24, 2010
Reading in cooked mode (was Re: Python MSI not installing, log fileshowing name of a Viatnemese comm	8	Mar 23, 2014
Replace <BR> with newline inside <PRE>	0	Jul 25, 2006
1.9 CSV Parsing Issues	5	Nov 4, 2010
reuse code inquiry	3	Dec 5, 2007
Sendmail, semicolons and new lines	5	May 2, 2010
show hidden value in variable.. with mysql	28	Apr 19, 2006
A Exhibition Of Tech Geekers Incompetence: Emacs whitespace-mode	12	Aug 14, 2009

parse newline

ela

Martijn Lievaart

Jürgen Exner

ela

ela

Jürgen Exner

Cosmic Cruizer

Martijn Lievaart

Martijn Lievaart

Ted Zlatanov

Tad J McClellan

Ask a Question

Similar Threads

Staff online

Members online

Forum statistics

Latest Threads