Difference in int cast to char between Windows and redhat linux 9 under JDK 1.4.2_06

Private · Dec 10, 2004

Difference in int cast to char between Windows and redhat linux 9 under JDK
1.4.2_06

With regards to the following; how can I control the charset used for the
cast ? The desired outcome would be for 0x80 to cast to '\u0080'. At
the least I need to be able to run consistently across the two platforms.

Thanks to all for any help on this.

<java code CharTester fragment>

FileInputStream fis=new FileInputStream(args[0]);

for (int i=fis.read();i>-1;i=fis.read())
System.out.print((char)i);

System.out.flush();
fis.close();

</fragment>

<test run under linux>

# hexdump -C < print.doc
00000000 7e 7f 80 81 fc fd ff |~......|
00000007

# java com.asl.hacks.CharTester print.doc
# java com.asl.hacks.CharTester print.doc | hexdump -C
00000000 7e 7f c2 80 c2 81 c3 bc c3 bd c3 bf |~...........|
0000000c

</test>

under windows the same command produces

7e 7f 3f 3f fc fd ff

Alex Kizub · Dec 11, 2004

under windows the same command produces

7e 7f 3f 3f fc fd ff

You use none ASCII characters. Read more about InputStream and Readers and what
is the difference betwen them.
Also check what is the locale on your Windows and Linux systems. I bet they are
different.

Alex Kizub.

Michael Borgwardt · Dec 11, 2004

Private said:
Difference in int cast to char between Windows and redhat linux 9 under JDK
1.4.2_06
No.

With regards to the following; how can I control the charset used for the
cast ?

There is no charset involved in such a cast, both are integer types.

The desired outcome would be for 0x80 to cast to '\u0080'.

And that is what will happen. Always.

<java code CharTester fragment>

FileInputStream fis=new FileInputStream(args[0]);

for (int i=fis.read();i>-1;i=fis.read())
System.out.print((char)i);

Ah, well this is a different thing. You are reading the byte values and
interpreting them as unicode code points. That's equivalent to using
the ISO-8859-1 charset.

But the platform dependence is not in the casting from int, it's in the
call of print(), which uses the platform default encoding to convert the
character back to bytes, which then appear on the program's standard
output.

What is the program supposed to do, anyway?

retriving escape unicode sequences from files ...	1	Aug 4, 2012
retriving escape unicode sequences from files ...	1	Aug 4, 2012
geting error as unxpected symbol read: ". in array initialization	0	Mar 27, 2016
corrupt zip files	10	May 6, 2012
Why file containing 256 bytes is 257 bytes long?	12	Sep 14, 2005
Can't get to_integer to work	6	Sep 25, 2003

Difference in int cast to char between Windows and redhat linux 9 under JDK 1.4.2_06

Private

Alex Kizub

Michael Borgwardt

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads