Please ignore my previous post: Why my Java code is THAT slow than C++?

Kevin · Feb 11, 2006

(I deleted that post online, but in case someone already got it via
email)

I found out that it is not the problem of my code.

The problem is the data file: for that particular task and for that
particular data file, the data file is in a format that favor that c++
code (so that it basically can skip the hash step), while my code is a
more generalized version. If I take special care of that particular
data format, my java code can get good speed too.

Sorry for the bother.

Jeffrey Schwab · Feb 11, 2006

Kevin said:
(I deleted that post online, but in case someone already got it via
email)

I found out that it is not the problem of my code.

The problem is the data file: for that particular task and for that
particular data file, the data file is in a format that favor that c++
code (so that it basically can skip the hash step), while my code is a
more generalized version. If I take special care of that particular
data format, my java code can get good speed too.

What are the new performance numbers?

Kevin · Feb 11, 2006

I just did a test, and my java code now runs as fast as the c++ code
(93 seconds, time including all, I basically use a "stopwatch" for it
beacuse it is what I need -- 1 or 2 seconds of miscount is possible).

Just in case other people may be interested in it, below, I briefly
state how I do the fast file read (for plain ascii in my case):
1) read in the file using InputStream, each time read in 32K data into
a byte[] buffer.
2) write my own "readLine()" method, which scan in the byte[] buffer,
and return a new byte[] as a line.
3) write my own "split(char c)" method, which break one byte[] into
many byte[].
If we want to hash this byte[], then write a string class around it to
provide the hash and other functions, etc. Try not convet them to
java's String, which will be slow.

Thanks all on this group.

William Brogden · Feb 12, 2006

I just did a test, and my java code now runs as fast as the c++ code
(93 seconds, time including all, I basically use a "stopwatch" for it
beacuse it is what I need -- 1 or 2 seconds of miscount is possible).

Just in case other people may be interested in it, below, I briefly
state how I do the fast file read (for plain ascii in my case):
1) read in the file using InputStream, each time read in 32K data into
a byte[] buffer.
2) write my own "readLine()" method, which scan in the byte[] buffer,
and return a new byte[] as a line.

Why a new byte[] when all you need is a start index and count? (Of course
that depends on keeping the initial buffer around.)

3) write my own "split(char c)" method, which break one byte[] into
many byte[].

See above question - the object holding index and count could calculate
a hashcode when it is created.

If we want to hash this byte[], then write a string class around it to
provide the hash and other functions, etc. Try not convet them to
java's String, which will be slow.

Very true!

Whenever I compile my C basic code, the a.exe file is seen as virus by McAfee and then that file is quarantined or removed	0	Aug 23, 2022
Why my java code is THAT slow compared too C++?	16	Feb 11, 2006
React native post-request is not working	1	May 27, 2023
I need help in understanding these files on my phone, Could someone help me understand these files? Urgent help needed. Please help.	1	Jun 4, 2023
Help with my responsive home page	2	Dec 14, 2022
Connected SQLite to my java program but information are not submitted	2	Aug 2, 2022
My first attempt at java code fails. OK, why please?	6	Jun 12, 2010
How can I view / open / render / display a pdf file with c code?	0	Sep 23, 2023

Please ignore my previous post: Why my Java code is THAT slow than C++?

Kevin

Jeffrey Schwab

Kevin

William Brogden

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads