Better data structure?

Marcus Kwok · Oct 10, 2005

I am in the process of converting some legacy code (written in C) to
C++.

The original data structure is used to hold a table of data (Gaussian
distribution, some of you call it a "normal distribution"). There are
two values of relevance: the "z" value (telling how many standard
deviations away from the mean), and the "phi" value (cumulative
probability).

The data we are reading lists "z" values from -4.00 to +4.00, in 0.02
increments. As a result, the old data structure was an array declared
as such:

double probs[401][2];

where probs[i][0] is the "z" value and probs[i][1] is the "phi" value of row i.

Later, we want to look up values in the table, and interpolate if the
exact values are not there, as follows:

double z, phi, tmp, ratio;
int indexL, indexH;

/* z is calculated here */

if (z <= -4.0)
    phi = probs[0][1];
else if (z >= 4.0)
    phi = probs[400][1];
else {
    tmp = (z + 4.0) / 0.02;
    indexL = floor(tmp);
    indexH = ceil(tmp);
    if (indexL == indexH)
        phi = probs[indexL][1];
    else {
        ratio = (z - probs[indexL][0]) / 0.02;
        phi = probs[indexL][1] + ratio * (probs[indexH][1] - probs[indexL][1]);
    }
}

Pretty ugly! So, I am trying to use better data structures, but it
doesn't seem to be a huge improvement:

std::vector< std::pair<double, double> > probs;

// z is calculated here

if (z <= -4.0) {
    phi = probs.front().second;
}
else if (z >= 4.0) {
    phi = probs.back().second;
}
else {
    double tmp = (z + 4.0) / 0.02;
    int indexL = static_cast<int>(std::floor(tmp));
    int indexH = static_cast<int>(std::ceil(tmp));
    if (indexL == indexH) {
        phi = probs[indexL].second;
    }
    else {
        double ratio = (z - probs[indexL].first) / 0.02;
        phi = probs[indexL].second + ratio * (probs[indexH].second - probs[indexL].second);
    }
}

Is there a better way to do this, maybe involving a std::map<double, double>?
However, my issue with the std::map<> is the floating-point inaccuracies
when comparing the keys to see if the exact key is in the table.
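
On the std::map<> concern: one possible way around exact key comparison is to never test for equality at all and instead ask the map for the first key that is not less than z, then interpolate between that entry and the one before it. The following is only a sketch under assumptions: make_table() and lookup_phi() are hypothetical names, the table is filled from the closed-form normal CDF via std::erfc rather than from the data file used in the real program, and C++11 is assumed for std::prev and std::erfc.

```cpp
#include <cmath>
#include <iterator>
#include <map>

// Hypothetical helper: fills the table from the closed-form normal CDF
// instead of reading the data file used in the original program.
std::map<double, double> make_table()
{
    std::map<double, double> table;
    for (int i = 0; i <= 400; ++i) {
        const double z = -4.0 + i * 0.02;
        table[z] = 0.5 * std::erfc(-z / std::sqrt(2.0));
    }
    return table;
}

// Hypothetical lookup: linear interpolation between the two neighbouring
// keys. Assumes a non-empty table and a finite z.
double lookup_phi(const std::map<double, double>& table, double z)
{
    // First entry whose key is not less than z; no equality test needed.
    std::map<double, double>::const_iterator hi = table.lower_bound(z);
    if (hi == table.begin())
        return hi->second;              // z at or below the smallest key
    if (hi == table.end())
        return table.rbegin()->second;  // z above the largest key

    std::map<double, double>::const_iterator lo = std::prev(hi);
    const double ratio = (z - lo->first) / (hi->first - lo->first);
    return lo->second + ratio * (hi->second - lo->second);
}
```

If z happens to match a key, ratio comes out as 1 and the result is the stored value up to rounding, so the exact-match case needs no branch of its own. The lookup is logarithmic in the table size rather than the constant-time index calculation of the array version, which may or may not matter for 401 entries.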

Jay Nabonne · Oct 10, 2005

struct Prob
{
    double z;
    double phi;
};

Prob probs[401];

double z, phi, tmp, ratio;
int indexL, indexH;

/* z is calculated here */

if (z <= -4.0)
    phi = probs[0].phi;
else if (z >= 4.0)
    phi = probs[400].phi;
else {
    tmp = (z + 4.0) / 0.02;
    indexL = floor(tmp);
    indexH = ceil(tmp);
    if (indexL == indexH)
        phi = probs[indexL].phi;
    else {
        ratio = (z - probs[indexL].z) / 0.02;
        phi = probs[indexL].phi + ratio * (probs[indexH].phi - probs[indexL].phi);
    }
}

- Jay

Marcus Kwok · Oct 10, 2005

Thanks. However, I was trying to avoid basic arrays and instead use a
std::vector<>, though maybe it will be clearer to create a struct (as
you did) instead of using a std::pair<>.

The main thing I am trying to do is clean up the calculation of the
index and the check for whether the exact value is in the table.
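
For illustration, here is one way the struct and the std::vector<> could be combined so that the index calculation lives in a single function and the exact-match test disappears. This is a sketch, not code from the thread: interpolate_phi() is a hypothetical name, and it assumes the table is sorted by z, evenly spaced, and that z is finite.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

struct Prob
{
    double z;
    double phi;
};

// Hypothetical helper; assumes a non-empty table that is sorted by z and
// evenly spaced, as the -4.00 to +4.00 table in steps of 0.02 is.
double interpolate_phi(const std::vector<Prob>& probs, double z)
{
    if (z <= probs.front().z)
        return probs.front().phi;
    if (z >= probs.back().z)
        return probs.back().phi;

    // Spacing derived from the table itself rather than hard-coded.
    const double step = (probs.back().z - probs.front().z) / (probs.size() - 1);
    // Position of z in units of table rows; its integer part is the lower index.
    const double pos = (z - probs.front().z) / step;
    std::size_t lo = static_cast<std::size_t>(std::floor(pos));
    if (lo >= probs.size() - 1)
        lo = probs.size() - 2;  // guard against round-off at the upper edge

    const double ratio = (z - probs[lo].z) / (probs[lo + 1].z - probs[lo].z);
    return probs[lo].phi + ratio * (probs[lo + 1].phi - probs[lo].phi);
}
```

When z lands on a table entry, ratio is 0 (or very close to 1 if round-off picks the row below), so the interpolation formula already returns the stored value and no separate branch is needed.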

Karl Heinz Buchegger · Oct 11, 2005

I don't think there is much you can do. It is already short and easy to understand,
and I really don't see a need to clean it up.
The only thing I would do is check whether indexL ever equals indexH and, if so,
how often. Due to round-off errors, I don't think ( z + 4.0 ) / 0.02 is likely to
come out as a whole number in double arithmetic, so there is little point in having
an 'if' that is almost never taken. But I may be wrong; only a test could show that.
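
A test along those lines could look like the sketch below. It is an assumption-laden illustration: count_exact_hits() is a hypothetical name, and it only feeds in the 401 nominal grid values, whereas the z values in the real program are computed elsewhere and may hit whole numbers more or less often. The count it returns depends on the platform's floating-point behaviour, so no expected figure is given here.

```cpp
#include <cmath>

// Hypothetical test helper: counts how many of the 401 nominal grid values
// produce a whole-number index in double arithmetic.
int count_exact_hits()
{
    int hits = 0;
    for (int i = 0; i <= 400; ++i) {
        const double z = -4.0 + i * 0.02;     // nominal table entry
        const double tmp = (z + 4.0) / 0.02;  // index computation from the thread
        if (std::floor(tmp) == std::ceil(tmp))
            ++hits;
    }
    return hits;
}
```

Whatever the count turns out to be, note that when indexL equals indexH the difference between the two phi values is zero, so the interpolation formula gives the same answer as the special case. The 'if' is therefore at most a shortcut, not a correctness requirement.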

makc.the.great · Oct 11, 2005

Marcus said:
I am in the process of converting some legacy code (written in C) to
C++.

The original data structure is used to hold a table of data [is] Pretty ugly!
So, I am trying to use better data structures...

When converting legacy code it is not a good idea to change
anything. There's a Russian saying: the "better" is the enemy of the
"good".

Marcus Kwok · Oct 11, 2005

I am reimplementing the entire program (it's not terribly big), so I
have full control over the code base and it's not too big of a
deal. Converting it to C++ has allowed me to not worry as much about
resource management (since I use RAII a lot), leaving me more time to
spot inefficiencies in the algorithms we're using.

Marcus Kwok · Oct 11, 2005

Thanks for your response. I may go back and look at it later, but a
separate, more urgent issue (completely unrelated to this) has come up
in a different area of the code. Since it works now, I may leave it as
is.
