Ragged array whose rows are of varying size

er · Jul 4, 2008

Hi All,

I have an array

x00,x01,x02,...,x0K0
x10,x11,x12,...,x1K1
..
..
..
xm0,xm1,xm2,...,xmKm

m is *fixed*.
Each of K0, K1, ..., Km changes occasionally, i.e. each row is
"resized" *occasionally*.
Each row is modified *often* by the transform algorithm.

Does std::vector<std::vector<some type> > seem fine? Or is there any
reason to prefer
std::vector<std::shared_ptr<std::vector<some type> > > from an efficiency
standpoint?

Thanks.

Puppet_Sock · Jul 4, 2008

It is not really possible to tell from "an efficiency
standpoint" which will be superior. It will depend on
many details of such things as object size, how often
they get moved, how often you index in, how difficult
it is to make copies, how your compiler and library
version arrange things, how your platform arranges
memory and handles paging and on and on.

The usual correct answer when efficiency is an issue
(or any resource) is to work up a test case that is as
similar to your production environment and cases as
you can manage. Work it up with both possible ways of
doing things. And get out your stopwatch and do some
tests. If it is other resources, try it both ways and
see how those other resources are used.
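A rough timing harness along these lines could be a starting point. It is only a sketch: `touch` is a made-up stand-in for the real transform algorithm, all function names are hypothetical, and the sizes passed in should be replaced by ones that match the production case.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <memory>
#include <vector>

using Row = std::vector<double>;

// Stand-in for the real transform: updates every element of one row in place.
inline void touch(Row& row) {
    for (double& x : row) x = x * 0.5 + 1.0;
}

// Layout A: rows stored by value. Returns a checksum so the work is observable.
double run_by_value(std::size_t m, std::size_t k, int passes) {
    assert(passes >= 0);
    std::vector<Row> rows(m, Row(k, 1.0));
    for (int p = 0; p < passes; ++p)
        for (Row& r : rows) touch(r);
    double sum = 0.0;
    for (const Row& r : rows)
        for (double x : r) sum += x;
    return sum;
}

// Layout B: rows held through shared_ptr. Same work, one more pointer hop per row.
double run_by_shared_ptr(std::size_t m, std::size_t k, int passes) {
    assert(passes >= 0);
    std::vector<std::shared_ptr<Row>> rows;
    rows.reserve(m);
    for (std::size_t i = 0; i < m; ++i)
        rows.push_back(std::make_shared<Row>(k, 1.0));
    for (int p = 0; p < passes; ++p)
        for (const std::shared_ptr<Row>& r : rows) touch(*r);
    double sum = 0.0;
    for (const std::shared_ptr<Row>& r : rows)
        for (double x : *r) sum += x;
    return sum;
}

// Wall-clock time of one call to f, in milliseconds.
template <class F>
double time_ms(F&& f) {
    const auto t0 = std::chrono::steady_clock::now();
    f();
    const auto t1 = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(t1 - t0).count();
}
```

To use it, accumulate the checksum into a variable that is printed afterwards, e.g. `time_ms([&]{ sink += run_by_value(8, 1000, 10000); })`, so the optimizer cannot discard the work. Then repeat with `run_by_shared_ptr` and compare several runs built with the production compiler flags.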

If the difference winds up being unimportant, then do
it whichever way makes program complexity easier to
manage.
Socks

er · Jul 4, 2008

ps:
typical values: m<10
typical values for K0,...,Km: 100, 1000

er · Jul 4, 2008

Sure, I agree with everything you said, and that's probably what I'll
end up doing.
However, I'd be interested to know what sort of things come into play
in determining the tradeoff between std::vector<std::vector> and
std::vector<boost::shared_ptr<vector>>.

For example, what happens
- when I write to row j of the array?
- when I resize row j? Is there any memory reallocation for the
rows other than j? I would think not, but I'm no expert.
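One way to check the resize question directly is to record where each row's buffer lives, resize row j, and compare. The following is a minimal sketch, assuming double elements; the function name is made up for illustration. Each inner vector owns its own buffer, so a write such as `rows[j][k] = v` or a resize of row j only concerns row j's storage.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Resizes row j and reports whether every other row kept its buffer address and size.
bool resize_leaves_other_rows_alone(std::vector<std::vector<double>>& rows,
                                    std::size_t j, std::size_t new_size) {
    assert(j < rows.size());

    // Snapshot buffer addresses and sizes before touching row j.
    std::vector<const double*> buffers;
    std::vector<std::size_t> sizes;
    for (const std::vector<double>& r : rows) {
        buffers.push_back(r.data());
        sizes.push_back(r.size());
    }

    rows[j].resize(new_size);  // may reallocate row j's buffer only

    for (std::size_t i = 0; i < rows.size(); ++i) {
        if (i == j) continue;
        if (rows[i].data() != buffers[i] || rows[i].size() != sizes[i]) return false;
    }
    return true;
}
```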

ps2: the x's are scalars (say double).

acehreli · Jul 4, 2008

m is *fixed*,

Using a vector for the rows should be fine then (the outer vector
below), because the number of rows never changes.

each of K0, K1,...,Km is occasionally changed i.e. each row is
"resized" *occasionally*

Using a vector for each row is fine too.

each row is modified *often* by the transform algorithm

A vector for each row is still fine.

Does std::vector<std::vector<some type> > seem fine?
Yes.

Or is there any reason to prefer
std::vector<std::shared_ptr<std::vector<some type> > > from an efficiency
standpoint?

The one with shared_ptr could be unnoticeably slower because of the
extra indirection through the shared_ptr. The vector<vector> uses
indirection anyway: sizeof(vector<some_type>) is constant regardless
of the number of elements, because the elements live in a separately
allocated buffer.
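The point about indirection can be seen in a small sketch like the one below. The helper names are invented for illustration and double is assumed as the element type. The vector object itself is a fixed-size handle, and the elements sit in a buffer outside it.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// The handle has the same size whether the vector holds 1 element or n.
bool handle_size_is_fixed(std::size_t n) {
    assert(n > 0);
    std::vector<double> small_row(1);
    std::vector<double> big_row(n);
    return sizeof(small_row) == sizeof(big_row);
}

// The element buffer does not lie inside the bytes of the vector object.
bool elements_live_outside_handle(const std::vector<double>& row) {
    const std::uintptr_t obj = reinterpret_cast<std::uintptr_t>(&row);
    const std::uintptr_t buf = reinterpret_cast<std::uintptr_t>(row.data());
    return buf < obj || buf >= obj + sizeof(row);
}
```

So reaching an element of `vector<vector<double> >` already follows one pointer per row; the shared_ptr variant adds a second one.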

If anything, a vector of vector of shared_ptr could make a difference

vector<vector<shared_ptr<some_type> > >

if some_type is very expensive to copy or noncopyable. In that case
you may want to consider boost::ptr_vector as well:

vector<ptr_vector<some_type> >

Ali

er · Jul 4, 2008

excellent! thanks!

Jerry Coffin · Jul 4, 2008

Since you're not changing the number of rows, there's no particularly
good reason to use a column of pointers. You'd do that to avoid
copying entire rows when the vector of rows itself is resized.

er · Jul 5, 2008

Absolutely, just wanted it confirmed, just in case. thanks!

Roland Pibinger · Jul 5, 2008

Using a vector for the rows should be fine then (the outer vector
below), because the number of rows never changes.

... if you use reserve() before populating the vector to avoid the
unnecessary copying of elements.
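The effect of reserve() on the outer vector can be observed by counting how often its capacity changes while the rows are appended. This is a sketch only; the function name and the double element type are assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Appends m rows of k doubles and counts how often the outer vector's
// capacity changes during the appends (each change means a new outer buffer).
std::size_t outer_capacity_changes(std::size_t m, std::size_t k, bool use_reserve) {
    std::vector<std::vector<double>> rows;
    if (use_reserve) rows.reserve(m);  // one allocation up front for all m row handles

    std::size_t changes = 0;
    std::size_t capacity = rows.capacity();
    for (std::size_t i = 0; i < m; ++i) {
        rows.emplace_back(k, 0.0);     // constructs a row of k zeros in place
        if (rows.capacity() != capacity) {
            ++changes;
            capacity = rows.capacity();
        }
    }
    assert(rows.size() == m);
    return changes;
}
```

Since m is fixed here, constructing the outer vector with its final size, e.g. `std::vector<std::vector<double>> rows(m);`, avoids the growth as well.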

The one with shared_ptr could be unnoticeably slower because of the
extra indirection through the shared_ptr. The vector<vector> uses
indirection anyway: sizeof(vector<some_type>) should be constant
regardless of the size of the vector.

shared_ptr uses one dynamic allocation per element. This will slow
down the application, not the 'extra indirection'.
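The allocation cost can be made visible by counting calls to the global allocation function while each layout is built. The sketch below replaces the global `operator new` purely for illustration; the counter and function names are invented, and the exact counts depend on the standard library in use.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <memory>
#include <new>
#include <vector>

// Illustration only: count every call to the global allocation function.
static std::size_t g_allocations = 0;

void* operator new(std::size_t n) {
    ++g_allocations;
    if (void* p = std::malloc(n == 0 ? 1 : n)) return p;
    throw std::bad_alloc();
}
void operator delete(void* p) noexcept { std::free(p); }
void operator delete(void* p, std::size_t) noexcept { std::free(p); }

using Row = std::vector<double>;

// Allocations made while building m rows of k doubles stored by value.
std::size_t allocations_by_value(std::size_t m, std::size_t k) {
    assert(k > 0);
    const std::size_t start = g_allocations;
    std::vector<Row> rows;
    rows.reserve(m);                         // buffer for the row handles
    for (std::size_t i = 0; i < m; ++i)
        rows.emplace_back(k, 0.0);           // one element buffer per row
    return g_allocations - start;
}

// Same data held through shared_ptr: each row also needs a heap-allocated Row.
std::size_t allocations_by_shared_ptr(std::size_t m, std::size_t k) {
    assert(k > 0);
    const std::size_t start = g_allocations;
    std::vector<std::shared_ptr<Row>> rows;
    rows.reserve(m);                         // buffer for the shared_ptr handles
    for (std::size_t i = 0; i < m; ++i)
        rows.push_back(std::make_shared<Row>(k, 0.0));  // Row object plus its element buffer
    return g_allocations - start;
}
```

With m < 10 rows the extra allocations happen only when the rows are created, so they are unlikely to matter for the frequent per-row transform; they would matter more if rows were created and destroyed often.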
