shuchalle
Hello,
I am writing a program in Java and have the following requirements.
We have a large set of data points whose values range from 100 to 1500.
We need to select 10% of the data points at random. So if there are
40000 data points, we need to select 4000 of them at random.
Now you might say that's easy. Well, here is the twist.
We need to "skew" the randomness so that more points are selected
toward the high end, near 1500, and fewer points are selected
toward the low end of the spectrum, near 100. All in all, though,
exactly 10% of the points (4000 out of 40000) should still be
selected.
We could use some sort of "logarithmic skewage", if there is such a
word.
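To make the requirement concrete, here is a rough sketch of one possible approach: weighted sampling without replacement using the Efraimidis-Spirakis key trick. Each point gets a random key log(u) / w, where w is its weight, and the k points with the largest keys are kept. The class name SkewedSampler, the method sample, and the choice of weight function are all assumptions made for illustration. Passing Math::log as the weight is just one reading of "logarithmic skewage", and it may or may not give the amount of skew actually wanted.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;
import java.util.Random;
import java.util.function.DoubleUnaryOperator;

public class SkewedSampler {

    /** A data point paired with its random sort key. */
    private static final class Keyed {
        final double value;
        final double key;

        Keyed(double value, double key) {
            this.value = value;
            this.key = key;
        }
    }

    /**
     * Picks up to k distinct points; points with a larger weight are more
     * likely to be picked. The weight function is a hypothetical parameter:
     * any function that is positive and increasing on [100, 1500] skews the
     * selection toward the high end.
     */
    public static List<Double> sample(double[] points, int k,
                                      DoubleUnaryOperator weight, Random rng) {
        List<Double> result = new ArrayList<>();
        if (k <= 0) {
            return result;
        }
        // Min-heap on key: the head is the weakest of the k best keys so far.
        PriorityQueue<Keyed> heap =
                new PriorityQueue<>(Comparator.comparingDouble((Keyed e) -> e.key));
        for (double p : points) {
            double w = weight.applyAsDouble(p);
            if (w <= 0) {
                continue; // points with non-positive weight are never selected
            }
            double u = 1.0 - rng.nextDouble(); // in (0, 1], so log(u) is finite
            double key = Math.log(u) / w;      // same ordering as u^(1/w)
            if (heap.size() < k) {
                heap.add(new Keyed(p, key));
            } else if (key > heap.peek().key) {
                heap.poll();
                heap.add(new Keyed(p, key));
            }
        }
        for (Keyed e : heap) {
            result.add(e.value);
        }
        return result;
    }
}
```

With 40000 points, a call such as SkewedSampler.sample(points, 4000, Math::log, new Random()) would return exactly 4000 points, each chosen at most once. A steeper weight, for example v -> v or v -> v * v, would push the selection harder toward 1500.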
Any clever ideas or hints would be much appreciated!
Regards,
AZXML