Jim, thanks for your suggestion.
But your proposal does not have a way to force select according for the given distribution. So it wouldn't produce what I am looking for. To validate, I did follow the steps and it produced a dist very similar to the original one.
Maybe I am not explaning in a clear way.
I have 400k+ rows X N columns of data (huge matrix) . If I call one of N params, as "X" for argument sake.
X's distribution from this data - call it X.dist1.
I have a second dataset which has Xagain, and this time w/ X.dist2
X.dist1 is not too far but is different than X.dist2.
Given the fact that I have plenty of data in the first dataset (400k+), I am hence trying to sub-select data set from 1 so that selected data X distribution is the closest I can get to X.dist2.
Hence I was looking for some capabiltiy under Table - Subset where I can specify normal distribution characteristic OR using a formula and then using Table- subset.
for the formula, I put densities of dataset1 against my desired X.dist2.
But now I need a uniform random way of sampling from the formulae outputs so to force the outcome distribution to be ~ X.dist2
OR any any other way ...
If I am to use a Table-Subset - the solution needs to specify a normal distribution from a parameter to pull from.
Or otherwise, I need to find a uniform random selection from a parameter (formula approach).