How do you use JMP to standardize data?

rebecca_drebin0

Community Member

Joined:

Sep 22, 2015

An assignment requires that we standardize the data for a particular variable (runtime) and see what a current value would become after standardizing the data. Any ideas on how to do this? Thanks!

4 REPLIES
ron_horne

Super User

Joined:

Jun 23, 2011

Hi Rebecca,

The simplest way is to run a Distribution on the variable and, from the red triangle menu, choose Save > Standardized, as in the picture.

This creates a new column in the data table with the standardized value of the variable for each row.

9916_pastedImage_0.png

Otherwise, you can create a new column and enter the standardizing formula manually.

In some cases (such as the Fit Model platform) you do not need to standardize the data before the analysis, since you can request the standardized coefficients in the results. To do this, right-click the Parameter Estimates table and select Std Beta under the Columns option.

9938_pastedImage_1.png

Good luck!

billw_jmp

Staff

Joined:

Jul 2, 2014

Hi Rebecca,

To add to Ron's reply: standardizing data can also be described as centering and scaling the data. Centering subtracts the mean from every value, and scaling divides the centered values by the standard deviation. You can build the formula yourself by first running Analyze > Distribution to get the mean and standard deviation of your data. You can then make a new column and give it this column formula:

(Runtime value - mean value) / Stdev value
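If it helps to check the arithmetic outside JMP, here is a minimal Python sketch of the same centering and scaling. The runtime values are made up for illustration, and it assumes the sample standard deviation (n - 1 in the denominator). Note that the subtraction has to be grouped in parentheses before the division.

```python
import statistics

# Hypothetical runtime values, invented for illustration only.
runtime = [90, 100, 110, 120, 130]

mean = statistics.mean(runtime)     # value used for centering
stdev = statistics.stdev(runtime)   # sample standard deviation (n - 1), assumed here

# Center first, then scale. Without the parentheses, only the mean
# would be divided by the standard deviation.
standardized = [(x - mean) / stdev for x in runtime]

for x, z in zip(runtime, standardized):
    print(f"{x:>5} -> {z:+.3f}")
```

After standardizing, the column should have a mean of 0 and a standard deviation of 1, which is a quick way to confirm the formula was entered correctly.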

Best,

Bill

Solution

And you can create a virtual column this way, without the need to build a formula directly:

9936_Screen Shot 2015-09-23 at 13.19.23.png

markbailey

Staff

Joined:

Jun 23, 2011

In addition to the solutions already offered: it sounded like you might want to use historical data to determine how to standardize new values. If so, simply compute and store the mean and standard deviation of the historical sample, then standardize the new value with this computation: (new - mean) / (standard deviation). The computation could be performed in a column formula or with a script, depending on the situation.
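As a rough sketch of that idea (in Python rather than a JMP script), the historical mean and standard deviation can be computed once and reused for any new value. The function name `fit_standardizer` and all the numbers are hypothetical, and the sample standard deviation is assumed.

```python
import statistics

def fit_standardizer(historical):
    """Store the historical mean and sample standard deviation,
    and return a function that standardizes new values with them."""
    mean = statistics.mean(historical)
    stdev = statistics.stdev(historical)

    def standardize(new_value):
        return (new_value - mean) / stdev

    return standardize

# Hypothetical historical runtimes and a hypothetical new observation.
historical_runtime = [90, 100, 110, 120, 130]
standardize = fit_standardizer(historical_runtime)

z_new = standardize(125)
print(f"standardized new value: {z_new:.3f}")
```

The new value is not included when the mean and standard deviation are computed, so its standardized score reflects where it falls relative to the historical sample.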

Learn it once, use it forever!