Turn on suggestions

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

- JMP User Community
- :
- Discussions
- :
- How do I bin/normalize data to minimize intra/inter subgroup variance?

Topic Options

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Feb 10, 2015 9:50 AM
(5320 views)

I have output test data with high variance to mean ratio. This data is correlated to several categorical variables. I would like to normalize the output data into small number of subgroups to minimize the variance to mean ratios. I know I can pick the subgroups and JMP will calculate intra-subgroup and inter-subgroup variance. If I pick good subgroup factors to normalize, then most of the variance is inter-group and much less intra-group. Is there a process that will explore a large number of factors and show intra to inter variance, so I can select an optimum subgroup? Thanks.

1 ACCEPTED SOLUTION

Accepted Solutions

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Hi, Cousin Paul!

I hate to start a message apologizing for it's possible ignorance, but without more context, it's kind of a shot in the dark...

Have you explored the Partition platform? It's binary recursive partitioning analyses will certainly "bin" your large-number-of-factors data for you in a manner that minimize the intragroup heterogeneity and maximizes intergroup heterogeneity between the two (and only two!) groups at each split.

This may not be exactly what you want, but it is certainly easily done!

1 REPLY 1

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Hi, Cousin Paul!

I hate to start a message apologizing for it's possible ignorance, but without more context, it's kind of a shot in the dark...

Have you explored the Partition platform? It's binary recursive partitioning analyses will certainly "bin" your large-number-of-factors data for you in a manner that minimize the intragroup heterogeneity and maximizes intergroup heterogeneity between the two (and only two!) groups at each split.

This may not be exactly what you want, but it is certainly easily done!