cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
  • Register to attend Discovery Summit 2025 Online: Early Users Edition, Sept. 24-25.
  • New JMP features coming to desktops everywhere this September. Sign up to learn more at jmp.com/launch.
Choose Language Hide Translation Bar
Zappy
Level III

Partition On Proportion(percentage) Responses

Dear JMP Community,

 

I have tried to find similar post but so far none is addressing the question I have.

I attached here also dummy data.

 

I am doing data mining to profile and quantify the factors vs responses effect.

To put it simple, I have 2 factors (categorical with 2 levels) and 1 response (yield loss) which is a percentage.

We record each day's yield loss vs what 2 factors level we are using on that day.

For each day, our quantity produce is different.

 

The modeling technique I'm using is Partition.

 

Partition directly calculate Mean percentage as show in the output below.

As each day the quantity we produce is different, it is not accurate that Partition will simply calculate the Mean of the Yield Loss%.

Picture1.png

My question is:

1. How do we calculate the Mean weighted percentage in this case using Partition? I'm seeing an option in the Partition Platform for "Weight", but I'm not sure how to utilize that.

2. Is there a feature in Partition we could use Median instead of Mean? (since the yield loss distribution are generally skewed).

 

Thanks for your advice,

Zappy

 

1 REPLY 1
Victor_G
Super User

Re: Partition On Proportion(percentage) Responses

Hi @Zappy,

You could maybe add a column with the number of batches/quantity per day, and use this information as Frequency information when launching the Partition platform : https://www.jmp.com/support/help/en/18.2/#page/jmp/launch-the-partition-platform.shtml#ww1276002

The Decision Tree do not create splits directly based on mean values, see https://www.jmp.com/support/help/en/18.2/#page/jmp/statistical-details-for-the-splitting-criterion.s... for more info. So I think you could still get valuable insights using the platform directly.
If not, you could maybe try transforming your data, maybe by binning your raw data into classes of values first or log-transform your raw data (if the distribution is approximately log-normal), before partitioning with Decision Tree ?

Hope this answer will help you,

Victor GUILLER

"It is not unusual for a well-designed experiment to analyze itself" (Box, Hunter and Hunter)

Recommended Articles