jasongao
Level II

got duplicate items after using split

I have attached a table (test.jmp). After I used the Split function as shown below, I got duplicate items, and the N value is also incorrect. Does anybody know what is wrong? Thanks a lot!

Table: 

Cap.JPG

 

Split setting:

Capture.JPG

 

Result I got, which contains duplicate items:

Capture1.JPG

 

 

1 ACCEPTED SOLUTION

Jeff_Perkinson
Community Manager

Re: got duplicate items after using split

You need to put supplier_wafer_id into the Group role.

 

That will create one row per supplier_wafer_id.
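To illustrate why grouping gives one row per ID, here is a minimal Python sketch of a grouped split (long to wide). It is not JMP's implementation, only a model of the idea: every input row is routed to the output row for its supplier_wafer_id, and defect_class decides the column. The sample values and the helper name `split_with_group` are made up for illustration and are not taken from test.jmp.

```python
from collections import defaultdict

# Hypothetical long-format rows: (supplier_wafer_id, defect_class, N_rows).
# The values are invented for illustration; they are not from test.jmp.
long_rows = [
    ("W001", "A", 3),
    ("W001", "B", 1),
    ("W002", "A", 5),
    ("W003", "B", 2),
    ("W003", "C", 4),
]

def split_with_group(rows):
    """Pivot long rows to wide: one output row per ID, one column per defect class."""
    wide = defaultdict(dict)
    for wafer_id, defect_class, n_rows in rows:
        # The ID acts as the group key, so each ID owns exactly one output row.
        wide[wafer_id][defect_class] = n_rows
    classes = sorted({defect_class for _, defect_class, _ in rows})
    # ID/class combinations that never occur stay None, like empty cells.
    return [
        {"supplier_wafer_id": wafer_id, **{c: cells.get(c) for c in classes}}
        for wafer_id, cells in wide.items()
    ]

wide_table = split_with_group(long_rows)
for row in wide_table:
    print(row)
```

Because the ID is the key that collects the cells, it cannot appear in more than one output row, which is the effect the Group role is described as having here.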

 

2019-11-18_17-59-45.171.png

-Jeff


5 REPLIES
dale_lehman
Level VI

Re: got duplicate items after using split

That's not what I get. First of all, I would not select all 3 columns to keep: the column you are splitting and the column you split by are already used in the dialog, so just select "Keep All." When I do that, I get 9 rows and the ID column shows no duplicates. Another strange thing is that my ID column contains strings of around 13 characters, while in your image it looks like you have a single letter. So I'm not sure what is going on, since it looks like you get 9 rows like I do.

jasongao
Level II

Re: got duplicate items after using split

Thanks. I think I uploaded the wrong file, but it is basically the same. If you look carefully, the 13-character IDs have duplicate items too.

I re-uploaded the file. I also tried "Keep All," and the result is the same.

dale_lehman
Level VI

Re: got duplicate items after using split

Indeed, that works. And Tabulate does the same thing. Personally, I like Tabulate better than splitting and stacking tables.

dale_lehman
Level VI

Re: got duplicate items after using split

If you look at the distribution of defect_class, select a couple of classes, and subset them, you will see that the duplicate ID rows show up for some defect classes and not others. I'm not entirely sure what is happening, but I think what you might want is Analyze > Tabulate: put defect_class in the columns and ID in the rows, then put N_rows in the table portion (selecting mean, N, max, or some other statistic, as appropriate).
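As a rough illustration of what such a cross-tabulation computes, here is a small Python sketch with IDs down the rows, defect classes across the columns, and a chosen statistic in each cell. It is only a model of the idea, not how Tabulate is implemented. The sample rows and the helper name `tabulate` are hypothetical.

```python
from collections import defaultdict

# Hypothetical rows (supplier_wafer_id, defect_class, N_rows). The pair
# ("W001", "A") appears twice, so a statistic is needed to fill that cell.
rows = [
    ("W001", "A", 3),
    ("W001", "A", 2),
    ("W001", "B", 1),
    ("W002", "A", 5),
]

def tabulate(rows, statistic=sum):
    """Cross-tabulate: IDs down the rows, defect classes across the columns."""
    cells = defaultdict(list)
    for wafer_id, defect_class, n_rows in rows:
        cells[(wafer_id, defect_class)].append(n_rows)
    ids = sorted({r[0] for r in rows})
    classes = sorted({r[1] for r in rows})
    # Each cell holds the statistic of all values that share (ID, class);
    # combinations that never occur are left as None.
    return {
        wafer_id: {
            c: statistic(cells[(wafer_id, c)]) if (wafer_id, c) in cells else None
            for c in classes
        }
        for wafer_id in ids
    }

table_sum = tabulate(rows)           # sum of N_rows per cell
table_max = tabulate(rows, max)      # largest N_rows per cell
table_n = tabulate(rows, len)        # number of source rows per cell
print(table_sum)
```

Whichever statistic is chosen, repeated ID and class combinations collapse into a single cell, so each ID still gets exactly one row.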
