cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
Choose Language Hide Translation Bar
SKHDS
Level I

Duplicate Row Selection Including the Original Row

Hello. I have a data table that contains 960K rows and 49 columns. There are duplicate rows (from 2-4 rows) based on one of the 49 columns. I've used the Row Selection - Select Duplicate Rows function after selecting this one column that contains the duplicate values. When the results came back, the duplicate rows became highlighted but the original row above the duplicate row/s is not. Is there a way for JMP to select ALL rows containing the duplicated values including the original row with the identical value? I don't have knowledge of writing a script but I would think the built-in functions in JMP would allow me to do this row selection. I'm using JMP16 on Windows 10. If someone knows the solution or a trick to do this, that 'll be greatly appreciated.

2 REPLIES 2
txnelson
Super User

Re: Duplicate Row Selection Including the Original Row

The reason the Select Duplicate Rows function does not select all of the matching rows, is because it is assumed that you want to retain one of the matching rows. So when the Select Duplicate Rows function returns with the selected rows, all you have to do is to right click on one of the selected rows, and select, Delete Rows. You will then be left with only one copy of the duplicated rows.
What is your need to see all matching rows?
You can select the cells you want matched for a given row, and then right click and choose, Select Matching Cells, and you will see all of the rows where exact matches are found.
Jim

Re: Duplicate Row Selection Including the Original Row

Hi,

 

An easy way to do this interactively is with the Summary platform, in the tables menu. Place the column(s) of interest in the Group role, then hit OK.

 

A summary table, linked to the original table, is created. Selecting any row in the summary table selects the source rows in the original table from which the summary row derives.

 

In your case, you likely want to select rows with an entry other than "1" in the N Rows column. If you'd like to select all of these at once:

- right-click on a "1" in the N Row column and select "Select matching rows" from the context-sensitive menu

- now, right-click on any colored field to the left of the first column (that is, on any selected row) and select "Invert Selection". This selects all rows NOT containing a 1 in the N Rows column... which is all rows where duplicates exist in the main table. Since the tables are linked, all corresponding rows in the main table are now selected.

 

Cheers,

Brady