This is a pretty elementary question, but how should one go about analysis of a large human clinical research data set with multiple rows for a given participant?
Lets say there are columns titled "ParticipantID", "Visit", and "HasBeard" each containing categorical data.
For each ParticipantID (i.e. 001; 002; 003) there are multiple study Visits (i.e. V1; V2; V3 etc) with each Visit containing unique data points on various measures such as "HasBeard".
Thus there are multiple rows for a single Participant ID (i.e. Participant 001, V1; Participant 001, V2; Participant 001, V3 etc.)
SEE ATTACHED.
If I were to do an analysis on a certain column measure such as "HasBeard" that contains categorical data (0 = has no beard, 1 = has beard), and want to know the distribution of Participants that have a beard at V3 but exclude those that already had a beard at V1, what is the most efficient and elegant method?
Given the provided image, I want to only return Participant 002, once.
Do I want to combine a Data Filter and a Local Data Filter? Do I want to exclude certain rows first using the Exclude function?
What is the best way to leverage JMP PRO for this sort of basic work?