*Update
Sorry, I should have been clearer about the question. The data I currently have looks like this:
id email
1 a
1 b
2 a
2 c
3 d
We can see that ids 1 and 2 and emails a, b and c all belong to the same person, because the information overlaps (ids 1 and 2 share email a).
What I'm trying to do is build a match table with a new key that I can join back to the original table, so that overlapping ids and emails are identified as the same person:
newID id/email
111 1
111 2
111 a
111 b
111 c
222 3
222 d
The newID doesn't need to be in any specific form. It can even be equal to one of the original ids, as long as we can use it to identify a "unique person".
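One way to read this is as a connected-components problem: every id and every email is a node, every row is an edge, and each connected group is one person. Below is a minimal sketch of that idea in Python using a union-find (disjoint-set) structure. The function name build_match_table and the "id"/"email" tags are made up for illustration, and the newID is just a running counter, so treat it as a starting point rather than a finished solution.

```python
from collections import defaultdict


def build_match_table(rows):
    """Group ids and emails that are linked through shared rows.

    rows: iterable of (id, email) pairs.
    Returns a list of (new_id, kind, value) tuples, kind being "id" or "email".
    """
    parent = {}

    def find(x):
        # Return the representative of x's group (with path halving).
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        # Merge the groups that contain a and b.
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    for ident, email in rows:
        # Tag each value so an id and an email with the same text never collide.
        union(("id", ident), ("email", email))

    # Collect the members of every group under its representative.
    groups = defaultdict(list)
    for node in list(parent):
        groups[find(node)].append(node)

    table = []
    for new_id, members in enumerate(groups.values(), start=1):
        # List ids before emails, each sorted by their text form.
        for kind, value in sorted(members, key=lambda m: (m[0] != "id", str(m[1]))):
            table.append((new_id, kind, value))
    return table


rows = [(1, "a"), (1, "b"), (2, "a"), (2, "c"), (3, "d")]
for new_id, kind, value in build_match_table(rows):
    print(new_id, kind, value)
```

For the sample data this puts ids 1, 2 and emails a, b, c under newID 1, and id 3 and email d under newID 2. The kind column is there only so that the result can be joined back to the original table on either the id or the email column.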
-------------------------------------
Hi, I have data with 2 columns: one is an individual id and the other is an email. The problem with this data is that the relationship between individual id and email is many-to-many, meaning one individual id can have multiple emails, and one email can have multiple individual ids. So I need to create a mapping table from a new unique id to every individual id and email, which specifies that any individual ids or emails that overlap are actually the same person. I can't really think of a way to do this without a script. Could anyone help? Thank you!