Which distance does JMP use in clustering?
Hello everybody.
I'm trying to understand which distance JMP uses in hierarchical clustering. The documentation at https://www.jmp.com/support/help/en/15.1/index.shtml#page/jmp/distance-method-formulas.shtml#ww17780... seems to say "squared Euclidean distance", but if I save the distance matrix (after clustering), what I get is the Euclidean distance (not squared).
And what about the k-means method? It should use the Euclidean distance, but I couldn't find the formulas.
Thank you in advance.
Re: which distance does JMP use in clustering?
Hi @francesco_della ,
JMP uses Euclidean distance to compute the initial distance matrix between observations, and then applies the method you chose to calculate the distances between clusters. This is true for any of the clustering methods within JMP. The only exception is if you provide the data as a distance matrix and choose the "data is distance matrix" option in the dialog for Hierarchical Clustering.
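To make the difference between the two quantities concrete, here is a minimal Python sketch (not JMP code; the function names and sample points are made up for illustration). It builds a pairwise distance matrix both ways, so you can see that the squared version is just the element-wise square of the plain Euclidean matrix.

```python
import math

def euclidean(a, b):
    """Plain (unsquared) Euclidean distance between two points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def distance_matrix(rows, squared=False):
    """Symmetric pairwise distance matrix; zeros on the diagonal."""
    n = len(rows)
    d = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            dist = euclidean(rows[i], rows[j])
            d[i][j] = d[j][i] = dist ** 2 if squared else dist
    return d

# Hypothetical observations, chosen so the distances come out as whole numbers.
rows = [(0.0, 0.0), (3.0, 4.0), (6.0, 8.0)]
plain = distance_matrix(rows)              # what a saved distance matrix would look like
sq = distance_matrix(rows, squared=True)   # the squared form shown in the formulas

print(plain[0][1], sq[0][1])  # 5.0 25.0
```

If the values in your saved matrix match the first form, you are looking at unsquared distances.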
For k-means, the JMP help refers to the SAS FASTCLUS procedure documentation.
Hope that helps.
Data Scientist, Life Sciences - Global Technical Enablement
JMP Statistical Discovery, LLC. - Denver, CO
Tel: +1-919-531-9927 ▪ Mobile: +1-303-378-7419 ▪ E-mail: chris.kirchberg@jmp.com
www.jmp.com
Re: which distance does JMP use in clustering?
Thank you so much for your prompt answer, Chris.
I was just confused because the formula in the documentation definitely shows the SQUARED Euclidean distance.
Thank you again, regards
Re: which distance does JMP use in clustering?
No problem. The squared distance is used in the subsequent between-cluster distance calculations in some methods, but the initial distance matrix between the observations is not squared.
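As an illustration of where the square shows up, here is a small Python sketch of two between-cluster distances in their usual textbook form: the centroid method (squared Euclidean distance between the cluster means) and Ward's method (the same quantity divided by 1/N_K + 1/N_L). This assumes JMP's formulas match those standard definitions; the function names and data below are hypothetical, not JMP's.

```python
def centroid(points):
    """Coordinate-wise mean of a list of points."""
    n = len(points)
    return tuple(sum(p[k] for p in points) / n for k in range(len(points[0])))

def squared_euclidean(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def centroid_distance(cluster_k, cluster_l):
    """Centroid method: squared distance between the two cluster means."""
    return squared_euclidean(centroid(cluster_k), centroid(cluster_l))

def ward_distance(cluster_k, cluster_l):
    """Ward's method: squared centroid distance scaled by the cluster sizes."""
    size_term = 1.0 / len(cluster_k) + 1.0 / len(cluster_l)
    return centroid_distance(cluster_k, cluster_l) / size_term

# Hypothetical clusters with centroids (1, 0) and (5, 4).
cluster_k = [(0.0, 0.0), (2.0, 0.0)]
cluster_l = [(4.0, 4.0), (6.0, 4.0)]

d_centroid = centroid_distance(cluster_k, cluster_l)
d_ward = ward_distance(cluster_k, cluster_l)
print(d_centroid, d_ward)
```

Both functions work from the cluster means and never take a square root, which is consistent with the squared form appearing in the between-cluster formulas even though the observation-level matrix is unsquared.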
Best,