
which distance does JMP use in clustering?

Hello everybody.

I'm trying to understand which distance JMP uses in hierarchical clustering. The documentation at https://www.jmp.com/support/help/en/15.1/index.shtml#page/jmp/distance-method-formulas.shtml#ww17780... seems to say "squared Euclidean distance", but if I save the distance matrix (after clustering), what I get is the Euclidean distance (not squared).

And what about the k-means method? It should use the Euclidean distance, but I couldn't find the formulas.

Thank you in advance.

3 REPLIES

Re: which distance does JMP use in clustering?

Hi @francesco_della,

JMP uses Euclidean distance for the initial distance matrix between observations, and then applies the chosen method to calculate the distances between clusters. This is true for any of the clustering methods in JMP. The only exception is when you provide the data as a distance matrix and select the "Data is distance matrix" option in the Hierarchical Clustering dialog.
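To make the distinction concrete, here is a minimal Python sketch (not JMP's implementation) of an observation-level distance matrix computed with plain Euclidean distance, next to its squared counterpart. The sample points and the helper names `euclidean` and `distance_matrix` are made up for illustration.

```python
import math

def euclidean(a, b):
    """Plain (unsquared) Euclidean distance between two observations."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def distance_matrix(rows):
    """Symmetric matrix of pairwise Euclidean distances between rows."""
    return [[euclidean(r, s) for s in rows] for r in rows]

# Hypothetical observations, chosen so the distances are round numbers.
points = [(0.0, 0.0), (3.0, 4.0), (6.0, 8.0)]

dist = distance_matrix(points)                      # unsquared distances
squared = [[d ** 2 for d in row] for row in dist]   # squared version, for comparison

for row in dist:
    print(row)
```

Comparing a saved distance matrix against both `dist` and `squared` computed from the same columns is a quick way to check which of the two a tool has written out.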

 

For k-means, the JMP help refers to the SAS FASTCLUS procedure documentation.

Hope that helps.

Chris Kirchberg, M.S.2
Data Scientist, Life Sciences - Global Technical Enablement
JMP Statistical Discovery, LLC. - Denver, CO
Tel: +1-919-531-9927 ▪ Mobile: +1-303-378-7419 ▪ E-mail: chris.kirchberg@jmp.com
www.jmp.com

Re: which distance does JMP use in clustering?

Thank you so much for your prompt answer, Chris.
I was just confused because the formula in the documentation definitely shows the SQUARED Euclidean distance.
Thank you again, regards.

Re: which distance does JMP use in clustering?

Hi @francesco_della 

No problem. The squared distance is used in the subsequent between-cluster distance calculations in some methods, but the initial distance matrix between the observations is not squared.
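As an illustration of where a squared term can appear at the cluster level, the sketch below uses the commonly documented centroid and Ward formulas, in which the distance between two clusters is built from the squared Euclidean distance between their means. The formulas as written here are an assumption based on the standard textbook forms, not taken from JMP's code, and the clusters and function names are hypothetical.

```python
def mean(cluster):
    """Coordinate-wise mean of a list of equal-length tuples."""
    n = len(cluster)
    return tuple(sum(col) / n for col in zip(*cluster))

def squared_euclidean(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def centroid_distance(k, l):
    # Assumed textbook form: ||mean_K - mean_L||^2
    return squared_euclidean(mean(k), mean(l))

def ward_distance(k, l):
    # Assumed textbook form: ||mean_K - mean_L||^2 / (1/N_K + 1/N_L)
    return squared_euclidean(mean(k), mean(l)) / (1 / len(k) + 1 / len(l))

# Hypothetical clusters with means (1, 0) and (5, 4).
cluster_k = [(0.0, 0.0), (2.0, 0.0), (1.0, 1.0), (1.0, -1.0)]
cluster_l = [(4.0, 4.0), (6.0, 4.0), (5.0, 5.0), (5.0, 3.0)]

print(centroid_distance(cluster_k, cluster_l))
print(ward_distance(cluster_k, cluster_l))
```

Both functions work from squared distances between cluster means, while the observation-level matrix that feeds the clustering can still hold unsquared Euclidean distances.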

Best,

Chris Kirchberg, M.S.2