cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
The Discovery Summit 2025 Call for Content is open! Submit an abstract today to present at our premier analytics conference.
See how to use JMP Live to centralize and share reports within groups. Webinar with Q&A April 4, 2pm ET.
Choose Language Hide Translation Bar
View Original Published Thread

which distance does JMP use in clustering?

francesco_della
Level II

Hello everybody.

I'm trying to understand which distance does JMP use in hierarchical clustering: the documentation in https://www.jmp.com/support/help/en/15.1/index.shtml#page/jmp/distance-method-formulas.shtml#ww17780... seems to say "squared euclidean distance" but if I save the distance matrix (after clustering) what I get is the euclidean distance (not squared).

And what about the k-means method? It should use the euclidean distance, but I couldn't find the formulas.

Thank you in advance.

3 REPLIES 3


Re: which distance does JMP use in clustering?

Hi @francesco_della ,

 

JMP uses Euclidean Distance for the initial distance matrix calculation between observations and then the method chosen for calculating distances between the clusters. This is true for any of the Clustering methods within JMP. The only exception is if you provide the data as a distance matrix and choose the data is distance matrix option in the dialog for Hierarchical Clustering.

 

For k- Means, the JMP help refers to the SAS FASTCLUS Procedure documentation found here.

 

Hope that helps.

Chris Kirchberg, M.S.2
Data Scientist, Life Sciences - Global Technical Enablement
JMP Statistical Discovery, LLC. - Denver, CO
Tel: +1-919-531-9927 ▪ Mobile: +1-303-378-7419 ▪ E-mail: chris.kirchberg@jmp.com
www.jmp.com


Re: which distance does JMP use in clustering?

thank you so much for your prompt answer, Chris
I was just confused because the formula in the documentation definitely shows the SQUARED euclidean distance
thank you again, regards


Re: which distance does JMP use in clustering?

Hi @francesco_della 

No problem. The squared distance is used in the subsequent between cluster distance calculations in some methods, but the initial distance matrix between the observations is not squared.

Best,

Chris Kirchberg, M.S.2
Data Scientist, Life Sciences - Global Technical Enablement
JMP Statistical Discovery, LLC. - Denver, CO
Tel: +1-919-531-9927 ▪ Mobile: +1-303-378-7419 ▪ E-mail: chris.kirchberg@jmp.com
www.jmp.com