Turn on suggestions

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

- JMP User Community
- :
- Discussions
- :
- Discussions
- :
- What distance is saved when I click "save clusters" in the K-means clustering re...

Topic Options

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

What distance is saved when I click "save clusters" in the K-means clustering report

Feb 22, 2016 11:30 AM
(1899 views)

I am wondering what distance is saved for each row when I click "save clusters" in the Kmeans clustering report. I used the K-means method to participate my data table of 10000 rows into 50 clusters. When I clicked "save clusters", I saved two columns. One is the cluster column, which indicate which cluster the row is assigned to; the other one is called "Distance". I am wondering what distance is the "Distance". I found that the distance between each row and the cluster center is much smaller than the "Distance".

- Tags:
- cluster

2 REPLIES 2

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Re: What distance is saved when I click "save clusters" in the K-means clustering report

This is taken from the Multivariate Methods book available in JMP under Help==>Books==>Multivariate Methods

Jim

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Get Direct Link
- Email to a Friend
- Report Inappropriate Content

Re: What distance is saved when I click "save clusters" in the K-means clustering report

Thanks, Jim. It looks like that the distance is calculated as the Euclidean length between two vectors. Then which two vectors are used to calculate the "Distance"? Is it the distance between each row and the center of the cluster of that row? Or is it the distance between each row and the mean of all rows? However, rrom my calculation, the "Distance" is larger than both cases.