Introduction
JMP 12 introduces a brand new platform for Multiple Correspondence Analysis. This technique helps understand and interpret associations among categorical variables. It is especially helpful when the variables have many levels. Usually the raw data are the frequencies of the occurrence of each combination of the responses and predictor levels. The technique is not restricted to frequencies, though. It may also be used with ratings using the doubling technique and with continuous responses using a combination of ranking and doubling. (See Greenacre 2007). In both of the previous cases, it is only a matter of simple data transformation before using the MCA platform. This script prepares ratings or continuous response data in a contingency-style layout for the MCA platform.
First Example: Ratings
The first example is based on the ratings of college basketball teams by various news sources in the Basketball data table from the Sample Data folder.
Note: the MCA platform in JMP requires that the response columns be stacked first. This script accepts the data in the format above, applies the data transformation, and then stacks the data automatically before launching the MCA platform.
Using the Script
Follow these simple steps.
- Open your multivariate data table in the format shown above.
- Open and run the script.
- Select the response data columns and click Response. (CSN through Sports Illustrated in this case)
- Select the predictor data column and click Predictor. (School in this case)
- Enter the collective name for the responses. (News Source in this case)
- Enter the name for the data values. (Rating in this case)
- Select Rating or Ranking for the Response Type in this case.
- Click OK.
The responses are individually ranked and then doubled to produce a negative pole and a positive pole for each response.
These new doublings are then stacked for the MCA.
Finally, the MCA platform is launched with the doubles and the predictor for interpretation.
The various news sources define the space and their doubles appear in opposite directions from the origin of the plot. Note that about 43% of the inertia in the original ten dimensions is represented in the first two dimensions shown here. It might be worthwhile to pursue more insight using the 3D Correspondence Analysis feature of the MCA platform.
Second Example: Continuous Responses
The second example is based on European Union economic indicators. (See attachment below.)
Follow the same steps as outline above but use the columns in the EU Indicators data table as shown below.
Be sure to select Continuous as the Response Type before clicking OK.
In this example, about 80% of the inertia is displayed by the first two dimensions.
Acknowledgement
Special thanks to Jianfeng Ding, JMP Development, for help with this script.
Reference
Greenacre, Michael (2007) Correspondence Analysis in Practice, Second Edition, Boca Raton: Chapman & Hall/CRC, pp. 182-184.