cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
The Discovery Summit 2025 Call for Content is open! Submit an abstract today to present at our premier analytics conference.
Choose Language Hide Translation Bar
View Original Published Thread

Questions Regarding Normality Test and Advanced Statistical Analysis

AndyMFMD
Level I

Dear JMP Community,

Greetings, I am a student who has recently begun studying statistics. I have a strong interest in applying the knowledge I've learned in a research project. I apologize in advance if my questions seem basic or inappropriate. Currently, I am conducting research by distributing questionnaires to 406 diabetes patients. The questionnaire consists of gender data (categorical), age data (continuous), and 6 variables with Likert scales, where each variable has its own domain. After inputting the data into MS Excel, I have calculated the total answer scores (continuous), the average value of respondents' answers (continuous), and also performed categorization from variables (categorical). Then, I imported this data into JMP Pro 18.

In the initial stage of analysis, I want to perform a normality test. Regarding this, I have several questions:

1. How
 do I perform a normality test on the 6 variables and their domains? Do I need to conduct normality tests for all variables and their domains simultaneously or separately? Or do you have any suggestions that I could follow given my current situation where I'm experiencing difficulties with the normality test?

2. What should I do if the data is normally distributed or not normally distributed? This is important because the next stages I plan to conduct are Confirmatory Factor Analysis (CFA) and then Structural Equation Modeling (SEM).I greatly appreciate any input and advice you can provide to help me understand the appropriate steps in analyzing this data.

Thank you for your attention and assistance. Sincerely,
Andy

1 ACCEPTED SOLUTION

Accepted Solutions


Re: Questions Regarding Normality Test and Advanced Statistical Analysis

Great questions, Andy!

 

A key assumption underlying maximum likelihood estimation in SEM is that the observed variables come from a multivariate normal distribution. There are a few ways to check this assumption, and usually, visual checks (rather than formal tests of normality) are sufficient. Prior to conducting CFA or SEM, I’d suggest the following checks:

 

Assess univariate normality with Analyze > Distribution. This platform displays a histogram for each variable. Visually checking histograms is usually sufficient for SEM, but if you’d like, you can also look at QQ plots and conduct formal tests that assess whether each variable comes from a Normal distribution.

  • In the Distribution platform, click the red triangle menu for a given variable, then select Continuous Fit > Fit Normal.
  • Then, click the red triangle menu for the Fitted Normal Distribution. To view a QQ plot, select Diagnostic Plots > QQ Plot, and to conduct a formal test of normality (the Shapiro-Wilk test), click Goodness of Fit:

haleyyaremych_0-1738338712312.png

  • In the QQ plot, if the observed data closely follow the straight line, that’s good evidence for normality.
  • The null hypothesis of the Shapiro-Wilk test is that the observed data come from a Normal distribution. A small p-value means we should reject the null hypothesis; therefore, a nonsignificant p-value is good evidence for normality.

haleyyaremych_1-1738338712313.png

 

To get a sense of the pairwise relations among your variables, look at bivariate scatterplots with Analyze > Multivariate Methods > Multivariate.

 

As an additional check, you might also look for multivariate outliers. This can be done from within the SEM platform in JMP Pro. After launching the SEM platform, click on the topmost red triangle menu, and select Launch Explore Outliers:

haleyyaremych_2-1738338712314.png

 

If these checks indicate that your data are not normally distributed, or are otherwise not well-behaved, you have a few options in the SEM platform. After fitting your model(s), again click on the topmost red triangle menu. Under Inference, there are two options: Robust Inference and Bootstrap Inference. Robust Inference will recompute standard errors (SEs) and model fit statistics using the sandwich correction. This correction results in SEs and fit statistics that are robust to nonnormality. Bootstrap Inference will use bootstrapping to estimate SEs and model fit statistics, and the details of the bootstrapping process (e.g., the number of samples drawn) can be set by the user. These are both viable options to obtain valid inferences in the event that your data are not multivariate normal.

haleyyaremych_3-1738338712315.png

 

Hope this helps!

Haley

View solution in original post

3 REPLIES 3


Re: Questions Regarding Normality Test and Advanced Statistical Analysis

Great questions, Andy!

 

A key assumption underlying maximum likelihood estimation in SEM is that the observed variables come from a multivariate normal distribution. There are a few ways to check this assumption, and usually, visual checks (rather than formal tests of normality) are sufficient. Prior to conducting CFA or SEM, I’d suggest the following checks:

 

Assess univariate normality with Analyze > Distribution. This platform displays a histogram for each variable. Visually checking histograms is usually sufficient for SEM, but if you’d like, you can also look at QQ plots and conduct formal tests that assess whether each variable comes from a Normal distribution.

  • In the Distribution platform, click the red triangle menu for a given variable, then select Continuous Fit > Fit Normal.
  • Then, click the red triangle menu for the Fitted Normal Distribution. To view a QQ plot, select Diagnostic Plots > QQ Plot, and to conduct a formal test of normality (the Shapiro-Wilk test), click Goodness of Fit:

haleyyaremych_0-1738338712312.png

  • In the QQ plot, if the observed data closely follow the straight line, that’s good evidence for normality.
  • The null hypothesis of the Shapiro-Wilk test is that the observed data come from a Normal distribution. A small p-value means we should reject the null hypothesis; therefore, a nonsignificant p-value is good evidence for normality.

haleyyaremych_1-1738338712313.png

 

To get a sense of the pairwise relations among your variables, look at bivariate scatterplots with Analyze > Multivariate Methods > Multivariate.

 

As an additional check, you might also look for multivariate outliers. This can be done from within the SEM platform in JMP Pro. After launching the SEM platform, click on the topmost red triangle menu, and select Launch Explore Outliers:

haleyyaremych_2-1738338712314.png

 

If these checks indicate that your data are not normally distributed, or are otherwise not well-behaved, you have a few options in the SEM platform. After fitting your model(s), again click on the topmost red triangle menu. Under Inference, there are two options: Robust Inference and Bootstrap Inference. Robust Inference will recompute standard errors (SEs) and model fit statistics using the sandwich correction. This correction results in SEs and fit statistics that are robust to nonnormality. Bootstrap Inference will use bootstrapping to estimate SEs and model fit statistics, and the details of the bootstrapping process (e.g., the number of samples drawn) can be set by the user. These are both viable options to obtain valid inferences in the event that your data are not multivariate normal.

haleyyaremych_3-1738338712315.png

 

Hope this helps!

Haley

AndyMFMD
Level I


Re: Questions Regarding Normality Test and Advanced Statistical Analysis

Dear Mrs/Ms. Haley

Thank you very much for your help and guidance! I greatly appreciate your quick response and kindness in helping me understand the analysis steps. I will follow your advice and soon send the images of each step's results as soon as possible.

I hope this information can be beneficial to many people. I also hope to communicate with you again after I send the results of each step. Thank you again for your help!

Best regards
Andy
AndyMFMD
Level I


Re: Questions Regarding Normality Test and Advanced Statistical Analysis

Subject: Progress Update - JMP Analysis Trial
 
Dear Mrs./Ms Haley,
 
I hope this email finds you well. I would like to update you on the progress of the JMP analysis trial that was suggested. I have conducted the analysis and found some interesting results.
 
I have attached my findings from the JMP analysis. Based on the analysis results, I found that all variables have significant Shapiro-Wilk p-values, indicating that the data is not normally distributed. I would like to seek your advice on what to do next.
 
I would also like to clarify if I made any mistakes in data collection or analysis that led to the data not being normally distributed. I have re-checked the variable domains and found that none of them are normally distributed.
 
I hope you can help me understand these analysis results and provide guidance on the next steps. Thank you for your attention and assistance.
 
I have attached the JMP analysis results for your reference.
 
Thank you,
Andy