Questions Regarding Normality Test and Advanced Statistical Analysis
Dear JMP Community,
Greetings, I am a student who has recently begun studying statistics. I have a strong interest in applying the knowledge I've learned in a research project. I apologize in advance if my questions seem basic or inappropriate. Currently, I am conducting research by distributing questionnaires to 406 diabetes patients. The questionnaire consists of gender data (categorical), age data (continuous), and 6 variables with Likert scales, where each variable has its own domain. After inputting the data into MS Excel, I have calculated the total answer scores (continuous), the average value of respondents' answers (continuous), and also performed categorization from variables (categorical). Then, I imported this data into JMP Pro 18.
In the initial stage of analysis, I want to perform a normality test. Regarding this, I have several questions:
1. How do I perform a normality test on the 6 variables and their domains? Do I need to conduct normality tests for all variables and their domains simultaneously or separately? Or do you have any suggestions that I could follow given my current situation where I'm experiencing difficulties with the normality test?
2. What should I do if the data is normally distributed or not normally distributed? This is important because the next stages I plan to conduct are Confirmatory Factor Analysis (CFA) and then Structural Equation Modeling (SEM).
I greatly appreciate any input and advice you can provide to help me understand the appropriate steps in analyzing this data.
Thank you for your attention and assistance. Sincerely,
Andy
Accepted Solutions
Re: Questions Regarding Normality Test and Advanced Statistical Analysis
Great questions, Andy!
A key assumption underlying maximum likelihood estimation in SEM is that the observed variables come from a multivariate normal distribution. There are a few ways to check this assumption, and usually, visual checks (rather than formal tests of normality) are sufficient. Prior to conducting CFA or SEM, I’d suggest the following checks:
Assess univariate normality with Analyze > Distribution. This platform displays a histogram for each variable. Visually checking histograms is usually sufficient for SEM, but if you’d like, you can also look at QQ plots and conduct formal tests that assess whether each variable comes from a Normal distribution.
- In the Distribution platform, click the red triangle menu for a given variable, then select Continuous Fit > Fit Normal.
- Then, click the red triangle menu for the Fitted Normal Distribution. To view a QQ plot, select Diagnostic Plots > QQ Plot, and to conduct a formal test of normality (the Shapiro-Wilk test), click Goodness of Fit.
- In the QQ plot, if the observed data closely follow the straight line, that’s good evidence for normality.
- The null hypothesis of the Shapiro-Wilk test is that the observed data come from a Normal distribution. A small p-value means we should reject the null hypothesis; a nonsignificant p-value means the test found no evidence against normality, so the data are consistent with a Normal distribution.
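For anyone who wants to see what the QQ plot check is measuring outside of JMP, here is a minimal Python sketch using only the standard library. It is not JMP's implementation: it pairs the sorted data with standard Normal quantiles and reports their correlation, which is close to 1 when the points follow the straight line. The function names, the plotting positions (i - 0.5)/n, and both data sets are assumptions made for illustration.

```python
from statistics import NormalDist, mean


def qq_points(values):
    """Return (theoretical, observed) quantiles for a Normal QQ plot."""
    observed = sorted(values)
    n = len(observed)
    std_normal = NormalDist()  # mean 0, standard deviation 1
    # Plotting positions (i - 0.5) / n stay strictly inside (0, 1).
    theoretical = [std_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]
    return theoretical, observed


def pearson_r(xs, ys):
    """Plain Pearson correlation between two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def qq_correlation(values):
    """Correlation of the QQ plot points; near 1 suggests a straight line."""
    theoretical, observed = qq_points(values)
    return pearson_r(theoretical, observed)


# Hypothetical data: exact Normal quantiles versus a strongly right-skewed set.
near_normal = [NormalDist(50, 10).inv_cdf((i - 0.5) / 200) for i in range(1, 201)]
skewed = [x ** 3 for x in range(1, 201)]

print("near-normal:", round(qq_correlation(near_normal), 4))
print("skewed:     ", round(qq_correlation(skewed), 4))
```

The skewed set gives a visibly lower correlation than the near-normal one, which is the numeric counterpart of points bending away from the line in the plot.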
To get a sense of the pairwise relations among your variables, look at bivariate scatterplots with Analyze > Multivariate Methods > Multivariate.
As an additional check, you might also look for multivariate outliers. This can be done from within the SEM platform in JMP Pro. After launching the SEM platform, click on the topmost red triangle menu, and select Launch Explore Outliers.
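To make the idea of a multivariate outlier concrete, here is a small Python sketch of one common approach, the Mahalanobis distance, which measures how far a row is from the centroid after accounting for the correlations among variables. This is an illustration only, not a description of what Explore Outliers computes in JMP. The helper names and the two-column data are hypothetical; the last row has an unremarkable value on each variable by itself but breaks the joint pattern.

```python
from statistics import mean


def covariance_matrix(rows):
    """Return column means and the sample covariance matrix (divisor n - 1)."""
    n, p = len(rows), len(rows[0])
    means = [mean([row[j] for row in rows]) for j in range(p)]
    cov = [[sum((r[a] - means[a]) * (r[b] - means[b]) for r in rows) / (n - 1)
            for b in range(p)] for a in range(p)]
    return means, cov


def solve(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(matrix[i]) + [rhs[i]] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("covariance matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x


def mahalanobis_distances(rows):
    """Distance of each row from the centroid, scaled by the covariance."""
    means, cov = covariance_matrix(rows)
    distances = []
    for row in rows:
        diff = [v - m for v, m in zip(row, means)]
        weighted = solve(cov, diff)  # equals inverse(cov) @ diff
        distances.append(sum(d * w for d, w in zip(diff, weighted)) ** 0.5)
    return distances


# Hypothetical scores on two correlated scales; the last row is the odd one.
rows = [(1, 1.2), (2, 1.9), (3, 3.1), (4, 4.2), (5, 4.8),
        (6, 6.1), (7, 7.0), (8, 7.9), (2, 8.0)]
distances = mahalanobis_distances(rows)
for row, d in zip(rows, distances):
    print(row, round(d, 2))
```

Rows with unusually large distances are the ones worth inspecting. Note that this classical version uses the ordinary mean and covariance, which are themselves pulled toward outliers, so robust variants are often preferred in practice.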
If these checks indicate that your data are not normally distributed, or are otherwise not well-behaved, you have a few options in the SEM platform. After fitting your model(s), again click on the topmost red triangle menu. Under Inference, there are two options: Robust Inference and Bootstrap Inference. Robust Inference will recompute standard errors (SEs) and model fit statistics using the sandwich correction. This correction results in SEs and fit statistics that are robust to nonnormality. Bootstrap Inference will use bootstrapping to estimate SEs and model fit statistics, and the details of the bootstrapping process (e.g., the number of samples drawn) can be set by the user. These are both viable options to obtain valid inferences in the event that your data are not multivariate normal.
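To show what bootstrapping does in general terms, here is a minimal Python sketch that estimates a standard error by resampling rows with replacement. It is a simplified stand-in, not the procedure JMP runs: in SEM the whole model would be refit on each resample, whereas this sketch uses a sample mean as the statistic. The function name, its parameters, and the skewed Likert-style scores are all hypothetical.

```python
import random
from math import sqrt
from statistics import mean, stdev


def bootstrap_se(data, statistic, n_samples=1000, seed=0):
    """Estimate the standard error of `statistic` by resampling `data`."""
    rng = random.Random(seed)  # fixed seed so the result is reproducible
    n = len(data)
    estimates = []
    for _ in range(n_samples):
        # Draw n observations with replacement from the original sample.
        resample = [data[rng.randrange(n)] for _ in range(n)]
        estimates.append(statistic(resample))
    # The spread of the resampled estimates approximates the standard error.
    return stdev(estimates)


# Hypothetical right-skewed responses on a 1-5 scale (n = 100).
scores = [1] * 40 + [2] * 30 + [3] * 15 + [4] * 10 + [5] * 5

se_boot = bootstrap_se(scores, mean, n_samples=1000)
se_formula = stdev(scores) / sqrt(len(scores))  # textbook SE of the mean

print("bootstrap SE:", round(se_boot, 4))
print("formula SE:  ", round(se_formula, 4))
```

For the mean, the two numbers should land close together. The appeal of the bootstrap is that the same loop works for statistics with no simple formula, and it does not lean on a normality assumption. The `n_samples` argument plays the role of the user-set number of samples mentioned above.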
Hope this helps!
Haley
Re: Questions Regarding Normality Test and Advanced Statistical Analysis
Thank you very much for your help and guidance! I greatly appreciate your quick response and kindness in helping me understand the analysis steps. I will follow your advice and send images of each step's results as soon as possible.
I hope this information can be beneficial to many people. I also hope to communicate with you again after I send the results of each step. Thank you again for your help!
Best regards
Andy