Here is an example:
Generalized linear model with nominal response example
Data file: Example1Fp52.43_cH8.jmp
Expression of a crustacean homolog of human mitochondrial carrier protein Aralar1 was studied in 4 clones of Daphnia exposed to acute hypoxia (3 hypoxia and 3 control replicates). 4 clones x 2 treatments x 3 replicates = 24 rows.
First, a 2-way anova was done using RPKMs as the response variable and hypoxia and clone as factors:
Fit Model(
Y( :RPKM ),
Effects( :Clone, :acuteHypoxia, :Clone * :acuteHypoxia ),
Personality( Standard Least Squares ),
Emphasis( Minimal Report ),
Run(
:RPKM << {Lack of Fit( 0 ), Plot Actual by Predicted( 0 ),
Plot Regression( 0 ), Plot Residual by Predicted( 0 ),
Plot Effect Leverage( 0 )}
)
)
Which results in:
Response RPKM
Effect Tests
Source Nparm DF Sum of Squares F Ratio Prob > F
Clone 3 3 10222.900 3.6649 0.0349
acuteHypoxia 1 1 42430.209 45.6339 <.0001
Clone*acuteHypoxia 3 3 12083.132 4.3318 0.0205
OK, things are significant, but the effects of clone and the interaction will not ever survive any kind of multiple test correction (this is just one example of ~6K transcripts in the RNAseq data).
Then 24 dummy rows were created with the reads column containing the expected number of reads in case there is no effect of clones or hypoxia and also no difference among the 3 replicates in each combination of clones/hypoxia – i.e. the average number of reads in the actual 24 rows. Dummy rows were labeled “Exp” in the :ObsExp column and the original actual observations were labeled “Obs”. Now we want to know whether, and, if so, due to which independent variables do Obs differ from Exp. To achieve this I run (observe than now the raw :reads, not RPKM’s are used as the Freq’s:
Fit Model(
Freq( :reads ),
Y( :ObsExp ),
Effects(
:Clone,
:acuteHypoxia,
:acuteHypoxia * :Clone,
:rep[:Clone, :acuteHypoxia]
),
Personality( Nominal Logistic ),
Run( Likelihood Ratio Tests( 1 ), Wald Tests( 0 ) ),
SendToReport( Dispatch( {}, "Parameter Estimates", OutlineBox, {Close( 1 )} ) )
)
Which results, as expected in way, way more power:
Effect Likelihood Ratio Tests
Source Nparm DF L-R ChiSquare Prob>ChiSq
Clone 3 3 208.563245 <.0001
acuteHypoxia 1 1 114.216936 <.0001
acuteHypoxia*Clone 3 3 54.2042677 <.0001
rep[Clone,acuteHypoxia] 16 16 92.4764669 <.0001
Note: Exact same results are obtained using this:
Personality( Generalized Linear Model ),
GLM Distribution( Binomial ),
Link Function( Logit )