Thanks Lou,
I would expect the switch from Generalized Poisson to Poisson would yield more differentially expressed genes given that Generlaized Poisson is a "correction" of sorts due to overdispersion (more variance than "expected"). The filtering does surprise me a little, but as I think about it, there are fewer genes (fewer statistical tests), so the FDR correction (adjusted p-value) may not be as stringent and thus more genes called significant at that particular cuttoff.
As you have noted, the previous method gene lists are included in the next methods gene list (it is a subset of the method below). It is due to the progressive nature of filtering and corrections.
On a more practical note, if you are screening through a list and looking for subsets of genes to follow up on, the largest list is likely to include those genes that are on the boarder of being "statistically significant" for that p-value threshold and still might be of interest if the differences are large enough to be useful/meaningful. Meaning, something with a p-value of 0.08 (or 0.1) might not make the cutoff of 0.05, but have a 2 fold difference in expression which could change a cells behavior. It may be that there is too much unexplained variability between the samples to be very confident of that difference or that something unknown is competing with the experiment design (would need to take a look at the variation of expression within each group for that gene to know more).
I would definitely include that gene(s) that are on the border in my list of ones to investigate further if it can be afforded. All depends on the purpose of the experiment.
Chris Kirchberg, M.S.2
Data Scientist, Life Sciences - Global Technical Enablement
JMP Statistical Discovery, LLC. - Denver, CO
Tel: +1-919-531-9927 ▪ Mobile: +1-303-378-7419 ▪ E-mail: chris.kirchberg@jmp.com
www.jmp.com