Sunday, October 16, 2011

The unholly alliance between \(\phi_t\) and AIC

In the wild world of mark-recapture analysis, there is a well-worn path to publication.  Two of the key parameters are the survival and recapture probabilities in the marked population[1].  One of mile-markers is comparing two models for each parameter: either the parameter is constant, or it is different for each interval (survival) or occasion (recapture).  Sometimes an enlightened paper will skip this comparison, but there's really no guarantee.
Clearly, it's crucial to know whether the survival rate varies over time.

If you were looking at this question using hypothesis testing, you would take the following as your null hypothesis.

\[H_0 : \phi_1 = \phi_2 = ... = \phi_K
The alternative would be that each phi is independent at each time step or something in between.  If you're using hypothesis testing, this brings up, front and center, the problem that H_0 is crazy.  No ecologist would honestly accept that as a null hypothesis.  Survival varies over time, end of story.  I'm pretty sure that the reason this test comes up is that MARK has a button for it.

Besides the fact that the null hypothesis is crazy[2] (though related), small mark-recapture studies, and most are small, will lack the power to find important temporal variation in survival[3].  If biologists were willing to give their theory a little bit more weight in deciding the issue, it would be sensible to take \phi to have some distribution over time (a "random" effect) and just deal with the estimates.  Even if the estimates were nothing special, at least it would save the world from the umpteenth discussion of whether \phi varies over time.

To further muddy the waters, these days the comparison is carried out using AIC rather than hypothesis testing.  I like ranking a set of well-fitting models as well as anybody else, but in this case it just masks the fact that the study was too underpowered to even contemplate the comparison to begin with.  At least when a hypothesis-driven comparison is not significant most biologists know that you shouldn't make a big deal of it without a power analysis[4].  With a table of AIC values, an inappropriate comparison is harder to spot.  One indicator is a slew of "comparable" models which ought to remind the reader that the models are only "comparable" in light of the data on hand. 

Now back to that paper...

[1] Survival in the marked and unmarked populations can be quite different, I'm sure hawks love mice with neon marker on their back.
[2] Common, there's survival variation when you look within a single manufacturing lot of CPU's, forget frogs in the wild.
[3] I just read one in a reasonable journal where the 95% confidence intervals basically stretched from 0.5-0.95, and the _title_ of the paper relied on the statistical lack of difference.
[4] Through the hard work of generations of biostatisticians, most biologists will recognize that making a big deal from a lack of significance is questionable.

Thursday, October 13, 2011

The Shotgun Approach to Hypothesis Formation

It's like going into a dark basement in a zombie movie.  You know there are zombies down there, so you might as well start shooting.  Each shotgun blast illuminates more zombies for you, and the process continues.  This is why academics start on a topic and continue until it sucks all the life out of them.  To continue the analogy, if you die at your desk it's because you ran out of bullets.

Sunday, October 9, 2011

If ... then ...

If P then Q is only false if P and not Q, if not P, then whatever.  Who knew?

It does help to get these things straight from a blog post if you don't breathe math in your career.

Saturday, October 8, 2011


 system.time(while(sum(runif(10^6,0,10^(-316)) == 0) == 0 ) {})

My Apologies

My apologies to the audience.  As a graduate student in the middle of writing up paper #1, this blog is taking a back seat to the giant simulation and analysis-fest that is my life.  Further posting will resume when the beast is sent off to co-authors or XKCD #303 applies.