Article written by :: (RSS)

tigtog (aka Viv) is the founder of this blog. She lives in Sydney, Australia: husband, 2 kids, cat, house, garden, just enough wine-racks and (sigh) far too few bookshelves.

This author has written 3286 posts for Hoyden About Town. Read more about tigtog »

16 responses to “SIWOTI: Sagan’s ECREE maxim is NOT nonsense”

  1. orlando

    Phew. I enjoyed that.

  2. Mindy

    I anticipate sie will remain unconvinced.

    I think so. I suspect that sie is enjoying the hell out of themselves and hasn’t had this much fun for ages. I hope that you are enjoying your extreme nit pickiness as well. That is one thing that I really enjoy about TOD – you can come back the next day once you’ve thought up a really great comeback and stir it all up again. Unless some other bugger got there before you, but sometimes even then.

  3. Sally

    I wouldn’t call “evidence sufficient to reject the relevant null hypotheses at a statistical probability equal to or greater than the 95% confidence level” extrordinary evidence. This could still have you wrong one time in twenty. Considering how many extraordinary claims there are around you could be believing many strange things.

    I would consider the 95% confidence level useful evidence for ordinary claims but not convincing evidence of extraordinary claims.. Ordinary claims would be consistent with other scientific theories for which there is good evidence such as “eating oranges reduces the chances of getting colds” or “stretching before exercise prevents injury”. If you say you’ve built a machine that breaks conservation of energy it would need much more evidence than gave a 95% confidence interval to convince me.

    The likelihood of the initial “extraordinary claim” effects the assesment of the results of the statistical test. This is similar to the statistics behind positive predictive value of medical testing. Say you have a test for a rare disease that only affects one in a thousand people. Suppose the test gives you a false positive one time in 20 (equivalent to a 95% confidence interval). If someone tests positive then their chance of having the disease (equivalent to the chance that the claim is correct) is still only one in 50. The difficulty with this argument is in quantifying the likelihood of the initial claim.

  4. cim

    What I do find problematic about ECREE is that when it comes to social sciences the choice of null hypothesis is essentially arbitrary, and has in practice been a default/privileged perspective. So ECREE can be used as an excuse to completely ignore lived experience (correctly so, within its own terms, even as you expand and restate them) – and so you get experiments like the recent sexual proposition one to prove beyond all reasonable doubt something that was – from a non-privileged or even vaguely empathic standpoint – completely obvious.

    I’d argue that “fat and health” has ended up caught in the same sort of ECREE trap, in the medical sciences, as some bad but not caught-at-the-time research ends up setting an unjustified null hypothesis. The mass of the electron, I believe, ended up in much the same sort of trap for the physical sciences, but I can’t find the reference for that right now – I think I originally read it as a Feynman story, but I can’t remember which.

    (shorter: ECREE is great as long as you pick the right null hypothesis, but how do you tell when you’ve picked the wrong one?)

  5. cim

    tigtog: Oh, absolutely, that experiment was definitely a good demonstration of the scientific method, and a well-designed experiment. But – take fat and health as the example – the null hypothesis used right now is essentially that fat strongly causes bad health. There’s seen no need to question this hypothesis in a lot of papers, just as one wouldn’t generally start a physics experiment with a preliminary test that gravity was still working as expected. It’s “established science”.

    So experiments confirming the null hypothesis get subjected to generally less scrutiny over possible flaws in their methodology than ones that fail to confirm it. The ones confirming the null hypothesis are only making an “ordinary” claim (just as no-one would expect me to do a full scientific analysis if I claimed I wasn’t psychic, but would if I claimed that gravitational attraction is independent of mass)

    Ordinarily, this works quite well, but it breaks down when previous experiments have created a new null hypothesis that isn’t a real null (gravitational attraction does not exist) but is nevertheless less strongly supported by the evidence than a significantly different hypothesis. (It works fine for incremental – Newton to Einstein, for instance – improvements on hypotheses, of course, but not for complete changes)

  6. Sally

    tigtog,
    I may have taken the ECREE in a different context. I tend to view the 95% confidence interval published study as “ordinary evidence”, and less as not really evidence at all. Although some types of less formal evidence may still be stronger depending on the situation.

    I see ECREE being used to explain why I demand much higher levels of evidence for “your” bizarre claims than I would for “my” completely plausible ones. Which does seem rather arrogant unless there is a good objective definition of what an extraordinary claim is.

    Also cim I would have considered many of Einstein’s theories to be extraordinary claims rather than incremental changes, even though the changes to measured quantities in most cases were small. The confidence level on the accumulated evidence (with Newton as the null hypothesis) would be huge by now though.

  7. cim

    tigtog: Yes, you’re right – “underlying hypothesis” probably is much closer to what I mean. So I think there can be a problem with ECREE if the underlying hypotheses aren’t properly acknowledged, since it’s easy to treat experiments as requiring different levels of quality – and flaws in experimental methods as having different importance – depending on whether the experiment appears to confirm or deny current scientific thinking.

    Here’s the Feynman essay I was thinking of – about half way down he talks about Millikan and electron charge (not mass, as I misremembered), and how the assumption that the previous experiments were largely correct caused problems in actually getting to the correct answer
    http://www.lhup.edu/~DSIMANEK/cargocul.htm
    Unlike Feynman, I don’t believe for a second that scientists have stopped making that sort of mistake.

    And yes, more experiments of that sort would be good, definitely – but I think there’s a definite weakness of the current scientific process – no idea how to solve it – in that it would give those experiments far more scrutiny than one that reinforced the status quo thinking, despite the status quo not being one that you would get to nowadays if you started from scratch with the experiments.

The commenting period has expired for this post. If you wish to re-open the discussion, please do so in the latest Open Thread.