Accounting for Computer Scientists — Martin Kleppmann‘s blog
Accounting for Computer Scientists
Accounting for Computer Scientists
organisms don't come with a barcode on their foreheads telling us who they are related to. We have to try to figure out who they're related to, and when we understand the relationships, then we know the history, because the relationships define the history.
So we work with hypotheses about history, and we test these hypotheses against each other and try to come up with the one that's most consistent with the data that we've got.
Perhaps my more learned friends in stochastic processes can assist me in surveying these two qualitative descriptions of microevolution and macroevolution, and answer this question: under what conditions could one conclude the overall process of evolution to be non-ergodic?
Biological evolution has two big ideas. One of them has to do with how the process occurs, and that's called microevolution. It's evolution going on right now. Evolution is going on in your body right now. You've got about 10^13th bacteria in each gram of your feces, and they have enough mutations in them to cover the entire bacterial genome. Every time you flush the toilet, you flush an entire new set of information on bacterial genomes down the toilets. It's going on all the time. Now, the other major theme is macroevolution. This process of microevolution has created a history, and the history also constrains the process. The process has been going on for 3.8 billion years. It has created a history that had unique events in it, and things happened in that history that now constrain further microevolution going on today.
If the microevolutionary search is random, constrained by natural selection yet emancipated by neutral selection (allowing nature to massively parallelize the search for serendipitous discoveries), it appears to me that the total process would be non-ergodic.
Georg Feulner and Stefan Rahmstorf of the Potsdam Institute for Climate Impact Research used a global climate model to examine the effect of a Maunder-type minimum on global mean temperature by 2100. The model reproduced the cooling of past solar minima, but when simulating the future the authors found that the solar effect was overwhelmed by the much larger temperature increase due to greenhouse-gas emissions.
This article is a 2-paragraph snippet of the cited work that tries to assess the impact of a minimum in solar activity on global cooling, as happened in the 1600s (the Maunder minimum).
This snippet just tells me that a global climate model backtests the Maunder minimum of the 1600s and makes some prediction about the future---how is that at all a useful piece of information? We know that there are countless models, many of which make perfect sense, that backtest the past well but have nothing useful to say about the future!
(I don't know about the actual paper, beyond my a priori tepidity on climate prediction in light of how little those researchers have done to allay my fears of fat-tailed errors).
Even when “significance” is properly defined and P values are carefully calculated, statistical inference is plagued by many other problems. Chief among them is the “multiplicity” issue — the testing of many hypotheses simultaneously. When several drugs are tested at once, or a single drug is tested on several groups, chances of getting a statistically significant but false result rise rapidly. Experiments on altered gene activity in diseases may test 20,000 genes at once, for instance. Using a P value of .05, such studies could find 1,000 genes that appear to differ even if none are actually involved in the disease. Setting a higher threshold of statistical significance will eliminate some of those flukes, but only at the cost of eliminating truly changed genes from the list. In metabolic diseases such as diabetes, for example, many genes truly differ in activity, but the changes are so small that statistical tests will dismiss most as mere fluctuations. Of hundreds of genes that misbehave, standard stats might identify only one or two. Altering the threshold to nab 80 percent of the true culprits might produce a list of 13,000 genes — of which over 12,000 are actually innocent.
For me, as someone not trained by statisticians but who works with noisy engineering systems (radar, communication links and networks, etc.), this article presented me with a really valuable compendium of the most common ways scientific studies, especially in social and behavioral sciences, but also studies in medicine and genetics, fail and can be safely discarded.
Non-practitioners (non-noise-engineers and non-statisticians) may not realize it but statistics is actually a deeply philosophical human endeavor, and this makes a grave side-point: mathematics is really good at describing things, but whether those things exist in real life can only be answered by a human who is interested.
Anyway, the article discusses the arbitrariness and uselessness of P values, especially in today's era of large-scale data-gathering systems; mistaking statistical significance for practical significance; not weighing the probability of missed detections versus false alarms (ROC curves have huge importance in electrical engineering [1]); the fact that when you run thousands of trials, at least some will not be randomized by chance; the importance of replication and how rarely it's done in the social sciences.
All these points were very valuable to my understanding of this important topic. John Ioannidis published a 2005 paper, "Why most published research findings are false" [2] indicting many sub-fields of public health and medical research, that I am not mathematically mature enough to understand, and I'm hoping that the Science News article covers a lot of Ioannidis' topics in "simple" English.
All of this is very important to people who want to de-fragilize our society and reduce the impact of catastrophic errors of understanding and decrease the power that nerds hold over our lives---nerds being unimaginative, unreflective people who take facts handed to them by some higher authority as gospel and cannot imagine a world where they are untrue or inapplicable.
[1] ROC curve: http://en.wikipedia.org/wiki/Receiver_operating_characteristic
[2] Ioannidis (2005): http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1182327/
It saddens me to realize that for every thoughtful person, atheist or religious, there are ten of these unintrospective, ignorant, and usually journalistic types, who cannot imagine a world outside the fashion (idea) catalogs they take so much smug satisfaction in worshipping. From a discussion of the unmatched Samuel Butler:
"The discovery of Mendel’s Laws, and then DNA, finally put paid to the Lamarckian theory of evolution. It seemed the idea of a creature that could ‘will’ its own evolutionary direction was quite untenable. The genetic blue-print we pass on is the one we are born with and it operates quite independently of any use we make of it or any plans we may have for it." (http://www.threemonkeysonline.com/als/_samuel_butler_sociobiology.html)
This writer, so unfamiliar with the skeptical and ephemeral nature of knowledge (and even science), would no doubt just add epigenetics, and the willful expression of one of many myriad genetic capabilities, to their stupid worthless idea-catalog.
"The Iliad and Odyssey have been used as text-books for education during at least two thousand five hundred years, and yet it is only during the last forty or fifty that people have begun to see that they are by different authors. Can there be any more scathing satire upon the value of literary criticism?" (Samuel Butler, 1892, The Humour of Homer)
For many months I've grown increasingly dissatisfied with my
keyboard+papers+desk+monitor arrangement and it clicked when I saw
Mark's Daily Apple: http://www.marksdailyapple.com/standing-at-work/
... [But] imagine how hard it is to be looking at a collection of people waiting in line to be slaughtered thinking they are there for a Broadway show. And there is no point telling them---they would punish you.
(Taleb)
4) Artin's Algebra, or something like it
9) Abbot's Understanding Analysis, or something like it
8) Maybe: Cramer's Mathematical Methods of Statistics
10) Borges' Labyrinths, in the original Spanish
3) Victor Hugo's Les Miserables, in the original French
7) Dante's Inferno, in the original Italian
6) Homer's Iliad, in the original Homeric Greek
2)
5)
1)