Bonferroni校正

簡介

舉個例子：如要在同一數據集上檢驗兩個獨立的假設，顯著水平設為常見的0.05。此時用於檢驗該兩個假設應使用更嚴格的0.025。即0.05* (1/2)。該方法是由Carlo Emilio Bonferroni發展的，因此稱Bonferroni校正。

這樣做的理由是基於這樣一個事實：在同一數據集上進行多個假設的檢驗，每20個假設中就有一個可能純粹由於機率，而達到0.05的顯著水平。

維基百科原文

Bonferroni correction

Bonferroni correction states that if an experimenter is testing n independent hypotheses on a set of data, then the statistical significance level that should be used for each hypothesis separately is 1/n times what it would be if only one hypothesis were tested.

For example, to test two independent hypotheses on the same data at 0.05 significance level, instead of using a p value threshold of 0.05, one would use a stricter threshold of 0.025.

The Bonferroni correction is a safeguard against multiple tests of statistical significance on the same data, where 1 out of every 20 hypothesis-tests will appear to be significant at the α = 0.05 level purely due to chance. It was developed by Carlo Emilio Bonferroni.

A less restrictive criterion is the rough false discovery rate giving (3/4)0.05 = 0.0375 for n = 2 and (21/40)0.05 = 0.02625 for n = 20.

數據分析中常碰見多重檢驗問題(multiple testing).Benjamini於1995年提出一種方法，是假陽性的.在統計學上，這也就等價於控制FDR不能超過5%.

根據Benjamini在他的文章中所證明的定理，控制fdr的步驟實際上非常簡單。

設總共有m個候選基因，每個基因對應的p值從小到大排列分別是p(1),p(2),...,p(m),

The False Discovery Rate (FDR) of a set of predictions is the expected percent of false predictions in the set of predictions. For example if the algorithm returns 100 genes with a false discovery rate of .3 then we should expect 70 of them to be correct.

Bonferroni校正

簡介

維基百科原文

參考文獻

相關詞條

熱門詞條