E & Emin

Estimating the expected proportion of false confidence intervals

...

(Symbols)      Significant   Not significant   Total
-------------------------------------------------------------
Null true      F             m₀ - F            m₀
Alter. true    T             m₁ - T            m₁
-------------------------------------------------------------
Total          S             m - S             m

Symbols ( a is used instead of alpha)

m = large number of experiments (p-values)

m₀ = number of true null hypotheses in m

m₁ = number of true alternative hypotheses in m

m = m₀+m₁ ; m₁=m₁'+m₁'' +m₁'''....(etc.)

S = F+T = number of significant results (discoveries) at the level a₀ in m

F = number of false discoveries in S

T = number of true discoveries in S

a₀ = F/m₀ = significance level, i.e. proportion of false discoveries in m₀

a = probability that a 100×(1-a)-percent confidence interval (in m₀ or m₁) is false;

i.e. proportion of false 100×(1-a)-percent confidence intervals in m such intervals

f = T/m₁ = proportion of true discoveries in m₁

F = a₀×m₀ ; T = f × m₁

Q = F/S = actual (or nearly-exactly estimated) proportion of false discoveries in S

Qmg (= "Q-maximal-graphical") = estimate of Q obtained from a histogram

Qmax = estimate of Q obtained from known values m, S, a (as in my paper published in JASA, 1989 - see below: Ref. 1.)

E = proportion of false 100×(1-a)-percent confidence intervals in S intervals

Emax = calculated largest expected value of E

Emin = calculated smallest expected value of E
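The relations among the symbols above can be checked numerically. The following is only a minimal sketch: the function name expected_counts and the values m₀ = 800, m₁ = 200, a₀ = 0.01, f = 0.5 are hypothetical, chosen for illustration and not taken from any data.

```python
def expected_counts(m0, m1, a0, f):
    """Expected cell counts of the table above (all inputs are hypothetical)."""
    F = a0 * m0      # expected number of false discoveries: F = a0 * m0
    T = f * m1       # expected number of true discoveries:  T = f * m1
    S = F + T        # all significant results
    Q = F / S        # proportion of false discoveries in S
    return {"m": m0 + m1, "F": F, "T": T, "S": S, "Q": Q}


counts = expected_counts(m0=800, m1=200, a0=0.01, f=0.5)
print(counts)
```

With these assumed inputs the sketch gives F = 8, T = 100, S = 108 and Q of about 0.074.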

In a large known number (m) of experiments, a₀ and a are known, and the number (S) of significant results (those with p<a₀) is also known, because it can be counted. If Q can be estimated nearly exactly, the proportion (E) of false confidence intervals in S can also be calculated nearly exactly:

(See the derivations below.)

E = [QS + (S-QS)×a/f ] / S ............. (1)

In (1) a and S are known, Q is also known if it can be estimated with a satisfactory precision, and f can be calculated from the following formula:

f = (S-QS) / [m -(QS/a₀)] .............. (2)

where S, Q, m and a₀ are known.
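Formula (2) can be written as a small function. This is a sketch under stated assumptions: the name f_from_Q and the input values (S = 108, m = 1000, a₀ = 0.01, Q = 8/108) are hypothetical and serve only to show the arithmetic.

```python
def f_from_Q(Q, S, m, a0):
    """Formula (2): f = (S - Q*S) / [m - (Q*S / a0)]."""
    F = Q * S                # F = Q*S, number of false discoveries
    m1 = m - F / a0          # m1 = m - m0, where m0 = Q*S / a0
    if m1 <= 0:
        raise ValueError("Q*S/a0 must be smaller than m, otherwise m1 <= 0")
    return (S - F) / m1      # f = T / m1


f = f_from_Q(Q=8 / 108, S=108, m=1000, a0=0.01)
print(round(f, 6))
```

For these assumed values, m₀ = 8/0.01 = 800, m₁ = 200 and T = 100, so f comes out at 0.5.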

Derivation of (2):

Q = F/S ; QS = F = m₀×a₀; m₀ = QS/a₀

m₁×f = T = S-F = S - QS

f = T/m₁ = (S-QS)/m₁ where m₁= m-m₀ = m - (QS/a₀)

f = (S-QS) / [m - (QS/a₀)] ; this is formula (2).

Derivation of (1):

F false discoveries (significant at the level a₀<a) give F false 100×(1-a)-percent confidence intervals. The T true discoveries have T 100×(1-a)-percent confidence intervals, and these include m₁×a false confidence intervals. [Namely: m₁×a = (m₁' + m₁'' + m₁''' + ...)×a]

T=m₁×f ; T×a =m₁×f×a ; m₁×a = T×a/f

E = [F+ Ta/f] / S = [QS+ (S-QS)a/f] / S ; this is formula (1).

We insert f from (2) into (1) and calculate E.
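The substitution of (2) into (1) can be sketched as follows. The function name E_from_Q and the numbers used (Q = 8/108, S = 108, m = 1000, a₀ = 0.01, a = 0.05) are hypothetical; the code only repeats the two formulas step by step.

```python
def E_from_Q(Q, S, m, a0, a):
    """Insert f from formula (2) into formula (1) and return E."""
    F = Q * S                        # false discoveries
    T = S - F                        # true discoveries
    f = T / (m - F / a0)             # formula (2)
    return (F + T * a / f) / S       # formula (1)


E = E_from_Q(Q=8 / 108, S=108, m=1000, a0=0.01, a=0.05)
print(round(E, 4))
```

With these assumed inputs, f = 0.5 and T×a/f = 100×0.05/0.5 = 10, which equals m₁×a = 200×0.05, so E = (8 + 10)/108, about 0.167.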

I beg everybody to let me know whether anybody has published the above simple derivations, or the formulae (1) and (2) in any form, and where. (There are so many published papers which I have never seen, and I don't know where to look or whom to ask.)

Also, I beg you to tell me what mistakes I have made!

If we use Qmg as an estimate of Q in (1) and (2), the obtained values could, perhaps, be rather near to the actual values of E and f.

If we insert the value Qmax into (1) and (2), we obtain f = 1 and the corresponding low value of E.
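The statement about f = 1 can be illustrated numerically. The expression for Q used below is not quoted from Ref. 1; it is obtained here by setting f = 1 in (2) and solving for Q, on the assumption that this is the value called Qmax. The function names and the inputs (S = 108, m = 1000, a₀ = 0.01, a = 0.05) are hypothetical.

```python
def q_at_f_equal_one(S, m, a0):
    """Value of Q for which formula (2) gives f = 1 (assumed to be Qmax)."""
    # From S - Q*S = m - Q*S/a0 it follows that Q = a0*(m - S) / [S*(1 - a0)].
    return a0 * (m - S) / (S * (1 - a0))


def f_and_E(Q, S, m, a0, a):
    """Formulas (2) and (1) applied in sequence."""
    F = Q * S
    T = S - F
    f = T / (m - F / a0)             # formula (2)
    E = (F + T * a / f) / S          # formula (1)
    return f, E


Q = q_at_f_equal_one(S=108, m=1000, a0=0.01)
f, E = f_and_E(Q, S=108, m=1000, a0=0.01, a=0.05)
print(round(Q, 4), round(f, 6), round(E, 4))
```

For these assumed inputs Q is about 0.083, f comes out as 1 up to rounding, and E is about 0.129, because with f = 1 formula (1) reduces to E = Q + (1-Q)×a.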

In the special case where a₀ = a, we have:

E = (F+ Ta/f) / S = (m₀×a₀ + Ta/f) / S = (m₀×a + Ta/f) / S =

= (m₀×a + m₁×f×a/f) / S = (m₀×a + m₁×a) / S = a×(m₀+m₁) / S = ma / S

E = a×m / S ............. (4)

The latter formula (4) has been published in my paper (JASA; 1989) in this form:

E = a×n / r > Qmax ; where n and r stand for m and S, respectively (see below: Ref. 1.). In (4), m, S and a are known, and, if m is very large, we can simply and almost exactly calculate the actual value (E) of the proportion of false confidence intervals in S.
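Formula (4) can be compared with the general formula (1) in the case a₀ = a. This is a minimal sketch: the function names and the values m₀ = 800, m₁ = 200, a = a₀ = 0.05, f = 0.5 are hypothetical.

```python
def E_special(m, S, a):
    """Formula (4): E = a*m / S, valid when a0 = a."""
    return a * m / S


def E_general(F, T, S, a, f):
    """Formula (1) written with F and T: E = (F + T*a/f) / S."""
    return (F + T * a / f) / S


m0, m1, a, f = 800, 200, 0.05, 0.5   # hypothetical inputs, with a0 = a
F = a * m0                           # F = a0 * m0 = 40
T = f * m1                           # T = f * m1 = 100
S = F + T                            # S = 140

E4 = E_special(m0 + m1, S, a)
E1 = E_general(F, T, S, a, f)
print(round(E4, 4), round(E1, 4))
```

Both expressions give 50/140, about 0.357, for these assumed values, and the value of f cancels out, as in the derivation of (4).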

_______________________________________________________

REFERENCE:

1. Sorić, B. (1989). Statistical "discoveries" and effect-size estimation. J. Amer. Statist. Assoc., 84, 608-610. http://www.jstor.org/pss/2289950

--------------------------------------------------------------------------

I beg to be notified about any mistakes that may exist above!

branko.soric@zg.t-com.hr


----------------------------------------------

June - September, 2009

Branko Soric
