11 November 2021

outline

  • experimental design
  • statistical philosophy
  • statistical tests & assumptions
  • analysis platforms

experimental design

the most important thing

  • Design your experiment well and execute it well:
    then you needn’t worry too much in advance about statistics
  • Design or execute it badly and you’re doomed:
    statistics can’t save you
  • randomization, replication, control

randomization

  • random assignment to treatments
  • poorer alternative: haphazard assignment
    (“convenience sampling”)
  • stratification
    (i.e., randomize within groups)
  • related: experimental blinding

Yellowstone aspen regeneration (Brice et al. 2021)

replication

  • how big does your experiment need to be?
  • power: probability of detecting an effect of a particular size,
    if one exists
  • more generally: how much information? what kinds of mistakes? (Gelman and Carlin 2014)
  • underpowered studies
    • failure is likely
    • cheating is likely
    • significance filter \(\to\) biased estimates
  • overpowered studies waste time, lives, $$$
  • pseudoreplication (Hurlbert 1984; Davies et al. 2015):
    confounding sampling units with treatment units

power analysis

  • need to guess effect size and variability
    • minimum interesting biological effect size
    • previous studies
    • ask your supervisor
  • OK to simplify design (e.g. ANOVA \(\to\) \(t\)-test)
  • methods
apropos("^power")                            ## base-R power functions
library("sos"); findFn("{power analysis}")   ## search contributed packages

power analysis example

With \(n=15\) per group, are we likely to see a clear difference between 10% (control) and 20% (treatment) mortality (a doubling of mortality)?

power analysis example (continued)

power.prop.test(n=15,p1=0.1,p2=0.2)
## 
##      Two-sample comparison of proportions power calculation 
## 
##               n = 15
##              p1 = 0.1
##              p2 = 0.2
##       sig.level = 0.05
##           power = 0.1141268
##     alternative = two.sided
## 
## NOTE: n is number in *each* group

Uh-oh!
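The analytical result can be double-checked by simulation (a sketch, using `prop.test`, which applies a continuity correction, so the estimate may run slightly below the analytical value):

```r
## Simulate many experiments with n = 15 per group and count how often
## a two-sided test of proportions is significant at alpha = 0.05
set.seed(101)
nsim <- 2000
pvals <- replicate(nsim, {
  x1 <- rbinom(1, size = 15, prob = 0.1)   ## control deaths out of 15
  x2 <- rbinom(1, size = 15, prob = 0.2)   ## treatment deaths out of 15
  suppressWarnings(prop.test(c(x1, x2), c(15, 15))$p.value)
})
est_power <- mean(pvals < 0.05, na.rm = TRUE)  ## NaN when both counts are 0
est_power  ## low, consistent with power.prop.test
```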

How many samples per group would we need to get power=0.8?

power.prop.test(power=0.8,p1=0.1,p2=0.2,sig.level=0.05)
## 
##      Two-sample comparison of proportions power calculation 
## 
##               n = 198.9634
##              p1 = 0.1
##              p2 = 0.2
##       sig.level = 0.05
##           power = 0.8
##     alternative = two.sided
## 
## NOTE: n is number in *each* group

Uh-oh!

what should we do?

increasing power (control)

  • increase sample size (ugh)
  • increase rejection threshold (e.g. \(\alpha=0.1\))
    (probably can’t get away with this)
  • maximize desired variation (e.g. large doses)
  • measure a clearer outcome (gene expression, hormone levels, etc.)

  • minimize undesired variation
    • within-subjects designs
      (paired, randomized-block, crossover)
  • minimize environmental variation
    (e.g. environmental chambers; clonal or inbred lines)
  • isolate desired effects: positive/negative controls
    (vehicle-only, cage treatments, etc.)
  • control for variation statistically
    (e.g. include body size as a covariate)
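The payoff from minimizing undesired variation can be made concrete with `power.t.test` (hypothetical numbers: halving the residual SD cuts the required sample size roughly fourfold):

```r
## Same effect size (delta = 1), same target power, different residual SD
n_sd2 <- power.t.test(delta = 1, sd = 2, power = 0.8)$n  ## noisy design
n_sd1 <- power.t.test(delta = 1, sd = 1, power = 0.8)$n  ## controlled design
c(noisy = n_sd2, controlled = n_sd1)  ## per-group sample sizes
```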

statistical philosophy

the other most important thing

  • don’t snoop!
    (= don’t look at your results before deciding how to analyze them)
  • (but do explore your data graphically after deciding on a tentative analysis plan!)
  • reproducibility crisis: Ioannidis (2005), Simmons et al. (2011)
  • pre-register; think about what your questions are and how you will test them before you look at your data

fishing expeditions

“The Garden of Forking Paths” (Gelman and Loken 2014)

don’t lean on p-values too much

  • focus on effect sizes/CIs
  • eschew vacuous hypotheses
  • don’t accept the null hypothesis
  • “the difference between significant and non-significant is not significant” (Gelman et al. 2006)
  • statistical clarity (Dushoff et al. 2018)
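In R, reporting an effect size with its confidence interval takes no more work than reporting a p-value (a sketch using the built-in `cars` data):

```r
## Fit a simple linear model and report the slope with its 95% CI,
## rather than only its p-value
fit <- lm(dist ~ speed, data = cars)
coef(summary(fit))["speed", ]   ## estimate, std. error, t, p
confint(fit, "speed")           ## 95% confidence interval for the slope
```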

regression table (OK)

1978 automobile data (Chambers et al. 2018)

                     (1)
(Intercept)        8.010   (6.206)
mpg               -0.187 * (0.088)
trunk             -0.013   (0.105)
length             0.055   (0.036)
turn              -0.200   (0.140)
N                 74
R2                 0.251
logLik          -173.832
AIC              359.665
*** p < 0.001; ** p < 0.01; * p < 0.05.

coefficient plot (better!)
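A base-R sketch of such a plot (using the built-in `mtcars` data rather than the 1978 automobile data): estimates as points, 95% CIs as horizontal segments, with a reference line at zero.

```r
## Coefficient plot: estimates and 95% CIs instead of a table
fit <- lm(mpg ~ wt + hp + qsec, data = mtcars)
est <- coef(fit)[-1]                       ## drop the intercept
ci  <- confint(fit)[-1, , drop = FALSE]
plot(est, seq_along(est), xlim = range(ci), yaxt = "n",
     xlab = "estimate", ylab = "", pch = 16)
axis(2, at = seq_along(est), labels = names(est), las = 1)
segments(ci[, 1], seq_along(est), ci[, 2], seq_along(est))
abline(v = 0, lty = 2)                     ## reference line at zero effect
```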

statistical tests

assumptions

  • independence (hard to test!)
  • homogeneity of variance (vs. heteroscedasticity)
  • linearity
  • Normality (least important)
    • outliers; skew; “fat tails” (Student 1927)
    • distributional assumptions apply to the conditional distribution of the response variable

diagnostics

  • hypothesis tests are not generally appropriate:
    they answer the wrong question
  • graphical diagnostics
    • residuals plots (linearity, heteroscedasticity)
    • influence plots (outliers)
    • Q-Q plots (Normality)
    • Box-Cox plots (transformations)
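All of these diagnostics are one or two lines in base R (illustrated here on the built-in `cars` data; the Box-Cox profile requires the MASS package, which ships with R):

```r
## Standard graphical diagnostics for a fitted linear model
fit <- lm(dist ~ speed, data = cars)
par(mfrow = c(2, 2))
plot(fit)           ## residuals vs fitted, Q-Q, scale-location, leverage
MASS::boxcox(fit)   ## Box-Cox profile: suggests a power transformation
```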

(the data set)

diagnostics

diagnostics (performance pkg)

dealing with violations

  • drop outliers (report both analyses)
  • transform (e.g. log transform: Box-Cox analysis)
  • non-parametric (rank-based) tests
    (e.g. Mann-Whitney-Wilcoxon, Kruskal-Wallis)
  • relax assumptions/do fancier stats, e.g.
    • logistic regression (0/1 outcomes)
    • quadratic regression (nonlinearity)
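Two of these remedies side by side, on hypothetical skewed (log-normal) samples:

```r
## Skewed data: a t-test on the log scale vs. a rank-based alternative
set.seed(42)
a <- rlnorm(20, meanlog = 0)       ## "control" group
b <- rlnorm(20, meanlog = 0.5)     ## "treatment" group
p_log  <- t.test(log(a), log(b))$p.value   ## transform, then t-test
p_rank <- wilcox.test(a, b)$p.value        ## Mann-Whitney-Wilcoxon
c(log_t = p_log, wilcoxon = p_rank)
```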

what should you use?

  • try to connect scientific & statistical questions
  • data type
    • see decision tree or table
    • if your question doesn’t fit in this tree,
      think about how much you like statistics …
  • nonparametric stats
    • slight loss of power
    • stronger assumptions than you think
    • \(p\)-values only - no effect size

computational platforms

Criteria

  • simple/weak vs. complex/powerful
  • GUI vs command-line
  • default: use what your lab uses

Excel

  • ubiquitous
  • open alternatives (LibreOffice/OpenOffice)
  • data in plain sight
  • good enough for simple stuff
  • occasional traps (McCullough et al. (2008); date handling; etc.)
  • archive your data as CSV, not XLSX
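CSV round-trips cleanly between programs; in R (hypothetical file and column names):

```r
## Archive a data frame as plain-text CSV and read it back
dat <- data.frame(id = 1:3, mass = c(1.2, 3.4, 2.2))
write.csv(dat, "experiment1.csv", row.names = FALSE)
dat2 <- read.csv("experiment1.csv")
all.equal(dat, dat2)   ## TRUE: nothing lost in the round trip
```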

stats packages

  • SPSS, JMP …
  • more reliable than Excel
  • more powerful than Excel
  • point & click (mostly)

R

  • powerful; free & open
  • reproducible: script-based
  • hardest to learn
  • graphical interfaces: R Commander or Jamovi
  • great for data manipulation, graphics
    (once you learn how)

t-test (R)

x1 <- c(1.5, 2.5, 2.1)
x2 <- c(1.1, 1.4, 1.5)
t.test(x1, x2)
## 
##  Welch Two Sample t-test
## 
## data:  x1 and x2
## t = 2.226, df = 2.6648, p-value = 0.1236
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
##  -0.3759079  1.7759079
## sample estimates:
## mean of x mean of y 
##  2.033333  1.333333

t-test (Excel)

Further resources

References

Brice, EM et al. 2021. “Sampling bias exaggerates a textbook example of a trophic cascade.” Ecology Letters. doi:10.1111/ele.13915. https://onlinelibrary.wiley.com/doi/abs/10.1111/ele.13915.

Chambers, JM et al. 2018. Graphical Methods for Data Analysis. Chapman & Hall/CRC.

Davies, GM et al. 2015. “Don’t let spurious accusations of pseudoreplication limit our ability to learn from natural experiments (and other messy kinds of ecological monitoring).” Ecology and Evolution. doi:10.1002/ece3.1782. http://onlinelibrary.wiley.com/doi/10.1002/ece3.1782/abstract.

Dushoff, J et al. 2018. “I can see clearly now: Reinterpreting statistical significance.” arXiv preprint. https://arxiv.org/abs/1810.06387.

Gelman, A et al. 2014. “Beyond power calculations: Assessing Type S (sign) and Type M (magnitude) errors.” Perspectives on Psychological Science 9 (6): 641–651. doi:10.1177/1745691614551642. http://pps.sagepub.com/content/9/6/641.

———. 2006. “The difference between ‘significant’ and ‘not significant’ is not itself statistically significant.” The American Statistician 60 (4): 328–331. doi:10.1198/000313006X152649. http://www.tandfonline.com/doi/abs/10.1198/000313006X152649.

Hurlbert, SH. 1984. “Pseudoreplication and the design of ecological field experiments.” Ecological Monographs 54 (2): 187–211. doi:10.2307/1942661. http://www.esajournals.org/doi/abs/10.2307/1942661.

Ioannidis, JPA. 2005. “Why most published research findings are false.” PLoS Medicine 2 (8): e124. doi:10.1371/journal.pmed.0020124. http://dx.doi.org/10.1371/journal.pmed.0020124.

McCullough, BD et al. 2008. “On the accuracy of statistical procedures in Microsoft Excel 2007.” Computational Statistics & Data Analysis 52 (10): 4570–4578. doi:10.1016/j.csda.2008.03.004. http://www.sciencedirect.com/science/article/pii/S0167947308001606.

Simmons, JP et al. 2011. “False-positive psychology: Undisclosed flexibility in data collection and analysis allows presenting anything as significant.” Psychological Science 22 (11): 1359–1366. doi:10.1177/0956797611417632. http://pss.sagepub.com/content/22/11/1359.

Student. 1927. “Errors of routine analysis.” Biometrika 19 (1/2): 151–164. doi:10.2307/2332181. http://www.jstor.org/stable/2332181.