Prediction

set.seed(123) mean_obs <- c() for(i in 1:100){ obs = 2*12.5 + rnorm(10, mean = 0, sd = 2) mean_obs[i]<-mean(obs) } plot(x,y) abline(lmod, col = 'blue') points(rep(11.5, length(new_obs)), new_obs, col='red') points(rep(12.5, length(mean_obs)), mean_obs, col = 'green')

data(fat, package = 'faraway')
lmod <- lm(brozek ~ age + weight + height + neck + chest + 
             abdom + hip + thigh + knee + ankle + biceps + 
             forearm + wrist, data=fat)
summary(lmod)

## 
## Call:
## lm(formula = brozek ~ age + weight + height + neck + chest + 
##     abdom + hip + thigh + knee + ankle + biceps + forearm + wrist, 
##     data = fat)
## 
## Residuals:
##     Min      1Q  Median      3Q     Max 
## -10.264  -2.572  -0.097   2.898   9.327 
## 
## Coefficients:
##              Estimate Std. Error t value Pr(>|t|)    
## (Intercept) -15.29255   16.06992  -0.952  0.34225    
## age           0.05679    0.02996   1.895  0.05929 .  
## weight       -0.08031    0.04958  -1.620  0.10660    
## height       -0.06460    0.08893  -0.726  0.46830    
## neck         -0.43754    0.21533  -2.032  0.04327 *  
## chest        -0.02360    0.09184  -0.257  0.79740    
## abdom         0.88543    0.08008  11.057  < 2e-16 ***
## hip          -0.19842    0.13516  -1.468  0.14341    
## thigh         0.23190    0.13372   1.734  0.08418 .  
## knee         -0.01168    0.22414  -0.052  0.95850    
## ankle         0.16354    0.20514   0.797  0.42614    
## biceps        0.15280    0.15851   0.964  0.33605    
## forearm       0.43049    0.18445   2.334  0.02044 *  
## wrist        -1.47654    0.49552  -2.980  0.00318 ** 
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 3.988 on 238 degrees of freedom
## Multiple R-squared:  0.749,  Adjusted R-squared:  0.7353 
## F-statistic: 54.63 on 13 and 238 DF,  p-value: < 2.2e-16

## (Intercept) age weight height neck chest ## 1.00 43.00 176.50 70.00 38.00 99.65 ## abdom hip thigh knee ankle biceps ## 90.95 99.30 59.00 38.50 22.80 32.05 ## forearm wrist ## 28.70 18.30

## (Intercept) age weight height neck chest ## 1.000 67.000 225.650 74.500 41.845 116.340 ## abdom hip thigh knee ankle biceps ## 110.760 112.125 68.545 42.645 25.445 37.200 ## forearm wrist ## 31.745 19.800

The Natural Predictor

Confidence Intervals for Predictions

What we can predict?

What we can predict?

Variance of the Estimation Error for the Mean

Variance of the Prediction Error for a new response

CI for Mean Response

CI for New Response

Prediction Interval

Prediction VS Confidence

Simulated Example

Simulated Example

Simulated Example

Example - Body Fat

Response for the median values of the predictors

Response for the median values of the predictors

Response for the median values of the predictors

Construct PI and CI

Interpretation of CI

Interpretation of PI

Extrapolation

Measurements are at 95th percentile

Graphical (Simulated Data)

Graphical (Simulated Data)

Other Uncertainty

What Can Go Wrong with Predictions?

What Can Go Wrong with Predictions?

What Can Go Wrong with Predictions?