Inference in regression: F-test Jan 25 2019

Homework #1

Solutions on canvas

I graded the initial data analysis.

Homework #3

I’ve posted an example with some guidelines as 01-initial-data-analysis-report, but I started from 01-initial-data-analysis-draft.

Key things I’ll be looking for in HW #3:

Today

Motivation

t-tests on individual parameters only allow us to ask a limited number of questions.

To ask questions about more than one coefficient we need something more complicted.

F-tests do this by comparing nested models. In practice, the hard part is translating a scientific question in a comparison of two models.

F-test

Let Ω denote a larger model of interest with p parameters
and ω a smaller model that represents some simplification of Ω with q parameters.

Intuition: If both models “fit” as well as each other, we should prefer the simpler model, ω. If Ω shows substantially better fit than ω, that suggests the simplification is not justified.

How do we measure fit? What is substantially better fit?

F-statistic

F = \frac{(\RSS{\omega} - \RSS{\Omega})/(p - q)}{\RSS{\Omega}/(n - p)}

Null hypothesis: the simplification to Ω implied by the simpler model, ω.

Under the null hypothesis, the F-statistic has an F-distribution with pq and np degrees of freedom.

Leads to tests of the form: reject H0 for F>Fpq,np(α).

Deriving this fact is beyond this class (take Linear Models).

Example: Overall regression F-test

The overall regression F-test asks if any predictors are related to the response.

Full model: Y=Xβ+ϵ,ϵN(0,σ2I)
Reduced model: Y=β0+ϵ

Null hypothesis: H0:β1=β2==βp1=0
All the parameters (other than the intercept) are zero.

Alternative hypothesis: At least one parameter is non-zero.

Exercise: question #1 on handout

If there is evidence against the null hypothesis:

If there is no evidence against the null hypothesis:

Example: One predictor

Null hypothesis: βj=0

Equivalent to the t-test, reject null if |tj|=|βj^SE(βj^)|>tnpα/2

In fact, in this case, F=tj2.

Exercise: questions #2 & #3 on handout

Other examples

Exercise: questions #4 & #5 on handout

We can’t do F-tests when