Lucy D’Agostino McGowan
Assumption | What it means | How do you check? | How do you fix? |
---|---|---|---|
Linearity | The relationship between the outcome and explanatory variable or predictor is linear holding all other variables constant | Residuals vs. fits plot Marginal effects plots |
fit a better model (transformations, polynomial terms, more / different variables, etc.) |
Assumption | What it means | How do you check? | How do you fix? |
---|---|---|---|
Linearity | The relationship between the outcome and explanatory variable or predictor is linear holding all other variables constant | Residuals vs. fits plot Marginal effects plots |
fit a better model (transformations, polynomial terms, more / different variables, etc.) |
Assumption | What it means | How do you check? | How do you fix? |
---|---|---|---|
Linearity | The relationship between the outcome and explanatory variable or predictor is linear holding all other variables constant | Residuals vs. fits plot Marginal effects plots |
fit a better model (transformations, polynomial terms, more / different variables, etc.) |
Assumption | What it means | How do you check? | How do you fix? |
---|---|---|---|
Linearity | The relationship between the outcome and explanatory variable or predictor is linear holding all other variables constant | Residuals vs. fits plot Marginal effects plots |
fit a better model (transformations, polynomial terms, more / different variables, etc.) |
Zero Mean |
Assumption | What it means | How do you check? | How do you fix? |
---|---|---|---|
Linearity | The relationship between the outcome and explanatory variable or predictor is linear holding all other variables constant | Residuals vs. fits plot Marginal effects plots |
fit a better model (transformations, polynomial terms, more / different variables, etc.) |
Zero Mean | The error distribution is centered at zero | by default | – |
Constant Variance |
Assumption | What it means | How do you check? | How do you fix? |
---|---|---|---|
Linearity | The relationship between the outcome and explanatory variable or predictor is linear holding all other variables constant | Residuals vs. fits plot Marginal effects plots |
fit a better model (transformations, polynomial terms, more / different variables, etc.) |
Zero Mean | The error distribution is centered at zero | by default | – |
Constant Variance | The variability in the errors is the same for all values of the predictor variable | Residuals vs fits plot | fit a better model (try taking the log or square root of the outcome) |
Independence |
Try taking the log of the outcome.
Try taking the log of the outcome.
Call:
lm(formula = log(NumMDs) ~ NumHospitals, data = MetroHealth83)
Residuals:
Min 1Q Median 3Q Max
-1.24906 -0.55027 -0.03063 0.49255 1.11936
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 5.75892 0.10046 57.32 <2e-16 ***
NumHospitals 0.14499 0.01047 13.84 <2e-16 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 0.6054 on 81 degrees of freedom
Multiple R-squared: 0.7029, Adjusted R-squared: 0.6992
F-statistic: 191.6 on 1 and 81 DF, p-value: < 2.2e-16
How do you interpret \(\hat\beta_1\)?
Try taking the square root of the outcome.
Try taking the square root of the outcome.
Call:
lm(formula = sqrt(NumMDs) ~ NumHospitals, data = MetroHealth83)
Residuals:
Min 1Q Median 3Q Max
-18.086 -5.845 -2.030 7.001 17.994
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 14.0329 1.4686 9.555 6.36e-15 ***
NumHospitals 2.9148 0.1531 19.036 < 2e-16 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 8.85 on 81 degrees of freedom
Multiple R-squared: 0.8173, Adjusted R-squared: 0.8151
F-statistic: 362.4 on 1 and 81 DF, p-value: < 2.2e-16
How do you interpret \(\hat\beta_1\)?
Assumption | What it means | How do you check? | How do you fix? |
---|---|---|---|
Linearity | The relationship between the outcome and explanatory variable or predictor is linear holding all other variables constant | Residuals vs. fits plot Marginal effects plots |
fit a better model (transformations, polynomial terms, more / different variables, etc.) |
Zero Mean | The error distribution is centered at zero | by default | – |
Constant Variance | The variability in the errors is the same for all values of the predictor variable | Residuals vs fits plot | fit a better model (try taking the log or square root of the outcome) |
Independence | The errors are assumed to be independent from one another | 👀 data generation | Find better data or fit a fancier model |
Random |
Assumption | What it means | How do you check? | How do you fix? |
---|---|---|---|
Linearity | The relationship between the outcome and explanatory variable or predictor is linear holding all other variables constant | Residuals vs. fits plot Marginal effects plots |
fit a better model (transformations, polynomial terms, more / different variables, etc.) |
Zero Mean | The error distribution is centered at zero | by default | – |
Constant Variance | The variability in the errors is the same for all values of the predictor variable | Residuals vs fits plot | fit a better model (try taking the log or square root of the outcome) |
Independence | The errors are assumed to be independent from one another | 👀 data generation | Find better data or fit a fancier model |
Random | The data are obtained using a random process | 👀 data generation | Find better data |
Normality |
Assumption | What it means | How do you check? | How do you fix? |
---|---|---|---|
Linearity | The relationship between the outcome and explanatory variable or predictor is linear holding all other variables constant | Residuals vs. fits plot Marginal effects plots |
fit a better model (transformations, polynomial terms, more / different variables, etc.) |
Zero Mean | The error distribution is centered at zero | by default | – |
Constant Variance | The variability in the errors is the same for all values of the predictor variable | Residuals vs fits plot | fit a better model (try taking the log or square root of the outcome) |
Independence | The errors are assumed to be independent from one another | 👀 data generation | Find better data or fit a fancier model |
Random | The data are obtained using a random process | 👀 data generation | Find better data |
Normality | The random errors follow a normal distribution | QQ-plot / residual histogram | fit a better model |