Questions from last class?

ANOVA and $R^{2}$

Min	Median	Max	Mean	Std.Dev
95000	540000	1520000	559898.7	225448.1

Submit your response to the following question on Ed Discussion.

The $R^{2}$ of the model for price from area of houses in Duke Forest is 44.5%. Which of the following is the correct interpretation of this value?

Area correctly predicts 44.5% of price for houses in Duke Forest.
44.5% of the variability in price for houses in Duke Forest can be explained by area.
44.5% of the variability in area for houses in Duke Forest can be explained by price.
44.5% of the time price for houses in Duke Forest can be predicted by area.

Do you think this model is useful for explaining variability in the price of Duke Forest houses?

Use the rsq() function from the yardstick package (part of tidymodels)

rsq(duke_forest_aug, truth = price, estimate = .fitted)

# A tibble: 1 × 3
  .metric .estimator .estimate
  <chr>   <chr>          <dbl>
1 rsq     standard       0.445

Alternatively, use glance() to construct a single row summary of the model fit, including $R^{2}$ :

glance(duke_forest_fit)$r.squared

[1] 0.4451945