When the errors show a non-random pattern in a residual plot, that means that your model probably is not appropriate for linear regression. True or false and why?

Question

20-11-2020
Mathematics

contestada

When the errors show a non-random pattern in a residual plot, that means that your model probably is not appropriate for linear regression. True or false and why?

Respuesta :

Otras preguntas

Write an algebraic expression to represent the number of pens that can be bought with 30 cents if each pen costs c cents

Which of these statement is conclusive evidence that mixing sugar in water is a physical change? A). there is no change in color B).there is no precipitate fo

Explain the importance of graphs in interpreting data.

this is a citizen's right to travel freely between states.

if g is the variable which mathemtical sentence expresses the information below?

What is an aphorism? A. a short saying with a message B. a moral teaching for children C. a list of virtues for self-reflection D. a section of an autobiography

what would 6x + 14z - 3x be?

A pronghorn antelope can travel 105 miles in 3 hours. If it continued traveling at the same speed, how far could a pronghorn travel in 11 hours

"Which parts of this passage contain a biblical allusion? a.So lived the clansmen in cheer and revel a winsome life, till one began to fashion evils, that field

Healthy bodies can come in a variety of shapes and sizes. How would changes in the types of bodies used in advertisements can influence the health choices of th

jimthompson5910 jimthompson5910 · Answer 1 · 2020-11-20T21:26:20+01:00

Answer: True

The errors represent how far off the guess is to the true value. The points on the residual plot must be scattered randomly around 0, in both positive and negative regions. Ideally, the smaller the errors, the better the fit. So we want these errors to be as close to 0 as possible. This is why we try to minimize the sum of the squared error (SSE) when trying to find the linear regression equation. The r and r^2 value are related to this idea as well.

If we don't have randomly scattered errors, and some pattern shows up, then this means a linear equation is not a good fit. Another model such as a quadratic model may be the better option.

As for the "why" this works, try to think of a person throwing darts. Their accuracy isn't perfect so they'll likely miss on the left and right sides of the target. Stuff on the left is negative territory, while stuff on the right is positive. Each side is fairly equal assuming the thrower isn't biased in some way. That's why we have randomly scattered points in both regions. In the case of a regression line, that's where our guess goes while the actual data point is the observed value. The difference between the two is the error.