Question

What is the correlation coefficient and what is its significance? Explain why correlation coefficient is bound between -1 and 1. How correlation coefficient depicts itself in scatterplots? Aside from probabilistic models, explain what is least squares line fitting.

Bivariate numerical data
ANSWERED
asked 2021-02-01
What is the correlation coefficient and what is its significance? Explain why correlation coefficient is bound between -1 and 1. How correlation coefficient depicts itself in scatterplots? Aside from probabilistic models, explain what is least squares line fitting.

Answers (1)

2021-02-02
Step 1
Correlation:
Correlation a measure which indicates the “go-togetherness” of two data sets. It can be denoted as r. The value of correlation coefficient lies between –1 and +1. The positive 1 indicates that the two data sets are perfect and both are in same direction. The negative 1 indicates that the two data sets are perfect and both are in opposite direction. It will be zero when there is no relationship between the two data sets.
Correlation coefficient, r:
The Karl Pearson’s product-moment correlation coefficient or simply, the Pearson’s correlation coefficient is a measure of the strength of a linear association between two variables and is denoted by r or \(\displaystyle{r}_{{{x}{y}}}\).
The coefficient of correlation \(\displaystyle{r}_{{{x}{y}}}\) between two variables x and y for the bivariate data set \(\displaystyle{\left({x}_{{i}},{y}_{{i}}\right)}{f}{\quad\text{or}\quad}{i}={1},{2},{3}…{N}\) is given below:
\(\displaystyle{r}_{{{x}{y}}}=\frac{{{n}{\left(\sum{x}{y}\right)}-{\left(\sum{x}\right)}{\left(\sum{y}\right)}}}{{\sqrt{{{\left[{n}{\left(\sum{x}^{{2}}\right)}-{\left(\sum{x}^{{2}}\right)}\right]}\times{\left[{n}{\left(\sum{y}^{{2}}\right)}-{\left(\sum{y}^{{2}}\right)}\right]}}}}}\)
Step 2
Scatterplot and correlation:
A scatterplot is a type of data display that shows the relationship between two numerical variables. Each member of the data set gets plotted as a point whose (x, y) coordinates relates to its values for the two variables.
When the y variable tends to increase as the x variable increases, it can be said that there is a positive correlation between the variables. In other words, when the points on the scatterplot produce a lower left to upper right pattern, there is a positive correlation between the variables.
When the y variable tends to decrease as the x variable increases, it can be said that there is a negative correlation between the variables. In other words, when the points on the scatterplot produce an upper left to lower right pattern, there is a positive correlation between the variables.
When all the points on a scatterplot lie on a straight line, it can be said that there is a perfect correlation between the two variables.
A scatterplot in which the points do not have a linear trend (either positive or negative) is called a zero correlation or a near-zero correlation.
Form of the association between variables:
The form of the association describes whether the data points follow a linear pattern or some other complicated curves. For data if it appears that a line would do a reasonable job of summarizing the overall pattern in the data. Then, the association between two variables is linear.
Direction of association:
If the increase in the values of one variable increases the values of another variable, then the direction is positive. If the increase in the values of one variable decreases the values of another variable, then the direction is negative.
Strength of the association:
The association is said to be strong if all the points are close to the straight line. It is said to be weak if all points are far away from the straight line and it is said to be moderate if the data points are moderately close to straight line.
Step 3
Least squares line fitting:
Regression analysis estimates the relationship among variables. That is, it estimates the relationship between one dependent variable and one or more independent variables.
The general form of first-order regression model is \(\displaystyle{y}-\cap=\beta_{{0}}+\beta_{{1}}{x}+\epsilon\), Where, the variable y is the dependent variable that is to be modelled or predicted, the variable x is the independent variable that is used to predict the dependent variable, and ε is the error term.
The difference between of the observed value of y and predicted value of value of y is called as residual. Hence, the value of residual is represented as \(\displaystyle{y}–{\left({y}-\cap\right)}\).
If the sum of the squares of the residuals is expressed as smallest sum possible, then the straight line satisfies the least squares property. The regression line of the straight line satisfies the least-squares property then that "best'' fits the points in a scatterplot.
0
 
Best answer

expert advice

Need a better answer?

Relevant Questions

asked 2021-06-19
When two targets are presented close together in a rapid visual stream, the second target is often missed. Psychologists call this phenomenon the attentional blink (AB). A study published in Advances in Cognitive Psychology (July 2013) investigated whether simultaneous or preceding sounds could reduce AB. Twenty subjects were presented a rapid visual stream of symbols and letters on a computer screen and asked to identify the first and second letters (the targets). After several trials, the subject's AB magnitude was measured as the difference between the percentages of first target and second target letters correctly identified. Each subject performed the task under each of three conditions. In the Simultaneous condition, a sound (tone) was presented simultaneously with the second target; in the Alert condition, a sound was presented prior to the coming of the second target; and in the No-Tone condition, no sound was presented with the second target. Scatterplots of AB magnitude for each possible pair of conditions are shown below as well as the least squares line for each. a. Which pair of conditions produces the least squares line with the steepest estimated slope? b. Which pair of conditions produces the least squares line with the largest SSE? c. Which pair of conditions produces the least squares line with the smallest estímate of σ?
asked 2021-02-19
A 10 kg objectexperiences a horizontal force which causes it to accelerate at 5 \(\displaystyle\frac{{m}}{{s}^{{2}}}\), moving it a distance of 20 m, horizontally.How much work is done by the force?
A ball is connected to a rope and swung around in uniform circular motion.The tension in the rope is measured at 10 N and the radius of thecircle is 1 m. How much work is done in one revolution around the circle?
A 10 kg weight issuspended in the air by a strong cable. How much work is done, perunit time, in suspending the weight?
A 5 kg block is moved up a 30 degree incline by a force of 50 N, parallel to the incline. The coefficient of kinetic friction between the block and the incline is .25. How much work is done by the 50 N force in moving the block a distance of 10 meters? What is the total workdone on the block over the same distance?
What is the kinetic energy of a 2 kg ball that travels a distance of 50 metersin 5 seconds?
A ball is thrown vertically with a velocity of 25 m/s. How high does it go? What is its velocity when it reaches a height of 25 m?
A ball with enough speed can complete a vertical loop. With what speed must the ballenter the loop to complete a 2 m loop? (Keep in mind that the velocity of the ball is not constant throughout the loop).
asked 2021-04-20
A 1.3 kg toaster is not plugged in. The coefficient ofstatic friction between the toaster and a horizontal countertop is 0.35. To make the toaster start moving, you carelesslypull on its electric cord.
PART A: For the cord tension to beas small as possible, you should pull at what angle above thehorizontal?
PART B: With this angle, how largemust the tension be?
asked 2021-05-22
Sheila is in Ms. Cai's class . She noticed that the graph of the perimeter for the "dented square" in problem 3-61 was a line . "I wonder what the graph of its area looks like ," she said to her teammates .
a. Write an equation for the area of the "dented square" if xx represents the length of the large square and yy represents the area of the square.
b. On graph paper , graph the rule you found for the area in part (a). Why does a 1st−quadrant graph make sense for this situation? Are there other values of xx that cannot work in this situation? Be sure to include an indication of this on your graph, as necessary.
c. Explain to Sheila what the graph of the area looks like.
d. Use the graph to approximate xx when the area of the shape is 20 square units.
...