Lesson: Data Interpretation - 02

Correlation and Scatter plots

If two variables have a relationship such that when one variable changes, the other changes in a predictable way, the two variables are correlated. For example, there is a correlation between the number of hours an employee works each week and the amount of money the employee earns—the more hours the employee works, the more money the employee earns. Note that in this case, as the first variable increases, the second variable increases as well. These two variables are positively correlated.

Sometimes, when one variable increases, a second variable decreases. For example, the more that a store charges for a particular item, the fewer of that item will be sold. In this case, these two variables are negatively correlated.

Video Correlation and Scatter-plots

Sometimes two variables are not correlated, that is, a change in one variable does not affect the other variable in any way. For example, the number of cans of soda that a person drinks each day is not correlated with the amount of money the person earns.

One way to determine whether two variables are correlated or not is to sketch a scatter plot. A scatter plot is a graph on which the x–axis represents the value of one variable and the y–axis represents the value of the other variable. Several values of one variable and the corresponding values of the other variable are measured and plotted on the graph:

  • If the points appear to form a straight line, or are close to forming a straight line, then it is likely that the variables are correlated.
  • If the line has a positive slope (right to left), the variables are positively correlated.
  • If the line has a negative slope (left to right), the variables are negatively correlated.
  • If the points on the scatter plot seem to be located more or less at random, then it is likely that the variables are not correlated.

It is rare that the points on a scatter plot will all lie exactly on the same line. However, if there is a strong correlation, it is likely that there will be a line that could be drawn on the scatter plot that comes close to all of the points. Statisticians call the line that comes the closest to all of the points on a scatter plot the “line of best fit.” Without performing any computations, it is possible to visualize the location of the line of best fit, as the diagrams below show:

Next to display next topic in the chapter.

Test Prep Lessons With Video Lessons and Explained MCQ

Large number of solved practice MCQ with explanations. Video Lessons and 10 Fully explained Grand/Full Tests.

More Educational and Fun Stuff

More In-depth knowledge about what you need. Detailed about test preparation, English Writing, TOEFL, and IELTS. Education and your well being

Analytical Reasoning with Explained Questions
All in this Category