Subject: Business Statistics
As a statistical tool, correlation analysis is employed to determine the relationship between variables. It should be emphasized that one of the most popular statistical methods used by applied statisticians is correlation analysis.
Graphs having plots between two variables are called scatter plots. The values of one variable are stored on the X-axis, whereas the values of another variable are kept on the Y-axis. The diagram produced by displaying these pairs of X and Y data is called a scatter diagram. The two factors under consideration are said to be connected if the depicted dots demonstrate an upward or downward trend. There is a significant relationship between the dots if they are close together and show a trend of either growing or falling values.
Correaltion:
A statistical tool called correlation is used to assess how closely two or more variables are related.
Graphic method or scatter diagram method:
It is the most straightforward technique for examining correlation between two variables. This method maintains the values of one variable in the X-axis and another variable in the Y-axis. The diagram created by locating these pairs of X and Y values is referred to as a scatter diagram. The two factors under consideration are said to be connected if the depicted dots demonstrate an upward or downward trend. There is a significant relationship between the dots if they are closely spaced and show a trend of either growing or decreasing value.
Karl Pearson's correlation coefficient simply assesses the degree of linear relationship between two variables. Karl Pearson's correlation coefficient between X and Y is typically represented by the symbols rxy, r(X,Y), or just r. It is also known as a correlation, the simple correlation coefficient, or the product moment correlation coefficient. This is how it is explained:
r=\{\frac{COV(X,Y)}{\{sqrt{var(X) \}{sqrt{var(Y)}\}
where, COV(X,Y) is read as covarience between X and Y. This measures the simultraneous changes between two variables.
and COV(X,Y) = 1/n\{sum{(X - \{overline{X})\} }\} (Y -\{overline{Y}\})
=1/n\{sum{XY - \{overline{X}\} \{overline{Y}\}}\}
Example:
Calculate the coefficient of correlation for the following data:
X | 2 | 3 | 4 | 5 | 6 |
Y | 7 | 9 | 10 | 14 | 15 |
Solution:
X | Y |
x=X -\{\overline{X}\} (\{\overline{X}\}= 4) |
x2 |
y= Y -\{\overline{y}\} (\{\overline{Y}\}= 11 ) |
y2 | xy |
2 | 7 | -2 | 4 | -4 | 16 | 8 |
3 | 9 | -1 | 1 |
-2 |
4 | 2 |
4 | 10 | 0 | 0 | -1 | 1 | 0 |
5 | 14 | 1 | 1 | 3 | 9 | 3 |
6 | 15 | 2 | 4 | 4 | 16 | 8 |
\{\sum{X}\}=20 | \{\sum{Y}\}=55 | \{\sum{x}\}=0 | \{\sum{x2}\}=10 | \{\sum{y}\}=0 | \{\sum{y2}\}=46 | \{\sum{xy}\}=21 |
we have,\{\overline{X}\} =\{\frac{\{\sum{X}}{n}\} =\{\frac{20}{5}\} = 4
\{\overline{Y}\} =\{\frac{\{\sum{Y}}{n}\} =\{\frac{55}{5}\= 11
now correlation coefficient, r =\{\frac{\{sum{xy}\} }{ \{sqrt{x2}\} \{sqrt{y2}\} }\} = \{\frac{21}{\{sqrt{10}\} \{sqrt{46}\} }\} =0.98
therefore, r= 0.98. this shows that there is almost perfect positive correlation between X and Y.
References
Chaudary, A.K. (2061).Business statistics. kathmandu:Bhundipuran Prakshan
Dhakal Bashanta (2014).Business Statistics,Buddha academic publisher
© 2021 Saralmind. All Rights Reserved.