DaveInSanFran
New member
- Joined
- Aug 22, 2011
- Messages
- 4
New to statistics. Trying to assist wife with coursework. We submitted an answer we know to be incorrect, but just couldn't determine what we did wrong.
Data set was something like this:
absences grade
0 93
1 92
2 87
3 78
4 61
5 50
to determine correlation coefficient, we created a table to compute sum, x sum x^2, sum y, sum y^2, sum xy.
We used a simplified Pearson product: SS(xy)/sqrt[SS(x) SS(y)] An online solver gave us a negative correlation around -0.9, but our calculations equaled over 13. Is it possible that the data set produces some numerical instability? In particular, I am concerned about the 0,93 datapoint and the calculation of XY = 0.
We repeated the calculations 3 times using the textbook formulas but could not get a negative correlation coeffient between 0 and -1. (Sorry, I don't have the work available here.) do you see any obvious flaws in this dataset?
Data set was something like this:
absences grade
0 93
1 92
2 87
3 78
4 61
5 50
to determine correlation coefficient, we created a table to compute sum, x sum x^2, sum y, sum y^2, sum xy.
We used a simplified Pearson product: SS(xy)/sqrt[SS(x) SS(y)] An online solver gave us a negative correlation around -0.9, but our calculations equaled over 13. Is it possible that the data set produces some numerical instability? In particular, I am concerned about the 0,93 datapoint and the calculation of XY = 0.
We repeated the calculations 3 times using the textbook formulas but could not get a negative correlation coeffient between 0 and -1. (Sorry, I don't have the work available here.) do you see any obvious flaws in this dataset?