Graduate Statistics Topic 5: Benchmark: Correlation and Regression Project

PSY-520 Graduate Statistics

Topic 5 – Benchmark – Correlation and Regression Project

Directions: Use the following information to complete the questions below. While APA format is not required for the body of this assignment, solid academic writing is expected, and documentation of sources should be presented using APA formatting guidelines, which can be found in the APA Style Guide, located in the Student Success Center.

Player #   Age (x)   Batting Average (y)   XY      X^2    Y^2
1          26        338                   8788    676    114244
2          24        318                   7632    576    101124
3          33        318                   10494   1089   101124
4          33        316                   10428   1089   99856
5          25        315                   7875    625    99225
6          41        315                   12915   1681   99225
7          24        312                   7488    576    97344
8          29        307                   8903    841    94249
9          34        304                   10336   1156   92416
10         28        302                   8456    784    91204
11         28        301                   8428    784    90601
12         37        300                   11100   1369   90000
13         34        298                   10132   1156   88804
14         32        296                   9472    1024   87616
15         39        295                   11505   1521   87025
16         24        294                   7056    576    86436
17         24        294                   7056    576    86436
18         30        293                   8790    900    85849
19         38        289                   10982   1444   83521
20         34        288                   9792    1156   82944
21         36        287                   10332   1296   82369
22         32        287                   9184    1024   82369
23         33        286                   9438    1089   81796
24         31        285                   8835    961    81225
25         28        284                   7952    784    80656
26         31        284                   8804    961    80656
27         29        278                   8062    841    77284
28         26        276                   7176    676    76176
29         29        275                   7975    841    75625
30         22        274                   6028    484    75076
31         24        274                   6576    576    75076
32         31        273                   8463    961    74529
33         23        271                   6233    529    73441
34         29        271                   7859    841    73441
35         26        270                   7020    676    72900
36         29        268                   7772    841    71824
37         37        268                   9916    1369   71824
38         26        267                   6942    676    71289
39         25        267                   6675    625    71289

Totals: ∑x = 1164, ∑y = 11338, ∑xy = 338870, ∑x^2 = 35650, ∑y^2 = 3308088; r = .144
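
The column sums and the correlation can be recomputed directly from the 39 (age, batting average) pairs. The following is a minimal Python sketch using the raw-score formula for Pearson's r. The names DATA, raw_score_sums, and pearson_r are illustrative and not part of the assignment, and player 13's batting average is assumed to be 298, the value implied by that row's XY = 10132 and Y^2 = 88804 entries.

```python
import math

# (age, batting average) for players 1-39, in table order.
# Assumption: player 13's batting average is 298 (implied by XY and Y^2).
DATA = [
    (26, 338), (24, 318), (33, 318), (33, 316), (25, 315), (41, 315),
    (24, 312), (29, 307), (34, 304), (28, 302), (28, 301), (37, 300),
    (34, 298), (32, 296), (39, 295), (24, 294), (24, 294), (30, 293),
    (38, 289), (34, 288), (36, 287), (32, 287), (33, 286), (31, 285),
    (28, 284), (31, 284), (29, 278), (26, 276), (29, 275), (22, 274),
    (24, 274), (31, 273), (23, 271), (29, 271), (26, 270), (29, 268),
    (37, 268), (26, 267), (25, 267),
]


def raw_score_sums(pairs):
    """Return n, sum(x), sum(y), sum(xy), sum(x^2), sum(y^2)."""
    n = len(pairs)
    sum_x = sum(x for x, _ in pairs)
    sum_y = sum(y for _, y in pairs)
    sum_xy = sum(x * y for x, y in pairs)
    sum_x2 = sum(x * x for x, _ in pairs)
    sum_y2 = sum(y * y for _, y in pairs)
    return n, sum_x, sum_y, sum_xy, sum_x2, sum_y2


def pearson_r(pairs):
    """Pearson's r from the raw-score (computational) formula."""
    n, sx, sy, sxy, sx2, sy2 = raw_score_sums(pairs)
    ss_x = sx2 - sx ** 2 / n      # sum of squares for x
    ss_y = sy2 - sy ** 2 / n      # sum of squares for y
    sp_xy = sxy - sx * sy / n     # sum of cross-products
    return sp_xy / math.sqrt(ss_x * ss_y)


r = pearson_r(DATA)
print("sums:", raw_score_sums(DATA))
print("r =", round(r, 3), " r^2 =", round(r ** 2, 3))
```

With these inputs the sketch gives r of about .144 and r squared of about .021, in line with the standardized Beta for age reported in the regression output.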
      
Coefficients
Model          Unstandardized B   Std. Error   Standardized Beta   t        Sig.
1 (Constant)   275.146            17.817                           15.443   .000
  Age          .522               .589         .144                .885     .382
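
The coefficients in this output can be reproduced by hand from the summary sums. The sketch below is one way to do that in Python, assuming the sums recomputed from the 39 data rows (n = 39, Σx = 1164, Σy = 11338, Σxy = 338870, Σx^2 = 35650, Σy^2 = 3308088). The function name and the critical value of 2.026 (an approximate two-tailed .05 cutoff for df = 37 taken from a t table) are assumptions introduced for illustration, not SPSS output.

```python
import math

# Summary sums recomputed from the 39 (age, batting average) rows.
N = 39
SUM_X, SUM_Y = 1164, 11338
SUM_XY, SUM_X2, SUM_Y2 = 338870, 35650, 3308088

# Assumption: approximate two-tailed .05 critical t for df = N - 2 = 37.
T_CRIT = 2.026


def simple_regression(n, sx, sy, sxy, sx2, sy2):
    """Least-squares line y = a + b*x with standard errors and t values."""
    ss_x = sx2 - sx ** 2 / n
    ss_y = sy2 - sy ** 2 / n
    sp_xy = sxy - sx * sy / n
    b = sp_xy / ss_x                        # unstandardized slope
    a = sy / n - b * sx / n                 # intercept (constant)
    mse = (ss_y - b * sp_xy) / (n - 2)      # residual mean square
    se_b = math.sqrt(mse / ss_x)
    se_a = math.sqrt(mse * (1 / n + (sx / n) ** 2 / ss_x))
    beta = sp_xy / math.sqrt(ss_x * ss_y)   # standardized slope (equals r)
    return {"a": a, "b": b, "se_a": se_a, "se_b": se_b,
            "t_a": a / se_a, "t_b": b / se_b, "beta": beta}


fit = simple_regression(N, SUM_X, SUM_Y, SUM_XY, SUM_X2, SUM_Y2)
for name, value in fit.items():
    print(f"{name}: {value:.3f}")
print("slope significant at .05?", abs(fit["t_b"]) > T_CRIT)
```

Under these assumptions the slope's t value falls well below the cutoff, which agrees with the reported Sig. of .382 for age.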

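-The R value of the linear model is 0.144, a weak positive correlation between age and batting average. R square is the proportion of the variation in the DV, batting averages, that is explained by age. Here R square = 0.144^2 ≈ 0.021, so age accounts for only about 2.1% of the variation in batting averages.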

  • Select at least three variables that you believe have a linear relationship.
    • Specify which variable is dependent and which are independent.
    • Independent variable: MLB players' ages
    • Dependent variable: Batting averages in 2016
    • 3rd variable: Player #, used to number the players sequentially to help eliminate outliers
  • Collect the data for these variables and describe your data collection technique and why it was appropriate as well as why the sample size was best.
  • -Data were collected from a website that listed the top 2016 MLB players by age and batting average. The sample comprised 39 players, representing the top batters of 2016 rather than the entire MLB player population.
    • Submit the data collected by submitting the SPSS data file with your submission.
    • Frequency Table:
  • Find the Correlation coefficient for each of the possible pairings of dependent and independent variables and describe the relationship in terms of strength and direction.
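  • -The constant (275.146) is the intercept, the predicted batting average (y) when age is 0. The Sig. value for the constant is .000 (p < .001), and the Sig. value for the IV, age, is .382. These are p-values, not Pearson's correlation coefficient (r); r for age and batting average is .144, a weak positive relationship. A Sig. value ranges from 0 to 1, and the smaller it is, the less likely it is that a correlation this large would be observed by chance alone. Because .382 is greater than .05, the correlation is not statistically significant.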
  • Find a linear model of the relationship between the three (or more) variables of interest.
Model Summary
Model   R      R Square   Adjusted R Square   Std. Error of the Estimate
1       .873   .762       .749                874.779

-A linear relationship is best represented by a straight regression line. Based on the data collected, the relationship between MLB players' ages and batting averages is positive but weak (r = .144), and it is not statistically significant (p = .382).
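
As a small illustration of reading the regression line, the sketch below plugs an age into the fitted equation ŷ = 275.146 + 0.522x taken from the coefficients output. The function name and the example ages are hypothetical, chosen only to show how little the predicted batting average moves across ages when the slope is this small.

```python
# Coefficients from the regression output (constant and age slope).
INTERCEPT = 275.146
SLOPE = 0.522


def predict_batting_average(age):
    """Predicted batting average (same units as the table) for a given age."""
    return INTERCEPT + SLOPE * age


# Hypothetical example ages within the observed range (22 to 41).
for age in (25, 30, 35):
    print(age, round(predict_batting_average(age), 1))
```

A ten-year difference in age changes the prediction by only about 5 points of batting average, which fits the weak relationship described above.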

  • Explain the validity of the model.




