Sign In
Not register? Register Now!
Pages:
2 pages/≈550 words
Sources:
Check Instructions
Style:
APA
Subject:
Mathematics & Economics
Type:
Statistics Project
Language:
English (U.S.)
Document:
MS Word
Date:
Total cost:
$ 10.37
Topic:

SCM Project 1. Lending Club is a lending company in San Franscisco

Statistics Project Instructions:

The Lending Club is a lending company based on San Franscisco, CA. They connect borrowers with investors through an online marketplace. They have provided a publically available data from 2007-2011. I have it under Project files. Please see the data with its data dictionary (This is in another separate excel sheet which explains what all the variables mean).
You just got an interview as an analyst for the Lending Club. The client wants you to analyze this big amount of information.
Start by making initial observations of the data. What types of variables are present? Is there anything that catches your eye? A good analyst checks the data carefully. See the Quartz Guide to Bad Data: https://github(dot)com/Quartz/bad-data-guide (链接到外部网站。)
Use at least two ways to summarize the qualitative data present in the data set with frequency distributions and the various graphs/charts we have used in the class for Chapter 2.
Do the same thing with the quantitative data present. These four ways should be different aspects from the data set. Interpret your results.
Pick two of the above graphs you chose and describe the shape of those distributions.
Why did you use the certain graphs you did? Are there any benefits over the other?
Now I want you to take two variables you think might be related. Create a scatterplot. Find the covariance, correlation and interpret the results.
For the 2 examples you chose on Step 4, give me the best central tendency measure you feel is right for the data sets. Then find their sample variances.
Create a box plot for me for one of the examples.
Depending on the distribution you get for Step 8, let me know where the limits of the observations lie within 2 standard deviations of the mean. What does this mean in relation to the variable?
Finally give me a summary of what you have discovered as a whole from this data set. You want the Lending Club to know that you are very interested in working with them. Give them something to think about.
Due October 11th online. Submit on Canvas. Send me whatever work you have done with Excel or any other tool you wish to use all in 1 document. Send me formulas/code used. DO NOT Handwrite the calculations. We will discuss tools in class.

Statistics Project Sample Content Preview:

SCM Project 1
Student’s Name
Institutional Affiliation
SCM Project 1
1 Start by making initial observations of the data. What types of variables are present? Is there anything that catches your eye?
The types of variables present in the data provided are the following with their respective examples:
* Quantitative-numerical, i.e. continuous and discrete. E.g., loan amount, funded amount, funded amount inventory, term, installment, annual income, etc.
* Qualitative-categorical i.e. nominal, ordinal, ratio and interval. E.g. grade, subgrade, employment title, sub-grade, home ownership, loan status, address state etc.
Surprisingly, the variables present in the data can be sub-grouped into either continuous or discrete if it’s numerical and nominal, ordinal, ratio or interval if it’s qualitative. Conclusively, the data has all variables.
2 Use at least two ways to summarize the qualitative data present in the data set with frequency distributions and the various graphs/charts we have used in the class for Chapter 2.
Two ways to summarize the qualitative data are by use of frequency bar graphs and a regular pie chart. The pie chart and bar plot depicts the variable ‘grades’ in the data.
3 Do the same thing with the quantitative data present. These four ways should contain different aspects of the data set. Interpret your results.
In summary, the quantitative data bar plots and histograms are in use. A bar plot of the ‘loan amount’ variable.
4 Pick two of the above graphs you chose and describe the shape of those distributions.
The two graphs; a histogram of loan amount variable and a bar plot of grades, can be used to describe the shape of a distribution. The histogram shows a positively skewed or right-skewed distribution similar to the bar plot’s non-symmetric distribution, which can also be termed as positively skewed.
* Why did you use certain graphs?
From the two charts, one can tell the frequency labeled on the y-axis and see how the type of data spreads across the entire datasets.
* Are there any benefits over the other?
Besides, bar plots and histograms can visually elaborate on measures that describe a distribution, i.e. mode, minimum, maximum, etc. as compared to a pie chart.
5 Now I want you to take two variables you think might be related. Create a scatterplot. Find the covariance, correlation and interpret the results.
Chose loan amount...
Updated on
Get the Whole Paper!
Not exactly what you need?
Do you need a custom essay? Order right now:

👀 Other Visitors are Viewing These APA Statistics Project Samples:

HIRE A WRITER FROM $11.95 / PAGE
ORDER WITH 15% DISCOUNT!