Sign In
Not register? Register Now!
Pages:
2 pages/≈550 words
Sources:
Check Instructions
Style:
Other
Subject:
Mathematics & Economics
Type:
Statistics Project
Language:
English (U.S.)
Document:
MS Word
Date:
Total cost:
$ 10.37
Topic:

The Lending Club is a lending company based on San Franscisco, CA.

Statistics Project Instructions:

The Lending Club is a lending company based on San Franscisco, CA.
They connect borrowers with investors through an online marketplace. They have provided a publically available data from 2007-2011. I have it under Project files and it is available on here as well: https://www(dot)lendingclub(dot)com/info/download-data.action Please see the data with its data dictionary (This is in another separate excel sheet which explains what all the variables mean). You just got an interview as an analyst for the Lending Club. The client wants you to analyze this big amount of information.1. Start by making initial observations of the data. What types of variables are present? Is there anything that catches your eye? A good analyst checks the data carefully. See the Quartz Guide to Bad Data: https://github(dot)com/Quartz/bad-data-guide 2. Use at least two ways to summarize the qualitative data present in the data set with frequency distributions and the various graphs/charts we have used in the class for Chapter 2. 3. Do the same thing with the quantitative data present. These four ways should be different aspects from the data set. Interpret your results. 4. Pick two of the above graphs you chose and describe the shape of those distributions. 5. Why did you use the certain graphs you did? Are there any benefits over the other?6. Now I want you to take two variables you think might be related. Create a scatterplot. Find the covariance, correlation and interpret the results. 7. For the 2 examples you chose on Step 4, give me the best central tendency measure you feel is right for the data sets. Then find their sample variances.8. Create a box plot for me for one of the examples. 9. Depending on the distribution you get for Step 8, let me know where the limits of the observations lie within 2 standard deviations of the mean. What does this mean in relation to the variable?Finally give me a summary of what you have discovered as a whole from this data set. You want the Lending Club to know that you are very interested in working with them. Give them something to think about. DO NOT Handwrite the calculations. We will discuss tools in class.

Statistics Project Sample Content Preview:

Analyze This Big Amount of Information
Student’s Name
Institutional Affiliation
Analyze This Big Amount of Information
1 Start by making initial observations of the data. What types of variables are present? Is there anything that catches your eye?
The types of variables present in the data provided are the following with their respective examples:
* Quantitative-numerical, i.e., continuous and discrete. E.g. loan amount, funded amount, funded amount inventory, term, installment, annual income, etc.
* Qualitative-categorical i.e. nominal, ordinal, ratio and interval. E.g. grade, subgrade, employment title, sub-grade, home ownership, loan status, address state etc.
Surprisingly, the variables present in the data can be sub-grouped into either continuous or discrete if it’s numerical and nominal, ordinal, ratio, or interval if it’s qualitative. Conclusively, the data has all variables.
2 Use at least two ways to summarize the qualitative data present in the data set with frequency distributions and the various graphs/charts we have used in the class for Chapter 2.
Two ways to summarize the qualitative data are by use of frequency bar graphs and a regular pie chart. Below is a pie chart and bar plot of the variable ‘home ownership’ in the data.
3 Do the same thing with the quantitative data present. These four ways should be different aspects from the data set. Interpret your results.
To summarize, the quantitative data bar plots and histograms are used. Below are a bar plot and histogram of the ‘total recovered principal’ variable.
4 Pick two of the above graphs you chose and describe the shape of those distributions.
The two graphs, a histogram of total recovered principle variable and a bar plot of home proprietorship, can be used to describe the shape of a distribution. The histogram shows a positively skewed or right-skewed distribution similar to the bar plot’s non-symmetric distribution, which is negatively skewed.
* Why did you use certain graphs you did?
From the two charts, one can tell the frequency on the histogram’s y-axis and the height of the bars of the bar plot that show how each type of data is spread across the entire datasets.
* Are there any benefits over the other?
Additionally, bar plots and histograms can visually elaborate on measures that describe a distribution, i.e. mode, minimum, maximum, etc. as compared to a pie chart.
5 Now I want you to take two variables you think might be related. Create a scatterplot. Find the covariance, correlation, and interpret the results.
I chose the funded amount inventory and total payment inventory.
Covariance= 22432687
Correlation= 0.8190714
The covariance of both va...
Updated on
Get the Whole Paper!
Not exactly what you need?
Do you need a custom essay? Order right now:

👀 Other Visitors are Viewing These Other Statistics Project Samples:

HIRE A WRITER FROM $11.95 / PAGE
ORDER WITH 15% DISCOUNT!