Sign In
Not register? Register Now!
Pages:
2 pages/≈550 words
Sources:
Check Instructions
Style:
APA
Subject:
IT & Computer Science
Type:
Other (Not Listed)
Language:
English (U.S.)
Document:
MS Word
Date:
Total cost:
$ 12.15
Topic:

Statistics and Significance Testing

Other (Not Listed) Instructions:

DEAR WRITER, PLEASE offer your bid for this order!
1 . It is perfectly fine to slightly modify the code so that it runs on Python 3.x instead of 2.x. Please be sure to include a short note documenting the change you made to get it to run (it should be very minor). Do not use Jupyter notebooks, R or some other language. You must fill in the .py file to receive a grade.
3. For EVERY unique map reduce job, you have to clear out the global data structures! Otherwise, you're persisting information from one MR job to the other, and you will get wrong results. The following may help to clarify: (a) a map reduce JOB is run over a set of documents (the 'corpus') since you're doing word count over a corpus, (b) when computing the execution times of map and reduce, make sure to take this into account. Specifically, you will have to ADD (not average) the execution times for a mapper over the corpus (e.g., if the mapper takes 3 seconds to process doc_1, 2 seconds for doc_2 and 4 seconds for doc_3, the total time taken by the mapper is 9 seconds). You are, of course, generating corpora ten times for each document length (say 10). This will give you ten mapper and ten reducer times for that length. These ten numbers will (respectively) each be averaged into one, and these averaged numbers will be your 'data point' for that length, along with error bars. You can use any tool you want for visualization (Excel works well enough, but you could use matplotlib, or other external tools. The plots will be attached with your submission).
4. In your submission, try to be as organized as possible, but you should absolutely include all supporting data that you can, including plots and raw timing data that you used for the plots. Like real life, the HW doesn't have a single right or wrong answer. We are more interested in seeing the effort you made and good reasoning for decisions or assumptions you found yourself making in the context of the assignment.

Other (Not Listed) Sample Content Preview:

Statistics and Significance Testing
Name
Institution Affiliations
PART I
My primary goal in giving you this exercise is to reinforce the importance of basic statistics and significance testing. Feel free to look up any and every resource on the Web to brush up on what you need to know. We're aiming for conceptual clarity and application, not memorization.
Test subject ('student')

GPA (Fall 2018)

GPA (Spring 2019)

1
2
3
4
5
6
7
8
9
10

3.44481587332
3.40753716919
3.67967040671
3.49235971237
3.35806029563
3.59876412408
3.20857956506
3.49077194424
3.4864916754
3.39281679695

3.63235695644
3.2735320054
3.47597264719
3.45727477966
3.20981902735
3.56866681634
3.15388226146
3.56383533036
3.62542408166
3.00151409103

1 [10 points] The Dean wants to know if the average GPA of the students is at least 3.5, the default in previous semesters being that it was below 3.5. Using g to represent the sample GPA mean for fall and G to represent the population GPA mean, write down the Dean's null (H0) and alternate (Ha) hypothesis.
Answer
Null hypothesis
H0 : g= 3.5
Alternative hypothesis
H1: G≠ 3.5
2. [10 points] Assuming the population variance of the two semesters above are equal (but unknown), individually conduct the Student's t-test for each semester. In either semester, can you (individually) reject the null hypothesis?
An...
Updated on
Get the Whole Paper!
Not exactly what you need?
Do you need a custom essay? Order right now:

You Might Also Like Other Topics Related to standardized testing:

HIRE A WRITER FROM $11.95 / PAGE
ORDER WITH 15% DISCOUNT!