1
0

Update title and add initial text for week 9

This commit is contained in:
Jip J. Dekker 2018-05-04 10:25:21 +10:00
parent 2260c16ebc
commit fa2d73b624
No known key found for this signature in database
GPG Key ID: 9CB1B58997DD0D58
3 changed files with 56 additions and 3 deletions

View File

@ -13,7 +13,7 @@
\usepackage{bookmark}
\begin{document}
\title{Evidence and experiments}
\title{Week 7 - Evidence and experiments}
\author{
Jai Bheeman \and Kelvin Davis \and Jip J. Dekker \and Nelson Frew \and Tony
Silvestere

View File

@ -14,7 +14,7 @@
\usepackage{bookmark}
\begin{document}
\title{Evidence and experiments}
\title{Week 8 - Quantitative data analysis}
\author{
Jai Bheeman \and Kelvin Davis \and Jip J. Dekker \and Nelson Frew \and Tony
Silvestere

View File

@ -14,7 +14,7 @@
\usepackage{bookmark}
\begin{document}
\title{Evidence and experiments}
\title{Week 9 - Correlation and Regression}
\author{
Jai Bheeman \and Kelvin Davis \and Jip J. Dekker \and Nelson Frew \and Tony
Silvestere
@ -24,9 +24,62 @@
\section{Introduction} \label{sec:introduction}
\section{Method} \label{sec:method}
Provided with a set of 132 unique records of the top 200 male tennis players,
we sought to investigate the relationship between the height of particular
individuals with their respective weights. We conducted basic statistical
correlation analyses of the two variables with both Pearson's and Spearman's
correlation coefficients to achieve this. Further, to understand the
correlations more deeply, we carried out these correlation tests on the full
population of cleaned data (removed duplicates etc), alongside several random
samples and samples of ranking ranges within the top 200. To this end, we made
use of Microsoft Excel tools and functions of the Python library SciPy.
\section{Results} \label{sec:results}
We performed seperate statistical analyses on 10 different samples of the
population, as well as the population itself. This included 5 separate subsets
of the rankings (top 20 and 50, middle 20, bottom 20 and 50) and 5 seperate
randomly chosen samples of 20 players.
\\ \\
\Cref{tab:excel-results} shows the the results for the conducted tests.
\begin{table}[ht]
\centering
\begin{tabular}{|l|r|r|}
\hline
\textbf{Test Set} & \textbf{Pearson's Coefficient} & \textbf{Spearman's Coefficient} \\
\hline
\textbf{Population} & 0.77953 & 0.73925 \\
\textbf{Top 20} & 0.80743 & 0.80345 \\
\textbf{Middle 20} & 0.54134 & 0.36565 \\
\textbf{Bottom 20} & 0.84046 & 0.88172 \\
\textbf{Top 50} & 0.80072 & 0.78979 \\
\textbf{Bottom 50} & 0.84237 & 0.81355 \\
\textbf{Random Set \#1} & 0.84243 & 0.80237 \\
\textbf{Random Set \#2} & 0.56564 & 0.58714 \\
\textbf{Random Set \#3} & 0.59223 & 0.63662 \\
\textbf{Random Set \#4} & 0.65091 & 0.58471 \\
\textbf{Random Set \#5} & 0.86203 & 0.77832
\\ \hline
\end{tabular}
\caption{TODO: Insert better caption for this table. All data is rounded to 5 decimal
places}
\label{tab:excel-results}
\end{table}
\section{Discussion} \label{sec:discussion}
The results generally indicate that there is a fairly strong positive
correlation between the weight and weight of an individual tennis player,
within the top 200 male players. The population maintains a strong positive
correlation with both Pearson's and Spearman's correlation coefficients,
indicating that a relationship may exist. Our population samples show
promising consistency with this, with 6 seperate samples having values above
0.6 with both techniques. The sample taken from the middle 20 players,
however, shows a relatively weaker correlation compared with the top 20 and
middle 20, which provides some insight into the distribution of the strongest
correlated heights and weights amongst the rankings. All five random samples
of 20 taken from the population indicate however that there does appear to be
a consistent trend through the population, which corresponds accurately with
the coefficients on the general population.
\end{document}