8,153
edits
Changes
From Fellrnr.com, Running tips
Test
,no edit summary
* We have 876703 finishes
* We have 754851 runner
* We have 12438 runners who had both +ve and -ve splits
=Method=
The New York City and Chicago marathons use electronic timing that records the start, finish and half way times for runners. These races are reasonably flathave little elevation change; NYC has around 900 feet (300m) of ascent and descent, and Chicago is effectively flat. Data is publically available for the New York marathon from 2000 to 2011 and the Chicago marathon from 1998 to 2012. (The results of the 2007 Chicago marathon were excluded as that year the race was unusually hot.) The results were grouped within race location using name and age so that multiple finishes by the same individual could be compared. This gives data for 26 races, covering 876,703 results for 754,851 runners.
=Runners with multiple entries=
We have 72,259 runners with multiple entries, with the distribution shown below.
This is section looks at the distribution of splits. Each finish has its split percentage calculated based on the overall finish time and the half way time. The distribution is analyzed for all runners, then for different groups of runners based on their finish time.
==Distribution of All Splits==
This section looks at the distribution of splits for all finishes. Overall, 87% finishes were positive splits and 13% were negative splits. This ratio is similar way when the splits are divided by finish time, with the exception of those finishing in over five hours, where the percentage of negative splits is somewhat lower. However, the distribution of splits varies significantly with finish time. For , with the fastest runners the distribution is having a quite narrowdistribution, and the distribution widening with each successively slower group has a wider spread of splits. The peak of the curve is also at a more positive split value for slower groups. This suggests that faster runners tend to run a more even pace than slower runners.
{| class="wikitable"
|- valign="top"
|[[File:HistogramOfSplits_All_Partial_Finish_5.0_to_9.9.jpg|none|thumb|400px|The distribution of all splits for runners finishing in over 5:00.]]
|}
Below is a table of statistics for the breakdown of percentage splits, which give another view on the data. Again you can see that the faster runners have a narrower spread of positive, even splits (standard deviation) and negative their average splits by finish time. For this table a split is considered are closer to even if it is +/-0.5%.
{| class="wikitable"
! Finish Time
| 9.3
|}
Below is an alternative breakdown of splits into positive, even and negative splits by finish time. For this table a split is considered even if it is +/-0.5%.
{| class="wikitable"
! Finish Time
|}
==Distribution of Best Splits==
This section looks at the finishes of runners with more than one finish in a city. This allows us to look at which split percentage resulted in their best time. This is a much smaller subset of the results, including ~22% of the overall results (192,585 of 876,703). For this subset of finishes the elite runners (sub 2:30) have a higher percentage of negative splits, but runners finishing between 2:30 and 5:00 have a lower percentage of negative splits when compared with all finishes shown above. The most significant difference however is that the best splits have a much narrower spread. The ; that is the best splits are generally closer to even, with fewer finishes in either the extreme negative or positive range.
{| class="wikitable"
|- valign="top"
|[[File:HistogramOfSplits_Best_Partial_Finish_5.0_to_9.9.jpg|none|thumb|400px|The distribution of splits for runners finishing in over 5:00.]]
|}
{| class="wikitable"
! Finish Time
! Count
! Mean
! Interquartile Mean
! Median
! Standard Deviation
| 7.07
|}
Below is the alternative breakdown of splits into positive, even and negative splits by finish time.
{| class="wikitable"
! Finish Time
|}
==Distribution of Best Performance for Runners with Both Positive and Negative Split==
{| class="wikitable"
|- valign="top"
|[[File:HistogramOfBestSplits_Extreme_Partial_Finish_5.0_to_9.9.jpg|none|thumb|400px|The distribution of splits for runners finishing in over 5:00.]]
|}
Here's a tabular breakdown of the same data. <span style='color:#FF0000'>
{| class="wikitable"
! </span>Finish Time
! Negative Split
! Positive Split! Count ! Mean! Interquartile Mean! Median! Standard Deviation
|-
| 2:00 to 2:30
| 3769%| 2931%| 35
| 0.02
| -0.29
|-
| 2:30 to 3:00
| 2149%| 4351%| 190
| 0.69
| 0.32
|-
| 3:00 to 4:00
| 3044%| 5256%| 3,182 3182
| 1.71
| 1.03
|-
| 4:00 to 5:00
| 4153%| 4447%| 6,135 6135
| 1.64
| 0.48
|-
| over 5:00
| 5061%| 3739%| 2,883 2883
| 1.48
| -0.13
| 6.75
|-
| totalTotal| 4052%| 4448%| 12,425 12425
| 1.6
| 0.49
| 5.19
|}
{| class="wikitable"
! Finish Time
! Negative Split (<-0.5%)
! Even Split (-0.5% to 0.5%)! Positive Split (>0.5%)
|-
| 2:00 to 2:30
|}
=Variations by Age and Gender=
{| class="wikitable"
|- valign="top"
* The name provided on the race results for a given runner may vary depending on the use of nicknames or due to data entry errors.
=Table of Split Times=
This table shows sample split percentages for various finish times.
{| class="wikitable"
! Finish Time