nogilnick.
About
Blog
Plots















Analysis of the Preliminary 2020 Election Results

Sat, 14 Nov 2020

Data Science, Data Visualization, Election, Election2020, Statistics, Voting

The 2020 general election in the U.S.A. is interesting in several regards. Perhaps chief among these is the influence of the COVID-19 pandemic on voting patterns and the subsequent shift to more reliance on mail-in voting. This post presents analysis of the preliminary election results and attempts to explain some of the observed trends. Figure 1 compares the counts of votes received by the two leading candidates in 2016 and 2020. The total number of votes shared by the two in 2020 is roughly 148.2 million, up 20% (24.7 million) from 2016. At the same time, estimates constructed from the U.S. Census Bureau indicate the population grew by roughly 2% (7 million) during this same period. Assuming the number of population members that are eligible to vote is proportional to the size of the population itself, the percentage change in voter turnout is crudely approximated to be about 15%.

Figure 1: Popular Vote Results in 2016 and 2020

The distribution of candidate support by county is similar to that from the 2016 election. Additionally, the relationship between population size and party support is slightly stronger in the 2020 election than in the 2016 election. Namely, R²=0.26 in 2016 and R²=0.29 in 2020, an increase of 11.5%.

Figure 2: Candidate Support by County in 2020

Again, the charts in Figure 2 suggest that, in general, Republicans win a larger number of counties by wide margins. At the same time, from Figure 1, the Democratic candidate is favored in the popular vote with roughly 76.4 million votes compared to 71.8 million votes for the Republican. Taken in conjunction, this once again suggests that Democrats lead in counties with larger populations and this is corroborated by the modest that is observed.

A choropleth chart is created which compares voter turnout in the 2016 election to the preliminary results from 2020. The numbers are adjusted using approximate population estimates provided by the U.S. Census Bureau.

Figure 3: Change in Voter Turnout in (2016 to 2020)

Next, change in support is considered for the two leading parties: Republican and Democrat. The influence of party candidates in other parties is neglected such that x = (1 - y), where x is the ratio support for the Republican candidate and y is the support for the Democratic candidate.

Figure 4: Change in Party Support (2016 to 2020)

Figure 5 shows the counties that exhibit the greatest magnitude of change in support for either party. The color axis may be used to identify the direction of the change: Republican to Democrat or vice versa.

Figure 5: Top Moving Counties (2016 to 2020)

Next, counties that flipped party when compared with the result from 2016 are explored. For the sake of comparison, counties that did not change their party between the two elections are differentiated with lighter colors.

Figure 6: Counties that Changed Party Support (2016 to 2020)

In Figure 6, each county is colored with one of four colors which correspond to one of the following four statuses:
  • Dark Red: Democrat Majority (2016) and Republican Majority (2020)
  • Light Red: Republican Majority (2016) and Republican Majority (2020)
  • Light Blue: Democrat Majority (2016) and Democrat Majority (2020)
  • Dark Blue: Republican Majority (2016) and Democrat Majority (2020)
Sixty counties flipped from supporting the Republican candidate in 2016 to supporting the Democratic one in 2020 and 19 flipped the other way. In total, the Republican candidate won 83.5% (2625) of counties in 2016 and 82.2% (2584) in 2020, down 1.5% (41) counties from 2016.

Finally, predictive models are constructed which attempt to estimate support for each candidate using the data from the previous post. In this way, a model uses historical information about counties to predict support for a specific candidate in the 2020 election.

Figure 7: County Historical Factors Influencing the 2020 Election

Figure 7 shows the most influential factors in determining candidate support for Trump, support for Biden, and overall voter turnout. The results obtained from these models are similar to those from the previous post, though there are a number of differences. The "White or Asian" demographic category became more influential in driving voter turnout, suggesting that these groups were partially responsible for the increase in turnout in 2020. With respect to support for the Democratic candidate, county population and the percentage of residents with graduate degrees became relatively more important in 2020.