Figure 1: Mortality Records per Year
As seen in Figure 1, there is a dashed line that sharply falls in the year 1972. It is stated in some available documentation that only a sample of 50% of records were recorded for 1972, though no reason for this is provided. In later plots that display the raw counts of records, the counts for 1972 are multiplied by 2 to correct for this.Figure 2: Distribution of Yearly Change in Count
Next, Figure 3 shows the average recorded age by year over the 62 years of data. The average age at death from the records gradually increases over time from about 60 and 64 to about 70 and 77 for men and women respectively, an increase of 17% and 20% respectively.Figure 3: Average Recorded Age per Year
As the average can be skewed by untimely deaths, Figure 4 shows the three quartiles for age. In all quartiles, women tend to outlive men. In fact, during the 1990s, the median recorded age for women is less than two years below men at the 75th percentile.Figure 4: Quartiles of Recorded Age per Year
The next chart shows the percent of records over time having an underlying cause of death relating to neoplasms or circulatory disease. These two are very broad categories of disease, but are primarily dominated by cancer deaths for neoplasm and heart attacks, strokes, and other conditions relating to atherosclerosis for circulatory disease.Figure 5: Percent of Deaths Related to Circulatory Disease and Neoplasms
As seen in Figure 5, diseases of the circulatory system makeup a staggering 54% of deaths in 1959 and begin to steadily fall in the 1970s and onward. This reduction in deaths due to heart disease is attributed to a variety of factors including increased awareness, lifestyle modifications, new surgeries like bypass surgeries, new drugs, and others.Figure 6: Count of Deaths due to Lung and Related Cancers
Figure 6 shows the precipitous rise of deaths due to lung cancer over the period. The number increases from 35k in 1959 to a peak of almost 160k in 2005, an increase of over 350%!Figure 7: Top 10 Cancer Deaths in 1960, 1980, 2000, and 2019
Figure 7 shows the ten leading types of cancer represented in the records for four different years. Colon and breast cancer lead in 1960 but fall to 2nd and 3rd place in the later three plots.Figure 8: Records for Cancer of Pancreas, Colon, and Breast
One type of neoplastic disease which shows steady growth over the period is that of pancreatic cancer. In fact, in the most recent data, pancreatic cancer is the leading cause of cancer death, excluding lung cancer. Figure 9 shows the counts of cancer deaths involving the pancreas, colon, and breast. Colon and breast cancer rates have decreased somewhat in recent years, likely due to improvements screening and treatment options. However, pancreatic cancer rates have continued to grow largely unchecked, likely due to the relative lack of good treatment options for the disease.Figure 9: Rates of Infectious, Parasitic, and Digestive Diseases
As can be seen from the red line, the primary factor driving the increase in infectious diseases is the AIDs epidemic of the late 80s and early to mid-90s. Figure 10 shows the top 9 infectious disease categories for 1995.Figure 10: Top 9 Infectious Disease Categories for 1995
Diseases involving HIV dominate the results. Fortunately, the numbers begin declining in the years following 1995 and continue to decline into the present, likely due to increased awareness, prevention, and improved treatment options.Figure 11: Deaths due to Pneumonia, Influenza, and LRTI
There is a large spike of 92k deaths in the year 1998, the vast majority of which fall under the ICD-9 code 486: Pneumonia, organism unspecified. Also of note, the number of deaths falling into these categories looks unremarkable for the year 2020. Specifically, there is not a compensatory drop in pneumonia deaths.Figure 12: Records with Pneumonia, Circulatory and Provisional Codes
This is shown more explicitly in Figure 12. For the year of 2020, there is a massive spike in records under provisional codes (blue line), from 0 the previous year to 351k records. The specific provisional code used is U071, for Covid-19.Figure 13: Count of Additional Records in 2020 by ICD-10 Category
Figure 13 shows the breakdown of the additional records by their top-level ICD-10 letter. In addition to the U-codes, there are also large increases in records with codes for diseases of the circulatory, endocrine, and nervous systems. Brighter color bars indicate a larger standard deviation computed on changes from all previous years. The change in I-codes is about a 2-sigma move, while the change in G and X-codes are 4.3 and 8-sigma moves respectively. The magnitude of many of these changes suggest they are related to the pandemic in some way.