 Original Article
 Open Access
 Published:
Why life expectancy overpredicts crude death rate
Genus volume 79, Article number: 9 (2023)
Abstract
Life expectancy is widely used and reported; for example, the UN Human Development Index uses life expectancy as a key component. But many users of period life expectancy do not understand and interpret life expectancy as demographers do. In particular, period life expectancy almost always overpredicts the crude death rate. Indeed, in most observed populations, the annual deaths recorded are always less than one may expect in the corresponding stationary population. To explain this overprediction, in this paper we analyze how deviations from stationarity affect the crude death rate. We use theory to show that small deviations from the stationary age structure (as high as 20% at each age) always lead to overprediction. Then we examine global data to show that overprediction is widespread, occurring even in populations where the deviation from stationarity is large. Finally, we show that populations around the world and over many decades have age structures that are almost always far from stationary (or indeed, stability). But we also show that the deviation is often due to the demographic transition, with population bulges at the middle ages where mortality is low.
Introduction
Life expectancy is a widely reported statistic, for example by official governmental agencies every year, or various United Nations agencies such as the Human Development Program (United Nations Development Programme, 2022). However, a common (but incorrect) interpretation of period life expectancy \({e}_{0}\) is that it measures actual lifespan in that period’s population. In other words, the number of deaths in an observed population should be \(\left({\text{t}}\text{otal }{\text{p}}{\text{opulation}}\right)/{e}_{0}\). But in several countries, the actual number of deaths is much smaller than the ratio would predict. For example, in the United States with a population of 337 million and a life expectancy at birth of 77 years, we should expect an annual number of deaths of \(\left(337/77\right)=4.38\) million. But the actual number is about 3.28 million (Department of Economic United Nations & Population Division Social Affairs, 2022). Such a large overprediction is surprising (at least to us).
Mere inequality is of course no surprise to a demographer—as the latter would say, only in a "stationary" population is the average age at death given by \({e}_{0}\), and only in a stationary population do deaths equal \(\left({\text{t}}\text{otal }{\text{p}}{\text{opulation}}\right)/{e}_{0}\). That corresponding stationary population is of course defined by having zero growth and the mortality rates of an observed population. In an observed population, the number of deaths depends on the actual age structure, not on a hypothetical "stationary" age structure. One can readily imagine population structures where the number of deaths is much higher (e.g., a population with everyone over 70) or much lower (e.g., a population where everyone is between 15 and 25) than in a stationary population. But such examples are unrealistic and they also cut both ways, in the sense of implying either higher or lower actual deaths. Here we answer the following questions that grow out of the above observations. Why do we so easily find cases where the ratio \(\left({\text{t}}\text{otal }{\text{p}}{\text{opulation}}\right)/{e}_{0}\) is greater than the number of deaths? Is the overprediction typical, and if so, why? What do we find in a global sample, using the United Nations World Population Prospects (hereafter, WPP) (Department of Economic United Nations & Population Division Social Affairs, 2022), where data cover a wide range but data quality is uneven? What do we find in highquality data, as in the Human Mortality Database (hereafter HMD) (Berkeley USA University of California & Max Planck Institute For Germany, 2022)? Does the pattern of agestructures change in a common way across countries, and if so why?
It is obvious that crude death rate depends on the agepatterns of both mortality and population, but how? We derive an equation that reveals how these two factors determine the difference between the actual crude death rate and the stationary crude death rate. For small perturbations, we obtain a sensitivity for the crude death rate. Our sensitivity has a pleasing mathematical similarity to the famous Keyfitz (1977) and Demetrius (1978) "entropy"like sensitivity of life expectancy. As in the latter case (Vaupel, 1986), we show that our result reveals the agespecific sensitivity of crude death rate to change in the population age structure, and that the sensitivity differs in high and low mortality contexts. Then we use our analysis to show that small deviations in agestructure from stationarity always lead to overprediction. Our result suggests that the extent of overprediction depends on the size of the deviation from stationarity.
Next we explore crude death rates across many countries and time periods, using data from both HMD and WPP to compute the difference between crude death rate and \(\left(1/{e}_{0}\right)\). We find a striking global overprediction of the crude death rate. Then we use longterm Swedish data to provide a useful perspective on the crude death rate over a long period with distinct demographic regimes. Of course, overprediction does not always occur, and we show when underprediction occurs and discuss reasons why.
Finally, we explore the deviation from stationarity in populations across many countries and periods. We use data from the WPP and the HMD to show that the agestructural deviation from stationarity is usually small enough that overprediction will almost always be found. Then we examine more closely how the deviation from stationarity changes over time. Using examples, we argue that the demographic transition produced deviations in the form of population bulges that have always led to what we call overprediction. We also provide a longerterm perspective by examining agestructural change in Sweden from 1751 to 2021. We conclude with a brief discussion.
Crude death rate and age structure
Consider a population (in a specified year) in which individuals at age \(x\) have a death rate \(\mu \left(x\right)\) and a probability \(l\left(x\right)\) of surviving to at least age \(x\). We consider only females here; two sexes can be included with more algebra. Say that the population’s agestructure is described by a fraction \(u\left(x\right)\) of individuals at age \(x\). The percapita death rate is called (by demographers) the crude death rate, and is
We use a percapita rate so the results apply for any total population.
In a stationary population with the same agespecific mortality rates, the fraction \({u}_{s}\left(x\right)\) of individuals at age \(x\) is \(\left(l\left(x\right)/{e}_{0}\right)\), and the crude death rate is
(The above follows from (1), using \(l\left(0\right)=1\)). How do these two crude rates compare?
Sensitivity of crude death rate
Say that an observed population has a nonstationary age structure:
Then Eq. (1) implies that the crude death rate is
Here and later the lower limit on the integral is always zero; the upper limit may be taken as infinity, or some finite number that exceeds the largest age at death. The difference between the crude death rates in an actual and a corresponding stationary population is
This difference is a sensitivity when \(g\left(x\right)\) is small.
Take \(g\left(x\right)\) to be small enough so that we can take logarithms in (3) and expand to get:
This is of course only a linear expansion, but later numerical work shows that the approximation is reasonable for \(g\left(x\right)\) as large as say \(0.2\), which corresponds to a 20% deviation from stationary age structure. Using this expansion in (5), the sensitivity of the crude death rate is thus
As we next show, the above expression quantifies the intuition that the actual value of sensitivity depends on the age pattern of the deviations \(g\left(x\right)\).
Sensitivity and entropylike measures
The sensitivity (8) shows that the contribution of a deviation in the agestructure at age \(x\) is proportional to the ratio:
We use the minus sign because the logarithm is always negative. Thus a positive deviation at any age, \(g\left(x\right)>0\), will decrease \(\delta {C}_{D}\), and the converse.
There is a striking similarity between the sensitivity in (8) and the sensitivity of life expectancy to mortality change (Demetrius, 1978; Goldman & Lord, 1986; Keyfitz, 1977; Vaupel, 1986). In the latter, \(g\left(x\right)\) is a small fractional change in mortality rate at age \(x\), and the resulting change in life expectancy is
Thus the contribution of a change at age \(x\) to \(\delta {e}_{0}\) is proportional to the ratio
as discussed by Vaupel (1986). If we ignore the mortality factor in (9), the agepattern of the remaining contribution to \(\delta {C}_{D}\) is similar to that of \(\delta {e}_{0}\). As we now show, the mortality factor in (9) does not alter the essential pattern.
Figure 1 shows the agespecific sensitivity weights from Eq. (9) for Sweden in four different years (data from the HMD). In highmortality regimes (the earliest year, 1850) with high infant mortality and oldage mortality, the maximum value of sensitivity weight (0.013, 12.6% of the overall weight) occurs at age zero. With the decline in infant mortality (1950), the sensitivity weight at age zero drops dramatically and the maximum value shifts to age 80. As the mortality rate decreases further, the maximum value moves to age 87 and age 90 in 2000 and 2019, respectively. This figure supports demographic intuition, and shows that a decrease in the proportion of very young and/or old people will lead to a high positive \(\delta {C}_{D}\). But do we often see such an age pattern, or the opposite, and why?
Sensitivity and the Kullback–Leibler distance
We now focus on the agestructure of the population. In the definition of \(g\left(x\right)\) in (3), take logarithms on both sides to see that
Now insert this in (8) to get our first expression for sensitivity:
The integrand on the right measures the difference between the real agestructure \(u\left(x\right)\) and the stationary structure \({u}_{s}\left(x\right)\) for a given age \(x\). The overall departure from stationarity can be measured by the similar Kullback–Leibler distance (Kullback & Leibler, 1951):
The inequality on the right above is well known.
Given that mortality rates are bounded between some \(A>0\) and some \(B>0\), the sensitivity in Eq. (13) is bounded between two negative numbers:
Thus we have a key result: when a population has a structure slightly different from stationary, we always have an overestimation:
And from (14, 15), the overestimation should increase with the Kullback–Leibler distance.
Observed crude death rates
We have shown mathematically that overprediction is typical in a population with about 20% deviation from stationary. But the deviations in observed populations could be larger, or different. How typical is overprediction in populations across time and space?
We find, as shown in Fig. 2, that most populations display the pattern we expect: negative sensitivity and overprediction. The difference between the actual crude death rate (\({C}_{D}\)) and \(\left(1/{e}_{0}\right)\) is negative in most populations and years in the HMD (94.54%) and in the WPP (95.41%).
An interesting feature of this comparison (Fig. 2) is that underprediction (i.e., \({C}_{D}<\left(1/{e}_{0}\right)\)) typically occurs when life expectancy is very high or very low relative to the average. In countries with low \({e}_{0}\), we likely have limited or poor data on population at old ages. In countries with high \({e}_{0}\), we likely have populations with longterm low fertility and thus a historically unusual age structures, dominated by an increasing percentage of older individuals. Clearly, overprediction will not be seen in such populations.
Figure 3 reveals the longterm pattern of the sensitivity of crude death rate in Sweden. In the predemographic transition stage (until about 1850) we expect the population to fluctuate around stationarity, and indeed the difference between actual and stationary crude death rates fluctuates around 0. During the demographic transition (beginning in 1850 but most pronounced in the period 1900–1950) the differences become negative and decrease to a low around 1950 and then increase. Additional perspective on these changes is provided below, when we examine the changing pattern of age structure.
Changing age structures: patterns and causes
The deviation in agestructure is key to the crude death rate, but how far are populations from their corresponding stationary structures? To answer this question, we again use annual data from the HMD for 40 countries over a range of years (the longest period is 1751–2021 for Sweden). We also examined a larger set of data for the period 1950–2021 from the WPP for 126 countries and areas whose populations were over 5 million. Smaller countries were omitted as too sensitive to fluctuations.
Agestructural deviations
Recall that we compare a population’s actual age structure \(u\left(x\right)\), and the corresponding stationary structure \({u}_{s}\left(x\right)\). These yield the proportional deviations \(g\left(x\right)\) (defined in (3)). They also yield the observed crude death rate \({C}_{D}\) and the crude death rate \({C}_{Ds}\) for the corresponding stationary population with the same death rates.
Using the data sources above, we compute and present the frequency distribution of the \(g\left(x\right)\) values in Fig. 4. The peak values of \(g\left(x\right)\) occur near but slightly above 0, and most of values of \(g\left(x\right)\) are under 20% (about 8 in 10 for the WPP data, and about 9.5 in 10 for the HMD data). Hence agespecific deviations in observed populations from stationarity are often small enough that overprediction will be found.
Overall deviation
We used the Kullback–Leibler distance (\({K}_{u}\)) to measure the overall deviation in the age distribution relative to the stationary distribution. Figure 5 shows the frequency and distribution of the \({K}_{u}\) values. All the \({K}_{u}\) values observed here are (naturally) positive, with peaks at 0.02 and means at 0.04 (HMD) and 0.13 (WPP). Therefore, observed age distributions are almost always nonstationary. Given that WPP data cover a wider range and are of uneven quality, we expect and find that WPP data show a wider range than HMD data, ranging from 0 to 0.5.
Note that qualitatively similar results are obtained using an alternative indicator:
and are in the Appendix (Figures 10, 11, 12, 13).
Trends in overall deviation
To capture trends, we first present the \({K}_{u}\) values for three countries with highquality data in Europe, Asia, and North America from 1900 to 2020 (Fig. 6). Taking Italy (left panel) as an example, if we ignore the wide swings due to the 1918 influenza epidemic and the two World Wars (Glei et al., 2015), \({K}_{u}\) values rose from 1900 to 1950 and steadily declined thereafter, stabilizing at levels varying between 0.02 and 0.03 after 1975. Japan and the United States follow a similar trajectory, as do many other industrialized countries.
For a longterm perspective, consider Sweden (Fig. 7) with data starting in 1751. We ignore noise in the early historical data (especially for years preceding 1870–80) caused by lower data quality (Barbieri et al., 2015). In the years before the Industrial Revolution (1750–1850), age structure fluctuated with no systematic trend, as seen in the \({K}_{u}\) values in Sweden for that period. As industrialization took hold, and the demographic transition began, the distance \({K}_{u}\) first rose till about 1900, and then fell steadily until leveling off at a small value after about 2000.
Bulges in age distribution
The distance \({K}_{u}\) measures the deviation across the entire age distribution, but does not provide agespecific information (e.g., which distribution at a certain age is lower or higher). So here we compare the observed and stationary age distributions in a more detailed way by plotting their ratio.
Start with Denmark (Fig. 8, topleft panel). In the predemographic transition stage in which both birth and death rates are high (1850), the ratios decreased with age, falling below 1 at age 35, so the actual population was “younger” than the stationary population. In the early stage the demographic transition (1950) resulted in a bulge at ages 0–5. In later years that bulge travels to later ages, whereas there is a continuing reduction in birth rates and thus in the percentage as at young ages.
Those changing depressions and bulges are observed in many other industrialized countries, as seen in the figure. A striking common feature in those bulges is that they are concentrated in the age range 10–50, which typically is the lowest mortality range. In other words, there is a larger proportion of individuals with lower mortality in the observed population than that in the stationary population. This explains the overprediction.
In many countries, the above temporal patterns lead to a negative correlation between deviation and sensitivity, as shown in Fig. 9. What this means is that the difference between the actual crude death rate and the stationary value becomes more negative (and so larger in magnitude) as the distance \({K}_{u}\) decreases. A similar negative correlation is seen in for many years in the longterm data for Sweden. Focusing on the period 1900 to 1970, Fig. 3 shows that \({C}_{D}\) falls relative to the stationary \({C}_{Ds}\), while over the same period Fig. 7 shows that the distance \({K}_{u}\) decreases.
Finally here we note that the distance \({K}_{u}\) is the main driver of the crude death rate (as compared with shifts in the agepattern of mortality). Thus, there is relatively high discrimination of the deviation for distinguishing the mismatch pattern ("actual deaths > expected deaths" and "actual deaths < expected deaths"), as the corresponding area under the receiver operating characteristic curve (AUC) for \({K}_{u}\) is 0.74 in HMD and 0.7 in WPP data.
Stable vs. stationary populations
Much interest in demography has been devoted to stationary populations (Coale, 1972; Keyfitz, 1965). McCann (1976) analyzed crude death rate in stable populations growing at some constant rate, distinct from the stationary populations we consider here. Given that populations are rarely stable, and indeed the demographic transition means that growth rates must change with time, analyses based on stable growth are unlikely to be useful here. Even so, any period fertility and mortality leads to a growth rate and a purely hypothetical stable population. So we asked whether that stable agedistribution could be used instead of the stationary distribution. The answer was negative, but we present the comparison in the Appendix.
Discussion
The overprediction of the crude death rate in terms of life expectancy at birth is widely observed in practice and is puzzling for nondemographers and demographers alike. Here we explored this discrepancy in several ways. We showed analytically that whenever a population age distribution is close to stationary, the stationary population has many more deaths than the actual number. Secondly, our formulas measure precisely the agespecific sensitivity of the crude death rate. In this context, we mention also the more general results in the interesting papers by Aburto et al. (2020), Vaupel (2021) and Nigri et al. (2022).
We used a large data set and found that overprediction of crude death rate is widespread. The deviation between the observed and stationary populations is an important driver of the mismatch in the actual and expected number of deaths. We present both shortterm (decades) and longterm (centuries) of analysis suggesting that the demographic transition produced persistent bulges at ages where the mortality rates are lower. The difference between actual deaths and stationary deaths is mainly driven by the central age groups. This is interesting since this means that more people than we might expect (under stationarity or stability) arrive at adult age. These factors help explain why we find widespread overprediction of the actual number of deaths.
Availability of data and materials
Data are available in the World Population Prospects (https://population.un.org/wpp/), the Human Mortality Database (https://www.mortality.org/), and the Human Fertility Database (https://www.humanfertility.org/).
Abbreviations
 WPP:

World Population Prospects
 HMD:

Human Mortality Database
 AUC:

Area under the receiver operating characteristic curve
References
Aburto, J. M., Villavicencio, F., Basellini, U., Kjærgaard, S., & Vaupel, J. W. (2020). Dynamics of life expectancy and life span equality. Proceedings of the National Academy of Sciences., 117(10), 5250–5259.
Barbieri, M., Wilmoth, J. R., Shkolnikov, V. M., Glei, D., Jasilionis, D., Jdanov, D., Boe, C., Riffe, T., Grigoriev, P., & Winant, C. (2015). Data resource profile: The human mortality database (HMD). International Journal of Epidemiology., 44, 5.
Berkeley USA University of California and Max Planck Institute for Germany. (2022). Human Mortality Database. University of California, Berkeley (USA), and Max Planck Institute for Demographic Research (Germany). Available at www.mortality.org or www.humanmortality.de.
Coale, A. J. (1972). The growth and structure of human populations: A mathematical investigation. Princeton University Press.
Demetrius, L. (1978). Adaptive value, entropy and survivorship curves. Nature, 275, 5677.
Department of Economic United Nations and Population Division Social Affairs. (2022). World Population Prospects. United Nations, Department of Economic and Social Affairs, Population Division. Available at https://population.un.org/wpp/.
Glei, D. A., Borges, G., Riffe, T., Andreeva, M., & Menares, F. (2015). About mortality data for Italy. Human Mortality Database. Italy. Available at https://www.mortality.org/File/GetDocument/hmd.v6/ITA/Public/InputDB/ITAcom.pdf.
Goldman, N., & Lord, G. (1986). A new look at entropy and the life table. Demography, 23, 2.
Keyfitz, N. (1965). The intrinsic rate of natural increase and the dominant root of the projection matrix. Population Studies., 18, 3.
Keyfitz, N. (1977). Introduction to the mathematics of population with revisions. Menlo Park, CA: AddisonWesley Publishing Company.
Kullback, S., & Leibler, R. A. (1951). On information and sufficiency. The Annals of Mathematical Statistics., 22, 1.
Max Planck Institute for Germany and Vienna Institute of Demography Austria.(2022). Human Fertility Database, Max Planck Institute for Demographic Research (Germany) and Vienna Institute of Demography (Austria). Available at www.humanfertility.org.
McCann, J. C. (1976). A technique for estimating life expectancy with crude vital rates. Demography, 13, 2.
Nigri, A., Barbi, E., & Levantesi, S. (2022). The relationship between longevity and lifespan variation. Statistical Methods Applications., 31(3), 1–13.
United Nations Development Programme. (2022). Human Development Report 2021–22. United Nations Development Programme. Available at https://hdr.undp.org/system/files/documents/globalreportdocument/hdr202122pdf_1.pdf.
Vaupel, J. W. (1986). How change in agespecific mortality affects life expectancy. Population Studies., 40, 1.
Vaupel, J. W., Villavicencio, F., & BergeronBoucher, M. P. (2021). Demographic perspectives on the rise of longevity. Proceedings of the National Academy of Sciences, 118(9), e2019536118. https://doi.org/10.1073/pnas.2019536118
Acknowledgements
We thank the referees for careful reading and comments. We thank Rajesh Pant for drawing our attention to this issue and to the published commentary by Dilip D’Souza.
Funding
Open access funding provided by Università degli Studi di Roma La Sapienza within the CRUICARE Agreement. HL acknowledges support from the China Scholarship Council.
Author information
Authors and Affiliations
Contributions
HL performed the analysis, wrote the first draft, and edited the manuscript. ST developed the idea, guided the writing of the paper, and commented on and revised the manuscript. ZG performed some analysis and commented on the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
A different distance measure
We used the Kullback–Leibler distance in the main text. But as noted, qualitatively similar results are obtained using an alternative indicator:
and are given below.
Figure 10 shows the distribution of the distances. Distances over time are shown for 3 countries in Fig. 11. And the longterm pattern for Sweden is shown in Fig. 12.
Finally, Fig. 13 shows the overprediction as a function of the distance \({S}_{u}\) for 6 countries.
Stable populations
We have seen that populations are generally nonstationary and we know how that difference will affect the crude death rate. But of course, much formal demography focuses on stability rather than stationarity. So we asked whether populations were in fact closer to the stable structure implied by their fertility and mortality rates, rather than to the corresponding stationary structure. Using fertility as well as mortality yielded a corresponding stable population growth rate, and then a stable population structure \({u}_{s}^{*}\left(x\right)\). The distance from the actual population structure to that corresponding stable age distribution is
We compare the distances \({K}_{u}^{*}\) and \({K}_{u}\) for both HMD and WPP, in Fig. 14. For the HMD data, we used corresponding fertility data from the Human Fertility Database (Max Planck Institute for Germany & Vienna Institute of Demography Austria, 2022). As we expect both distances are usually nonzero, because actual populations are nonstable and nonstationary. For the HMD countries, most of the distances \({K}_{u}^{*}\) are greater than the distances \({K}_{u}\) values, so the deviation from stability is generally larger than the deviation from stationarity. However, the WPP values show the opposite pattern. We do not understand why.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Liang, H., Guo, Z. & Tuljapurkar, S. Why life expectancy overpredicts crude death rate. Genus 79, 9 (2023). https://doi.org/10.1186/s41118023001888
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s41118023001888
Keywords
 Sensitivity
 Crude death rate
 Life expectancy