Transmission T-022: Van Savage on the informational pitfalls of selective testing

Detail: "Le Pont Japonais." Claude Monet. oil. 1922

April 27, 2020

Test kits cannot exponentiate at the same rate as the virus. Unless we ramp up to 500K, the curve will flatten due to artifact.

Read the Reflection, written 18 August 2021, below the following original Transmission.

Decisions about when and how to relax social distancing will ultimately come down to whether or not we think we’re “flattening the curve,” that is, slowing the growth rate of the spread of infection. But how do we know? The prevailing perception is that we can look at the curve fit to the new cases and deaths reported each day, but this might not be correct. If the true number of cases in the population is well beyond our maximum testing capacity (as it is in the U.S.) and if we are primarily testing those with symptoms (as we are in the U.S.), then in time the changes that we see might be dominated by random noise. Because the true number of cases far exceeds the testing capacity, the signal is essentially saturated. Selectively testing symptomatic cases really measures what proportion of respiratory illnesses and fevers are due to COVID-19, not what percentage of the population has COVID-19, and that proportion could remain close to constant as COVID-19 spreads. Currently, this proportion in the U.S. varies by state, but is not changing much over time or as we increase testing capacity.¹,²

Consequently, the time series of new measured cases could simply reflect random fluctuations around an average that is given by the number of tests per day and the chances that someone with symptoms has COVID-19, as opposed to a different respiratory illness. Neither of these depends on the current growth rate or trajectory of COVID-19 cases in the general population. For instance, estimating that the U.S. is limited to conducting about 150,000 tests per day¹ and that the positive rate for tests is about 20 percent, we expect about 30,000 new cases to be reported each day, with random fluctuations based primarily on exactly how many tests are processed that particular day.
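The arithmetic in this paragraph can be written out as a minimal sketch. The function name and the binomial model (each test independently positive with the same probability) are illustrative assumptions, not part of the original analysis; the 150,000 tests per day and 20 percent positive rate are the rough estimates quoted above.

```python
import math

def expected_reported_cases(tests_per_day, positive_rate):
    """Return (mean, standard deviation) of daily reported cases.

    Assumes each test is independently positive with probability
    `positive_rate` (a binomial model, chosen here only for illustration).
    """
    mean = tests_per_day * positive_rate
    std = math.sqrt(tests_per_day * positive_rate * (1.0 - positive_rate))
    return mean, std

mean, std = expected_reported_cases(150_000, 0.20)
print(f"expected reported cases per day: {mean:,.0f}")
print(f"binomial standard deviation: {std:,.0f}")
```

Under this assumed model the sampling noise is small compared with the mean, which is consistent with the point that day-to-day fluctuations would come mainly from how many tests are processed, not from the epidemic's trajectory.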

Early on when a disease is spreading, the number of cases will increase and look exponential either because the number of cases is increasing exponentially and can be adequately measured by tests, or because the testing capacity is increasing exponentially and the positive test rate is roughly constant, or both. However, if the number of cases is growing exponentially, it will not take long for the true number of cases to reach millions or tens of millions. So, running approximately 100,000 tests per day can’t possibly capture the true numbers. This still might be okay if the shape of our growth curve is the same, meaning the measured cases are a constant proportion of true cases. And if the population is being randomly sampled, this might actually work because the per-capita growth rate could still be captured, so that when the curve flattens, the percentage of tests that are positive will drop. However, this is not necessarily true for the percentage of tested symptomatic cases that are due to COVID-19, which will yield much higher rates of positive results than if tests were randomly sampled from the general population. The percentage of positives from testing only symptomatic cases may remain closer to constant, because those sick enough to come in have a reasonable chance of carrying COVID-19.

That is, we could be suffering from a double whammy in our approach. By having limited testing capacity, we can’t track true numbers, and by testing only those with symptoms, we might not be tracking how the per-capita growth is actually changing. Because of this double whammy, we may be flying blind in terms of the growth rate of cases and not know whether we’re flattening the curve.

To illustrate these points, Figures 1 and 2 present a very simple simulated example in which the true cases grow exponentially for 160 days, versus a scenario where the true cases grow exponentially for 125 days but then flatten, with new cases appearing at a constant rate. For both scenarios, the total number of cases by day 125 is beyond the maximum testing capacity per day. (All parameter values are rough estimates. The percentage of asymptomatic cases has been varied from 40 to 80 percent; this affects numerical values but not the overall conclusions shown in the accompanying Mathematica file, which is also available as a PDF.) I also assume testing is occurring only in some fraction of those with symptoms, and I estimate that people exhibiting generic symptoms of respiratory distress, fever, etc. have a 10 to 20 percent chance of testing positive for COVID-19. In terms of reported cases, both scenarios appear as if the curve is flattening, and it’s not clear how to distinguish the dynamics of the true cases (exponential versus flat) using only the data for the measured number of new cases. Perhaps the range of values for new cases is slightly different, but the shape of the curve isn’t.

Figure 1. Two scenarios for growth in true number of cases. A. Exponential growth for 160 days, with an increase by a factor of e every 8 days. B. Exponential growth for the first 125 days as in A, but a flattened curve for the last 35 days. For the flattened curve in the last 35 days, the new cases per day randomly fluctuate around the number of new cases at the point where exponential growth ceased. Parameter values are given in the accompanying Mathematica file (also available as a PDF). Changing parameter values has little effect on the results as long as the population and testing approach satisfy the two basic assumptions: true cases well beyond testing capacity, and selective testing of a population with respiratory distress, fever, etc. that has a roughly constant chance of having COVID-19.

Figure 2. Measured new cases for the two scenarios from Figure 1. These results are indistinguishable and both appear as if the curve has been flattened, even though one results from true cases that have pure and continued exponential growth (A.), and the other (B.) has true cases with a first period of exponential growth that is followed by a flattened curve with random fluctuations.
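The two scenarios can be re-created roughly in a few lines of Python. This is a sketch under stated assumptions, not the author's Mathematica model: the starting value of one new case on day 0, the 50 percent symptomatic share, the sizes of the random fluctuations, and every function and constant name are hypothetical choices made for illustration. The e-fold time of 8 days, the 125 and 160 day horizons, the 150,000 daily tests, and the 20 percent positive rate come from the text.

```python
import math
import random

GROWTH_TIME = 8.0      # days for true new cases to grow by a factor of e (Figure 1)
FLATTEN_DAY = 125      # day exponential growth stops in scenario B (Figure 1)
TOTAL_DAYS = 160
CAPACITY = 150_000     # tests per day (rough estimate from the text)
POSITIVE_RATE = 0.20   # chance a symptomatic person tests positive (text: 10 to 20 percent)
SYMPTOMATIC = 0.5      # assumed share of true cases with symptoms

def true_new_cases(day, flatten, rng):
    """True new cases on `day`, starting from one case on day 0 (assumption)."""
    if flatten and day > FLATTEN_DAY:
        plateau = math.exp(FLATTEN_DAY / GROWTH_TIME)
        return plateau * rng.gauss(1.0, 0.05)  # fluctuate around the plateau
    return math.exp(day / GROWTH_TIME)

def measured_new_cases(true_cases, rng):
    """Reported positives when only symptomatic people are tested, up to capacity."""
    # People seeking tests: symptomatic COVID-19 cases plus other illnesses,
    # scaled so COVID-19 is a constant POSITIVE_RATE share (the key assumption).
    demand = SYMPTOMATIC * true_cases / POSITIVE_RATE
    capacity_today = CAPACITY * rng.gauss(1.0, 0.10)  # daily processing noise
    tests_done = min(demand, capacity_today)
    return POSITIVE_RATE * tests_done

def simulate(flatten, seed=0):
    """Return (true, measured) daily new-case series for one scenario."""
    rng = random.Random(seed)
    true = [true_new_cases(d, flatten, rng) for d in range(TOTAL_DAYS + 1)]
    measured = [measured_new_cases(t, rng) for t in true]
    return true, measured

def mean(xs):
    return sum(xs) / len(xs)

true_a, meas_a = simulate(flatten=False)  # scenario A: exponential throughout
true_b, meas_b = simulate(flatten=True)   # scenario B: flattened after day 125
late = slice(FLATTEN_DAY + 1, TOTAL_DAYS + 1)
print(f"true new cases on day 160, A vs B: {true_a[-1]:.3g} vs {true_b[-1]:.3g}")
print(f"mean measured cases, days 126-160, A: {mean(meas_a[late]):,.0f}")
print(f"mean measured cases, days 126-160, B: {mean(meas_b[late]):,.0f}")
```

Under these assumptions the demand for tests exceeds capacity well before day 125, so both measured series hover near capacity times positive rate (about 30,000 per day) even though the true daily counts on day 160 differ by a factor of roughly 80.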

Tracking the number of deaths will likely give similar conclusions unless presumed cases (not just those confirmed by testing) are included. This policy varies by state, but presumed cases are often not included in official counts. Also, conclusions based on the number of deaths will be delayed by a few weeks compared with conclusions that could be based on actual case counts, and won’t help as much in anticipating or avoiding hospital overflow. Extreme backlogs and time delays in processing tests could also create short periods of a few days in which the curve looks flattened or spiked, even though new cases are still growing exponentially.

In summary, we must ramp up testing capacity and/or test more randomly. My rough calculations here also match prominent calls that 500,000 tests per day are needed to see whether we’re flattening the curve.³

Van Savage
UCLA School of Medicine
Santa Fe Institute

1 Robinson Meyer and Alexis C. Madrigal, “A New Statistic Reveals Why America’s COVID-19 Numbers Are Flat,” The Atlantic, April 16, 2020, https://www.theatlantic.com/technology/archive/2020/04/us-coronavirus-outbreak-out-control-test-positivity-rate/610132/
2 Justin Kaashoek and Mauricio Santillana, “COVID-19 Positive Cases, Evidence on the Time Evolution of the Epidemic or An Indicator of Local Testing Capabilities? A Case Study in the United States,” written April 10, 2020, https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3574849
3 Keith Collins, “Coronavirus Testing Needs to Triple Before the U.S. Can Reopen, Experts Say,” The New York Times, April 17, 2020, https://www.nytimes.com/interactive/2020/04/17/us/coronavirus-testing-states.html


Read more posts in the Transmission series, dedicated to sharing SFI insights on the coronavirus pandemic.

Listen to SFI President David Krakauer discuss this Transmission in episode 31 of our Complexity Podcast.

Reflection

August 18, 2021

Complexity after COVID-19

More than a year after writing about COVID-19, my views on the basic science of disease transmission—infectiousness, fatality rate, behavioral modifications to slow the rate of spread, the dangers of saturating and exceeding intensive care unit capacity—have changed very little. In direct contrast, however, my views have changed tremendously in terms of how to enact and encourage policies and behaviors that will help us—as a city, state, country, or planet—to slow the spread of COVID-19. This is necessary so that children can go back to school, people can socially and professionally engage in person, and the most vulnerable populations (people who are elderly, with complicating health conditions, with lack of access to healthcare, who can’t stay home from work) can be protected.

On the first count of the basic science of disease spread, I was concerned over a year ago with how to reliably obtain and analyze data on new COVID-19 cases and new fatalities to infer when disease spread and risk are worsening versus improving. My Transmission was meant as a warning that we might have believed we were flattening the curve when in fact it was still growing exponentially, so that we were actually in worse shape than we recognized.

When I look back now, I think about the last day I was in my office and on campus before we went into full lockdown. One of my colleagues asked me if I thought COVID-19 was really that bad. I replied that the estimates of a fatality rate of 0.5–1.5% seemed believable to me based on a couple of pieces of independent data. Since there are about 320 million people in the United States, if you make a couple of assumptions—about half of people get the infection before disease transmission seriously slows, about half of people are asymptomatic or naturally immune—that reduces symptomatic cases to about 80 million people. So if 1% of those cases led to fatality, you’d guess there would be about 800,000 fatalities, and allowing for uncertainty in the range of estimates for fatality rate, you’d guess a rough range of about 500,000 to 1,000,000 deaths in the US. My colleague seemed genuinely surprised and replied that those were very large numbers, and that they didn’t believe that would be allowed to happen. I responded that was the estimate if we did nothing, but that I hoped we—as a society—did something and could keep the numbers much lower than that.
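The back-of-the-envelope estimate in this paragraph can be written out as a short sketch. The function and variable names are hypothetical; the inputs (320 million people, half infected, half of those symptomatic, a fatality rate of 0.5 to 1.5 percent) are the rough assumptions stated above.

```python
def fermi_fatalities(population, infected_share, symptomatic_share, fatality_rate):
    """Rough fatalities: population x infected share x symptomatic share x fatality rate."""
    symptomatic_cases = population * infected_share * symptomatic_share
    return symptomatic_cases * fatality_rate

US_POPULATION = 320_000_000

# Sweep the quoted fatality-rate range: low end, central value, high end.
for rate in (0.005, 0.01, 0.015):
    deaths = fermi_fatalities(US_POPULATION, 0.5, 0.5, rate)
    print(f"fatality rate {rate:.1%}: about {deaths:,.0f} deaths")
```

The central value reproduces the 800,000 figure, and the endpoints of the fatality-rate range bracket the rough range of 500,000 to 1,000,000 quoted in the text.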

As I’m writing this on August 18, 2021, the Delta variant is leading to new spikes in infections, and the current number of fatalities from COVID-19 in the United States is well over 600,000.¹ And this number ignores undercounting due to missed deaths, especially at the early stages of the pandemic, that could place the actual current number of COVID-19 deaths much closer to one million. That’s now on par with the number of fatalities in the US Civil War, by far the bloodiest war in US history (though it occurred when the overall population was one-tenth the size it is now).

Although none of my assumptions were completely correct, the order of magnitude and range of the final estimate were accurate. As a physicist, that’s what I aim for in doing a back-of-the-envelope Fermi calculation.² A year later, I still believe that at the level of rough estimates and calculations, without needing either sophisticated models of dynamical systems or technical statistical analyses, much about this pandemic was relatively straightforward to predict.

But even though I maintain the events that played out over the past year were quite predictable, a more compelling question is whether they were preventable. Indeed, when talking with my colleague, I said “if we did nothing.” But it wouldn’t be fair to say “we did nothing.” At the personal level, my son didn’t go to school, and my wife and I didn’t go into work, for over a year. We wore masks when outside the home. We rarely saw friends and went a year and a half without seeing family. And in the grand scheme of things, I was extremely fortunate to have the flexibility and resources I had. At the level of the whole society, we wore masks, shut down large public events and indoor gatherings, closed schools, imposed lockdowns, provided financial assistance, put a moratorium on evictions, developed multiple effective vaccines, and administered millions of vaccine doses.

So the question is: Why did these actions fail? Or to unpack that: Were these measures not followed closely enough by the public, not communicated well enough by leaders, not started quickly enough or over the right timescales, or not done in the right places? Was the failure because of political polarization that led people to take harmful actions and engage in dangerous behaviors, even when provided with reliable information and sound advice for modifying behavior? Was the failure an inability to enact these kinds of measures at the population level due to the demands of earning a living? Or is the number of actual deaths in the low range of my original estimate because our measures really did help avoid a few hundred thousand deaths? Is it some combination of these?

Having accurate information is necessary for making good decisions, but it is not sufficient. At the population level, we must also clearly communicate that information, work to provide people with the financial ability to follow recommendations, and implement best practices for ensuring that people are not making decisions in ways that are biased or stem from political polarization and misinformation. My Transmission was focused only on the accurate information. A year later what I think we need to focus on is all the rest—political polarization, communication, misinformation, and financial and health support. And I say this as someone who has lost four relatives to COVID-19 since I wrote the original piece, and as someone with relatives who have not yet chosen to get the vaccination (not counting my son, who’s too young).

In looking at our society as a complex system, it is now clearer to me than ever before that these social and behavioral issues are the bottleneck in our ability to help society and protect those we love.

Read more thoughts on the COVID-19 pandemic from complex-systems researchers in The Complex Alternative, published by SFI Press.

Reflection Footnotes

1 U. Irfan, “How the World Missed More Than Half of All COVID-19 Deaths,” Vox, May 7, 2021, https://www.vox.com/22422794/covid-19-death-numbers-total-us-vaccine-ihme

2 See V.M. Savage, J.F. Gillooly et al., 2004, “Effects of Body Size and Temperature on Population Growth,” The American Naturalist 163(3), doi: 10.1086/381872
