U.S. counties - Building the Skyline

February 25, 2021 by Jason Barr

The Pandemic Tsunami: How COVID-19 Swept Across America

Jason M. Barr and Troy Tassier February 25, 2021

The Great Equalizer?

In March 2020, during the salad days of the COVID-19 pandemic in the U.S., many people, from Madonna to New York Governor Andrew Cuomo, believed the virus was a “great equalizer,” hitting rich as likely as poor, and White as likely as Black or Hispanic.

In short order, however, it became clear that these pronouncements were wrong; people of color and the poorer members of society were hit at rates above their share of the population, particularly in the large cities of the Northeast as well as New Orleans and Detroit. As the great equalizer myth faded, another belief rose in its place: that population density would determine the pandemic’s course. In March, few COVID-19 cases could be found in the sparsely populated rural interior, while New York City was flooded with new cases and deaths.

The Bronx is Burning

By mid-April, the State of New York alone had more cases than any country outside the U.S., with the majority in the New York City metropolitan region. Questions about what was unique about New York soon followed. The frequently repeated answer: its density. New York is, by far, the densest city in the United States, with 28,000 residents per square mile (10,811 per km²); San Francisco is a distant second at 17,000 (6,564 per km²).

Further singling out New York was its extensive public transportation network. Its subway alone carries five million riders per day. As a comparison, Los Angeles’s subway and rail system takes two weeks to hit this threshold. New York was accused, as it were, of being New York—unique and apart from the heartland—which gave the rest of the country a false sense of security.

But soon that would change—cases spiked in the heartland and south during the summer, followed by a cascade of infections washing over urban, rural, and suburban places alike. As a new spring approaches with cases, hospitalizations, and deaths dropping rapidly, along with the limited arrival of vaccines, it seems as if the great waves of the pandemic are receding with the hope of calmer seas and possible herd immunity by summer.

Figure 1: The Spread of Coronavirus from January 22 to April 25, 2020. Note that cases are given per 100,000 residents. GIF created by Eon Kim, based on data from USAFacts.org.

The Early Role of Population Density

In the early weeks of the pandemic, few items were more discussed than the role of density. We took part in the discussion as well, by offering an early critique of its importance. We used the metaphor of lightning striking cities before rural areas because large cities are hubs of tourism and international travel. Our argument was that while density was important, it was not the only determinant of the epidemic. New York and other cities were first because dense places are also central places in the global economy. In this case, people were confusing correlation with causation. The high transmissibility of coronavirus meant it was going to spread widely and soon; no region was safe.

Nonetheless, density should play a role in the spread of an epidemic. Density unchecked is the antithesis of social distancing. More people in tighter spaces lead to more contacts and more opportunities for infectious diseases to spread. It is not a question of whether density is important or not, but rather of its relative magnitude and how its impact may change over time. Others have also investigated its role.

Researchers at Johns Hopkins University performed the best-known study. Using data on 900 counties in U.S. metropolitan areas, they found that a city’s overall population size was correlated with more cases of coronavirus, but a city’s density (people per land area) was not. In other words, if a city has a large population, there will be more cases, on average, regardless of whether the people are sparsely or tightly packed together within the city. Some studies noted the role of density as a primary predictor of the epidemic in the U.S., while still others were more nuanced, claiming, as we did, that density matters but was not the sole factor in the epidemic’s spread. Studies from countries around the world have found mixed results as well (see here and here).

Month to Month

One common feature of the studies, however, is that they all use data from the early part of the pandemic, with analysis ending by the early summer of 2020 at the latest. But how has the virus spread since then, and what role did density play month after month over the past year? Did people of color and the poor continue to bear the brunt of the pandemic throughout the year, or did the burden shift over time?

To understand how and why the virus spread, we analyzed data covering the past 12 months. Doing so reveals the pandemic’s changing impact, with waves crashing on different groups at various times over the year. The result was that all groups were hit, but at unequal levels and at different times.

We expand on the studies discussed above by performing a statistical (regression) analysis that looks at the role of density, and other factors, up to February 1, 2021 (data sources and results here). We use county-level reported coronavirus cases from USA Facts. The goal is to see how the monthly increase in cases can be “explained” by density, race, and poverty. Our method allows us to perform a kind of “hot-spot analysis” to see which key drivers statistically explain coronavirus growth rates each month and how these drivers’ impacts evolved.

Our results give the percentage change in coronavirus cases associated with a 1% change in a variable. Economists call this the elasticity. Larger values mean that the variable of interest has a larger percentage impact on new cases. If a particular variable has an elasticity of 0.5, it means a 1% increase in that variable is associated with a 0.5% increase in coronavirus cases, on average. If the elasticity is one, then a 1% increase in that variable is associated with a 1% increase in coronavirus cases, on average.
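To make the idea of an elasticity concrete, here is a minimal sketch of how one can be estimated. It is an illustration only, not the regression behind the figures in this post: it assumes a single explanatory variable and a log-log functional form, the helper name `loglog_elasticity` is hypothetical, and the data are synthetic, built so that the true elasticity is exactly 0.5.

```python
import math


def loglog_elasticity(x_values, y_values):
    """Return the least-squares slope of ln(y) on ln(x).

    In a log-log relationship this slope is the elasticity: the percent
    change in y associated with a 1% change in x.
    """
    log_x = [math.log(x) for x in x_values]
    log_y = [math.log(y) for y in y_values]
    mean_x = sum(log_x) / len(log_x)
    mean_y = sum(log_y) / len(log_y)
    covariance = sum((lx - mean_x) * (ly - mean_y) for lx, ly in zip(log_x, log_y))
    variance = sum((lx - mean_x) ** 2 for lx in log_x)
    return covariance / variance


# Synthetic, made-up densities (people per square mile) for five counties.
densities = [50, 200, 1000, 5000, 28000]

# Synthetic case growth constructed as 3 * density^0.5, so the
# elasticity recovered below should be very close to 0.5.
case_growth = [3.0 * d ** 0.5 for d in densities]

elasticity = loglog_elasticity(densities, case_growth)
print(f"Estimated elasticity: {elasticity:.3f}")
```

With real county data the points would not fall exactly on a curve, and several variables (density, race, poverty) would enter the regression together, so each estimated elasticity would be an average association rather than an exact rule.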

The Density Effect

Figure 2 plots the monthly elasticity for density on the number of cases for the pandemic’s first twelve months. If we look at the effect of density throughout the epidemic, we see that it was most important in the initial months. The most significant impact of density appears during the second month because of where the epidemic was first located, in northeastern cities and other metropolitan centers like New Orleans and Detroit. As the epidemic continued, the role of density became less important.

Figure 2: The Density Waves. Each point is an elasticity: it gives the percent increase in coronavirus cases from the month prior associated with a 1% change in each variable. For example, on July 1, 2020, the elasticity for density was 0.2. This suggests that, on average, across U.S. counties, a county with a 10% higher density had a 2% higher coronavirus case increase. For household size, the points show the percent increase in coronavirus cases associated with one extra household member, on average. The graphs show that denser counties and those with larger average household sizes, by and large, experienced the worst coronavirus caseloads from April to June 2020. Sources: See here.

Household Size

A second, more localized aspect of density is the average household size. Early on in the pandemic, it was recognized as a key influence in New York City in academic research and the general press. Other studies noted the link more broadly across the U.S. and the world. Household size is more nuanced than the broader concept of population density and has its own avenues for increasing infections. The effect of household size relates to three things: density within a home once an infection is present, the number of opportunities for an infection to enter the home, and the breadth of access to diverse places outside the home where infections are possible. In addition, household size, on average, is positively correlated with more density surrounding the home and with poverty.

The studies mentioned above, again, use data from early in the pandemic. If we look at the effect over time, we see that the impact of household size, or localized density, peaks in April, when it becomes the largest factor in the U.S. epidemic. Like density, it falls off after that point.

Race and Ethnicity

If we return to our elasticity measures, in Figure 3, we see a similar story for race and ethnicity. In the initial months, counties with larger proportions of Asian, Black, and Hispanic people were hit hardest, with this initial wave cresting by May. After that, the measure of elasticity for counties with larger minority populations began to decrease. By July, the elasticity measurement for percentage of Asian and Black residents within a county was approaching zero.

The elasticity for the percentage of Hispanic people within a county, however, decreased at a slower rate, only reaching zero by September. Counties with higher percentages of Native Americans had a slightly different path during the epidemic. After an initial peak in elasticity similar to other groups discussed above, counties with larger Native American populations had a second elasticity peak in August when the pandemic ravaged many areas in the southwest and the Dakotas, among other more rural regions.

Figure 3: Race and Ethnicity Waves. Each point is an elasticity: it gives the percent increase in coronavirus cases from the month prior associated with a 1% change in each variable. For example, on June 1, 2020, the elasticity for Hispanics was 0.24. This suggests that, on average, across U.S. counties, a county with a 10% higher fraction of Hispanic residents had a 2.4% higher coronavirus case increase. The graphs show that counties with large Asian populations were hit hard first, followed by counties with large Black and Hispanic populations. Counties with large Native American populations had two waves, one in the spring and one in the summer. Sources: See here.

The Poverty Wave

The second peak of cases in the summer is marked by an increase in the elasticity of county-level poverty rates. Initially, in the U.S. epidemic, impoverished counties had lower elasticities. But the effect of poverty within a county peaked in the month of July. Because there are several variables in our regressions that each pick up different aspects of the epidemic, it can be hard to sort out exactly what is happening. Because average household size within a county is strongly correlated with poverty in the county, there is also a poverty factor in the initial wave. But the “poverty wave” shown in Figure 4 is the movement of the pandemic in the early summer to areas with high rural poverty, particularly in the southern states.

The Cresting Wave

As the poverty wave receded, we entered a new wave in the fall. This third wave was bad for everyone, with lower elasticities for race, ethnicity, density, and poverty. This doesn’t mean that we have returned to the pandemic as “great equalizer,” as we make clear below. But we find that coronavirus infection rates are now more similar across the socioeconomic spectrum than in the early waves of the epidemic. Mortality rates, however, remain unequal.

Excess Mortality

To understand mortality rates, we used data created by the CDC. They calculate a statistic called excess mortality. It measures the number of seasonally adjusted deaths that occur above normal. Below we plot the excess mortality for each of the population groups over the course of 2020 as percentages above normal (epidemiologists call these the “p-scores”).^[1]

For example, in April, the percentage of deaths above typical for Asian, Black, and Hispanic Americans peaked slightly above 100%. Combining this figure with our elasticities paints a clearer picture of the waves. Early on, the initial waves hit dense cities, particularly areas with large numbers of ethnic and racial minority groups. As the initial wave receded, a second wave arrived in more rural areas of the country, mainly rural areas with larger percentages of Hispanic and Native American residents and regions that were poorer on average.

For the Hispanic population, the second wave was almost as severe as the initial wave, if measured by excess deaths. While our new-cases regressions paint a picture of the cresting wave, excess mortality continues to be higher for minority groups (excluding the Black population at least through November), particularly for the Hispanic and Native American populations (see Figure 5).

Figure 5: Excess Mortality. This graph shows, for each group, the percent above typical mortality rates for each month. For example, in April, the excess mortality rate due to COVID-19 for Hispanics was nearly 1.2, or 120% above normal. For nearly all months and minority groups, excess mortality rates were higher than those for Whites.

Waves within Waves

The pandemic that struck the United States and the world over the past year has not washed over us equally, nor has it arrived in one constant wave. It has been a pattern of episodes, smaller waves within a big storm, that each have unique characteristics. Overall, the pandemic’s waves and idiosyncrasies have inflicted most harm on the more vulnerable in our society, but even there, the effects have changed over time.

Each wave has held its own unique characteristics and impacts on different groups and different regions. This seemingly ever-changing set of features is not surprising to epidemiologists, who are accustomed to dealing with such idiosyncrasies over time. As epidemiologist Adam Kucharski writes in his book, Rules of Contagion, “if you’ve seen one pandemic, you’ve seen… one pandemic.” We wait hopefully for this one to recede into the history books.

Read more posts on the COVID-19 pandemic and related topics here.

—

^[1] These p-scores are highly correlated with our elasticity measures, which can be seen here (in data appendix pdf)

May 19, 2020 by Jason Barr

Border Crossings: The Spread of COVID-19 across U.S. Counties

Jason M. Barr and Troy Tassier May 19, 2020

Pandemic 2020

The COVID-19 pandemic rages on with no end in sight. How long it will take to return to normal, at this point, is anybody’s guess. But because the Federal Government has ceded coordination on further mitigation efforts to individual states, there is going to be a hodgepodge of different policies. Each governor has been left to decide on a suite of strategies for the state’s residents.

Some states are on a path to almost full reopening, while others are taking a more cautious approach. This week, most states will loosen at least some restrictions. Even New York, the state most heavily hit by the pandemic, will begin allowing construction and curbside retail in portions of the state less impacted by the epidemic. But many areas along the east and west coasts will remain closed for longer.

While opening too soon is risky for those within each state, one could argue that if a state’s residents want to put their health at risk, that’s their choice. But a key problem with this approach is that the virus knows no borders. What happens in Vegas doesn’t stay in Vegas. If you live in a state that remains closed, but the surrounding states are opening—Illinois, for example—you should be worried.

The Spread of Coronavirus from January 22 to April 25, 2020. Note that cases are given per 100,000 residents. GIF created by Eon Kim, based on data from USAFacts.org.

A Brief History of the Coronavirus in the United States

The first case of coronavirus was reported in the United States in Washington State on January 20, 2020. From there, cases started appearing in various places throughout the country. Stage I was “the spark” in late January when the first few cases appeared. Stage II was the initial spread, which was confined to the few areas that saw infections early on and was still relatively contained.

Then around March 1, everything changed: the virus spread to the rest of the nation. Community transmission (where a source of the infection cannot be identified) became common. About one month later, we see the rate of increase begin to slow, suggesting that state-level social distancing measures had a positive effect. One could argue that if a strong federal policy had been imposed before the 40th day since the first infection, the rate of increase would have been much slower.

Total Number of Reported COVID-19 Cases Since January 22, 2020. Note: Day 1 here is January 22, 2020. Data are reported on a natural logarithmic scale (which better shows the growth rates of infections). Source: USAFacts.org.
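Why a logarithmic scale shows growth rates can be sketched in a few lines. If cases grow by a constant percentage each day, the natural log of the case count rises by a constant amount each day, so steady growth plots as a straight line and a flattening line means slowing growth. The function name `log_growth_rates` and the case counts below are hypothetical, chosen only for illustration.

```python
import math


def log_growth_rates(cumulative_cases):
    """Return day-over-day differences in the natural log of cumulative cases.

    Each difference is the continuous daily growth rate between two days.
    """
    logs = [math.log(count) for count in cumulative_cases]
    return [later - earlier for earlier, later in zip(logs, logs[1:])]


# Hypothetical cumulative counts that double every day.
doubling_series = [100, 200, 400, 800]
steady_rates = log_growth_rates(doubling_series)

# Hypothetical cumulative counts whose growth slows down over time.
slowing_series = [100, 200, 300, 360]
slowing_rates = log_growth_rates(slowing_series)

print("Steady doubling:", [round(rate, 3) for rate in steady_rates])
print("Slowing growth: ", [round(rate, 3) for rate in slowing_rates])
```

In the doubling series every difference equals ln(2), while in the slowing series the differences shrink from one day to the next, which is the pattern described above for the period after social distancing began.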

The Birds and The Bees: The Reproduction Number

To understand the impact of the spread of the virus, we need to review basic epidemiology. It all begins with the reproduction number (R): the average number of people to whom one infected person passes the virus. For example, if each individual spreads the virus to two people, then the reproduction number is two. In this case, Greg gives it to Marsha and Jan. Marsha gives it to Bobby and Peter. Jan gives it to Cindy and Alice. And so on. Two give it to four, four to eight, and on and on. In other words, with a reproduction number of two, the number of cases doubles in each generation of infections.
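The generation-by-generation arithmetic can be sketched as follows. This is a deliberately simplified illustration: it assumes that R stays fixed from one generation to the next and that every infection passes on exactly the average number of new infections. The function name `cases_per_generation` is hypothetical.

```python
def cases_per_generation(r, initial_cases, generations):
    """Return new cases in each generation, assuming a constant reproduction number r."""
    counts = [initial_cases]
    for _ in range(generations):
        # Each case in the current generation produces r cases in the next.
        counts.append(counts[-1] * r)
    return counts


# With R = 2, one case becomes two, then four, then eight.
growing = cases_per_generation(2, 1, 3)

# With R below one (here 0.5), each generation is smaller than the last.
shrinking = cases_per_generation(0.5, 100, 2)

print("R = 2:  ", growing)
print("R = 0.5:", shrinking)
```

The only difference between a growing and a shrinking epidemic in this sketch is whether R is above or below one.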

In the U.S., earlier this year, the average reproduction number across all states was a little over two but varied widely across regions. Some estimates place New York City a little under four, while Arkansas and South Dakota may have been below one early in the epidemic. New York was hit so badly in part because of the strength of its social ties.

Four Things

R, however, is determined by four things: the fraction of remaining susceptible people in the population (how many people are left that could be infected), the duration of infection (how long an infected person carries the virus and is able to infect others), the transmission rate (how likely is a susceptible person to be infected if in contact with the virus), and the contact rate (how many other people does an infected person interact with each day when infected).
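One common simplification, assumed here for illustration, treats R as the product of these four factors. The sketch below uses that multiplicative form; the function name `reproduction_number` and every input value are hypothetical and are not estimates for any real place.

```python
def reproduction_number(susceptible_fraction, infectious_days,
                        transmission_prob, contacts_per_day):
    """Return R under the assumed multiplicative form.

    susceptible_fraction: share of the population that can still be infected
    infectious_days:      how long an infected person can infect others
    transmission_prob:    chance that one contact leads to an infection
    contacts_per_day:     number of people an infected person meets each day
    """
    return (susceptible_fraction * infectious_days
            * transmission_prob * contacts_per_day)


# Hypothetical baseline: everyone susceptible, 5 infectious days,
# a 4% chance of transmission per contact, 10 contacts per day.
baseline_r = reproduction_number(1.0, 5, 0.04, 10)

# Same hypothetical disease, but contacts cut from 10 to 4 per day.
distanced_r = reproduction_number(1.0, 5, 0.04, 4)

print(f"Baseline R:  {baseline_r:.2f}")
print(f"Distanced R: {distanced_r:.2f}")
```

Because the factors multiply in this form, cutting any one of them by a given proportion cuts R by the same proportion.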

One fights an epidemic by making R smaller. If it goes below one, then instead of increasing, the epidemic will die out. For instance, if R=1/2, then 100 people become 50 in the next generation, then 25, and so on until the epidemic disappears. Each of the four elements of R can be an avenue for governmental policy.

R-Reducing Policies

We can decrease the fraction of susceptible individuals through a vaccine, but this takes time. We can limit the duration of infection through extensive testing and contact tracing, followed by isolation of the newly infected. The transmission rate can be lowered by prophylactic measures such as wearing gloves, masks and eye wear, and extensive cleaning of surfaces. But the primary method to reduce R has been by lowering the contact rate via social distancing.

After social distancing restrictions were imposed, most states lowered their reproduction number to less than one. Estimates suggest that even New York now sits at about 0.85. But these values vary across states. Places like Iowa, Illinois, and Arizona are likely sitting just above one. Some, like Alaska, Idaho, Montana, and Vermont, are well below one. But as states reopen, their R is likely to increase, even with slight easing of social distancing policies. Additionally, there appears to be no systematic effort for wide-scale testing and contact tracing.

Do No Fences Make Bad Neighbors?

One of the things that students learn early in an economics course is that no individual is an island. The buying and selling decisions of the masses determine the price you pay for any common consumer good you care to name. Laws are passed to limit the pollution of automobiles so that we don’t have too large an effect on the clean air of others as we commute to work. We ban smoking in many public places, so non-smokers don’t breathe second-hand smoke.

These spillover effects are called externalities. Externalities don’t have to be bad. Planting flowers in your yard has a positive benefit to your neighbors across the street. The vaccines that you get to prevent you from falling sick from the flu also prevent you from infecting others; thus, your decision to get a flu shot makes everyone around you safer.

Many states are making decisions to reopen based on the caseloads, hospital capacities, and economic circumstances within their own borders. But they seem to be considering less, if at all, the potential impacts on surrounding states. A virus does not respect state borders. There is no visible or invisible fence that keeps coronavirus from passing from Georgia to Florida, or from Iowa, Wisconsin, and Indiana (each a high profile state that is reopening) to Illinois, which is still struggling to keep its caseload under control.

Regional Pacts

This is the reason that some states have formed regional pacts to coordinate their reopenings and reduce the negative externalities. For example, New York, New Jersey, Connecticut, Pennsylvania, Delaware, Rhode Island, and Massachusetts have formed a multi-state agreement to coordinate their COVID-19 responses. It is a form of centralization that will especially help those states in the center of the region, like New York and New Jersey, whose state borders are all within the pact’s boundaries. Pennsylvania, on the other hand, shares a border with Ohio, West Virginia, and Maryland. How that affects the Keystone State depends on how well its neighbors “behave.”

Corona Caseloads

To better understand the basic spread of COVID-19, we have undertaken a range of statistical analyses of county-level coronavirus cases. (The results can be found here.) We use data on confirmed cases from USAFacts.org and estimate what drives the virus’s spread by looking at variables within each county. In general, we find that the initial seeds of the epidemic in a local region tended to be driven simply by bad luck and the presence of airports.

But once it arrives in a location, several things affect its spread. Specifically, over time, denser counties and those with more use of public transportation have faster growth rates, even with mandated stay-in-place measures.

Spilling Over

But to better understand the virus’ spread, we looked at how neighboring counties affected each other. We do this in two different ways. First, we measure the average number of cases on March 20 in all surrounding counties that share a border with each county. We then statistically estimate the impacts of neighboring county cases on the original county, as of May 9. Second, we measure the number of cases in all counties on March 20 but discount the effect of counties that are farther away. So, this gives an average number of cases in surrounding counties, with more weight given to those closer by.
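The two neighbor measures can be sketched as follows. This is an illustration under stated assumptions rather than the calculation used in the analysis: the four counties, their coordinates, case counts, and border lists are made up, and the inverse-distance weight is an assumed discounting rule, since the exact discount function is not specified here.

```python
import math

# Hypothetical toy data: centroid coordinates and case counts on the start date.
counties = {
    "A": {"xy": (0.0, 0.0), "cases": 10},
    "B": {"xy": (1.0, 0.0), "cases": 40},
    "C": {"xy": (0.0, 2.0), "cases": 20},
    "D": {"xy": (4.0, 0.0), "cases": 100},
}

# Hypothetical list of which counties share a border.
borders = {
    "A": ["B", "C"],
    "B": ["A", "D"],
    "C": ["A"],
    "D": ["B"],
}


def adjacent_average(county):
    """Measure 1: plain average of cases in counties that share a border."""
    neighbors = borders[county]
    return sum(counties[name]["cases"] for name in neighbors) / len(neighbors)


def distance_weighted_average(county):
    """Measure 2: average over all other counties, weighted by 1 / distance.

    The inverse-distance weight is an assumption made for this sketch.
    """
    x0, y0 = counties[county]["xy"]
    weighted_sum = 0.0
    weight_total = 0.0
    for name, info in counties.items():
        if name == county:
            continue
        x1, y1 = info["xy"]
        weight = 1.0 / math.hypot(x1 - x0, y1 - y0)
        weighted_sum += weight * info["cases"]
        weight_total += weight
    return weighted_sum / weight_total


print("Adjacent average for A:         ", adjacent_average("A"))
print("Distance-weighted average for A:", round(distance_weighted_average("A"), 2))
```

In this toy example, county D does not border county A, so it is ignored by the first measure but still contributes, with a small weight, to the second.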

No matter which measure we choose, we find significant and large externalities across county borders. That is to say, the number of cases in surrounding counties has a significant impact on the number of cases in each county, on average.

Border Effects

We discuss the adjacent neighbors first. Let’s take two counties (call them Oak and Maple) that are the same in all respects but one. They each have about the same population density, socioeconomic and racial profile, etc. The only difference is that Maple County’s neighbors have a 10% higher number of cases, on average, than Oak County’s. That is, the only difference between the two counties is what’s happening outside of them.

In this case, our estimates indicate that Maple County will have 2.7% more cases than Oak. In other words, about ¼ of a one-percent increase in your neighbors’ cases is passed on to you within 60 days. If we repeat this procedure but include all counties across the country, discounting counties that are farther away, we get even larger effects. A 10% increase in the cases of other counties results in a 7% increase in cases in your own county 60 days later. The result from this method is larger because it considers how cases multiply across space. Your neighbors are affected by their neighbors, who are affected by their neighbors, and so on. The impact from the 60-day window illustrates how these effects are long lasting.

Magnitudes

As an example of this magnitude, suppose that in isolation, you would expect to have 1,000 cases based on your profile of population, density, public transportation use, etc. Then suppose that your neighbors (in decreasing weight by distance) had twice as many cases as another similar county’s neighbors 60 days earlier. This doubling of your neighbors’ cases would result in you having 1,700 cases instead of 1,000. So, you have an extra 700 cases as a direct result of your neighbors’ influence on the pattern of the epidemic. This is the negative spatial externality associated with this epidemic. It is the reason why states and regional areas are or should be coordinating their efforts.
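The arithmetic behind this example can be sketched in a few lines. The 1,700 figure follows from applying the 0.7 elasticity proportionally: a 100% increase in neighbors’ cases times 0.7 gives a 70% increase. For comparison, the sketch also shows what a strict constant-elasticity (log-log) reading would give for a change as large as a doubling; that second calculation is an assumption added for illustration, and both function names are hypothetical.

```python
def proportional_effect(baseline_cases, elasticity, pct_change_in_neighbors):
    """Scale the elasticity linearly with the percent change in neighbors' cases."""
    pct_change_in_own_cases = elasticity * pct_change_in_neighbors
    return baseline_cases * (1 + pct_change_in_own_cases / 100)


def constant_elasticity_effect(baseline_cases, elasticity, neighbor_ratio):
    """Assume own cases scale with (neighbor ratio) ** elasticity, as in a log-log model."""
    return baseline_cases * neighbor_ratio ** elasticity


# Doubling neighbors' cases is a 100% increase, or a ratio of 2.
proportional_cases = proportional_effect(1000, 0.7, 100)
loglog_cases = constant_elasticity_effect(1000, 0.7, 2.0)

print(f"Proportional reading:        {proportional_cases:.0f} cases")
print(f"Constant-elasticity reading: {loglog_cases:.0f} cases")
```

The two readings nearly coincide for small changes such as 10% and diverge for large ones; the constant-elasticity version gives a somewhat smaller figure for a doubling, roughly 1,625 cases. Either way, the spillover from neighbors is large.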

The main takeaway is that surrounding states can trigger a rise in cases and deaths among their neighbors. If a state, such as Illinois, is struggling to keep its R value under one, and therefore remains closed, but starts getting “spillover” cases from a neighboring state, those cases could push Illinois’ value of R above one, making it difficult for the state to keep its epidemic under control, let alone reopen its economy.

Border Effects. This map gives the estimated number of COVID-19 cases on May 9, 2020 that originated directly outside the county at least 50 days before they were recorded. That is, the map gives state-level estimates of cases originating in a bordering county 60 days prior. Source: here.

Corona 2.0

And, just as there can be negative externalities in cases from a neighbor reopening, there is a positive benefit from a neighbor remaining closed. If state A opens first, it benefits economically from its actions. But, if its neighbor, B, stays closed, A pays a lower cost in terms of the epidemic because it receives fewer spillover cases. Thus, state A wants to open before state B. But, if state B remains closed while A is open, B will experience the negative externality. It loses economically because it is closed, and receives more spillover cases from A. The epidemic in the closed state will be worse as a result (and may even see its R rise to above one).

State B may then be forced to lengthen its stay-in-place policies—and the economic harm they cause—because of its neighbor’s actions. State A gets to “free ride” on its neighbor for a while. Of course, free riding will backfire. If state B has a spike of cases, it will come back to its neighbor A in the future.

The Summer of our Discontent?

This lack of coordination can have even deeper ramifications for the nation as a whole. By triggering new outbreaks across state borders, it makes the U.S. radioactive. Other countries are likely to join together to allow trade and travel among themselves, leaving the U.S. behind while the suffering and death toll drags on because some states are going it alone.

In the meantime, let’s hope an effective treatment or a vaccine gets here soon…