Making sense of coronavirus statistics
The data quality showing the spread of the coronavirus is poor. The data from the Hubei province (the origin of the virus) is still wildly different from the data from elsewhere and so we need to analyse the data in separate parts. While the numbers don’t look as bad when analysed this way, I note the economic impact is driven by the number of shutdowns and quarantines, not by the number of cases.
I generally separate the data into three parts: (a) Hubei (b) the rest of Mainland China (c) the rest of the World for analysis.
Investment Impacts
See this recent post for more detail on the investment impacts. My core thesis is the economic impact of the virus is not particularly related to the number of deaths but more related to the measures.
In some respects, they are inversely correlated. An unusually strict shutdown/quarantine would likely have a more significant economic impact but lead to fewer deaths and a slower spread of the virus. In other respects, they are correlated. More cases/deaths will lead to stricter quarantine conditions.
I note already the travel bans and quarantine measures exceed anything implemented with SARS in 2003.
Total Deaths
It is pretty clear from this chart the deaths in Hubei are on a different scale to the rest of China:
Total Cases
So, with such a stark difference it seems suspicious that the number of cases in Hubei was roughly the same as the rest of China ten days ago. It is still only a little more than double.
Some of this may be a lack of test facilities, some of it may be a bureaucratic cover-up. Some might be that hospitals are over-run in Hubei and so can’t give the same care as elsewhere. Or, maybe there is a different factor creating this issue.
The numbers for the rest of the World are still tiny. And, in reality, aren’t a measure of cases elsewhere as much as they are of travellers bringing the virus with them.
Mortality Rate
The mortality rate is where we can see distinct differences in data. Dividing the number of deaths by the number of cases during the early stages of an outbreak is very misleading. People who were diagnosed today with the disease are still alive, but they still might die from the disease in the coming days.
A better way is to compare the current deaths to the number of cases from “x” days ago. We still don’t know how many days we should be looking back. The stats so far suggest that the median days from the first symptom to death is 14. But with a broad range from 6 to 41. And, we don’t know how long on average after the first symptom a person would take to become a case.
The below charts show the death rate if the right period to look back is four days, eight days or 12 days. Using data without Hubei, a mortality rate of somewhere between 1% and 4% is likely. The Rest of World data is similar but with only 2 deaths not worth looking at yet.
Using Hubei data suggests the mortality rate is somewhere between 18% and 90%. This seems improbable – to put it mildly. If we trust the older Hubei data, there are now more people dead than what there were cases 2 weeks ago.
Outcomes
Keep in mind that almost all “rest of the world” cases so far are really Chinese cases that have travelled. It will be weeks before we know if there will be outbreaks outside of China that will need quarantine measures.
The number of cases within China is probably over 200,000 already – significantly higher than reported.
Given this, economic outcomes are likely to be:
- extensive shutdowns and travel bans in China for at least three months, most likely longer
- a relatively good chance of shutdowns and travel bans in other countries, especially close neighbours
When looking at any coronavirus data, I recommend separating the data before analysing.
1 topic