A simple model for monitoring COVID-19 outbreak in Italy

Massimiliano d’Aquino
Associate professor of Electrical Engineering
Department of Engineering, University of Naples Parthenope
http://wpage.unina.it/mdaquino

March 12th 2020
(forecast updated May 15, 2020, see table of contents, section 5)
Automated forecast tool (daily updated) available at
http://wpage.unina.it/mdaquino/covid19/auto/forecast.html

1 Introduction
2 Modeling growth of infected individuals. Analysis of COVID-19 outbreak in China
3 Connection with SIR mechanistic models
4 Monitoring COVID-19 outbreak in Italy
5 Latest updates and forecasts as of May 15, 2020
6 More advanced forecast based on SIRD model as of May 15, 2020
References

1 Introduction

This note has been written to obtain a simple and aware modeling for epidemics outbreak and, in particular, to follow the evolution of COVID-19 spreading in Italy. The methods adopted are briefly summarized. The starting observation is that the spreading of a epidemics is intrinsically a time-varying exponential growth model. This means that it can be realistically modeled with an exponential function with time-varying rate. This is compatible with the assumption that the time-scale for variation of the rate and contagion are separated. More specifically, it is expected that the rate changes on a slower time-scale than the contagion.

The above reasoning justifies the modeling of the number of infected people I(t) at given time t with the following differential equation:

dI dt = λ(t)I(t), (1)

where λ(t) is the time-varying rate of exponential growth. This equation can be solved in closed form and the solution is expressed as:

I(t) = I(0)exp 0tλ(τ)dτ. (2)

The other assumption that is compatible with the evolution of an epidemics is that in the long term the number of infected people must approach a saturation value. In this respect, it is very important to be able to forecast the time instant associated to such saturation.

2 Modeling growth of infected individuals. Analysis of COVID-19 outbreak in China

Mathematically speaking, the above observation means:

lim tI(t) = I. (3)

This is compatible with what has been observed for the disease evolution in China[1] starting from end of December 2019, as it is reported in fig.1(a). As one can clearly see, the contagion has an initial exponential growth in the early stage of the epidemics (more or less one month), then the growth rate decreases and the total number of cases approaches saturation around day 50. Analysis of this behavior suggested to use an appropriate logistic function to model I(t). Namely, the following form is assumed:

I(t) = aexp (bt) 1 + exp (b(t r)), (4)

where a,b,r are three fitting parameters. One can easily recognize that a is a scaling factor, b controls the initial exponential growth, and r defines a delay time before saturation sets in. We observe that the asymptotic number of infected individuals is given by:

I = lim tI(t) = aexp (br). (5)

pict pict

Figure 1: (a) Evolution of number of people infected (blue symbols and line for cumulative distribution, red for daily contagions) by COVID-19 in China. (b) Predictions with model (4) with data updated to day 32 (green), 35 (purple), 40 (cyan), 45 (yellow). Saturation around day 55 is already predicted at day 32.

The result of the fitting procedure shows very good agreement with measured data, as it is visible in fig. 1(a), predicting approach to saturation at day 54.

In this respect, according to (4), the saturation is reached when t and, consequently, t r 1b. A practical criterion is to require that:

t > tsat = r + 5 b, (6)

which can be easily understood by rewriting eq.(4) in terms of Δt = t r as:

I(t) = aexp (br) exp (bΔt) 1 + exp (bΔt), (7)

and observing that for Δt > 5b one has I(t) > 0.99I. This is similar to what is done to estimate the decay time (t > 5τ, with τ = 1b playing the role of the maximum time constant) of the transient response for linear time-invariant dynamical circuits.

Another quantity of interest in monitoring an epidemic is the maximum growth rate, namely the maximum number of contagions per day ΔImax and the associated prediction for the peak date tmax. It can be easily shown that the maximum of the derivative dIdt is given by:

tmax = argmax dI dt = r,ΔImax = dI dt max = bI 4 = baexp (br) 4 . (8)

Now, we observe that the function (4) can be expressed as:

I(t) = aexp (bt) 1 + cexp (bt), (9)

where obviously c = exp (br). Now, recalling equation (1), and computing the time derivative of the latter expression, one has:

dI dt = bI(t) aexp (bt)cbexp (bt) (1 + cexp (bt))2 . (10)

The latter term can be recast in terms of the function I(t) itself as follows:

aexp (bt)cexp (bt) (1 + cexp (bt))2 = a2(exp (bt))2 (1 + cexp (bt))2 cb a = cb a I2(t). (11)

Thus, it is apparent that the differential equation which models the growth of fig.1 is:

dI dt = b cb a I(t)I(t), (12)

where one can clearly recognize the quantity that plays the role of the rate λ(t).

3 Connection with SIR mechanistic models

This model is amenable of epidemiological interpretation in terms of the so called mechanistic models, such as the SIR model[2] (Susceptible Infected Recovered). In fact, under the assumption that the whole population of N individuals is exposed to contagion, one has that the dynamics of the number of infected I(t) and susceptible individuals S(t) is described by the following coupled equations:

dS dt = βSI N , (13) dI dt = βSI N γI, (14) dR dt = γI, (15)

where β is the transmission rate (reciprocal of the mean time between contagions Tβ), γ is the recovery rate (reciprocal of the mean time to recover from the disease or mean hospitalization time Tγ). It is apparent that this model keeps the sum S(t) + I(t) + R(t) = const.t 0. An important parameter associated with this model is the basic reproduction number R0, defined as:

R0 = β γ = Tγ Tβ, (16)

which represents the average number of infected people by a single infected individual during time Tγ (infectious time interval) and plays a very important role in the early stage of an epidemic. As far as time advances, the effective reproduction number reduces proportionally to the susceptible individuals number and is usually denoted as Rt = R0S(t)N.


pict

Figure 2: (a) Evolution of number of people infected by COVID-19 in Italy as of March 12th, 2020. Predictions with model (4) with data updated to March 12th. Saturation around day 40 (April 5th, 2020) is predicted.

If one assumes that separation of time scales is present between infection and recovery, namely γ β, for long enough time the recovered population R(t) S(t) + I(t), which yields S(t) N I(t). By plugging the latter relation into eq. (14), one ends up with the equation for the sole infected population I(t):

dI dt = β(N I)I N γI = (β γ) β NI(t)I(t) = (R0 1) R0 N I(t)γI(t), (17)

where we have factored out γ in the last term and applied eq.(16). The latter equation, compared with eq. (12), suggests to perform the following identification of parameters a,b,r of growth model (4),(9):

b = (R0 1)γ, (18) cb a = exp (br) b a = b I = γR0 N . (19)

These equations can be used to estimate epidemiological parameters from the fitted model.

4 Monitoring COVID-19 outbreak in Italy

In order to test the predictive capability, this model has been applied to perform extrapolations based on data updated to day 32, 35, 40, 45 of the epidemics. The result is reported in fig. 1(b), which shows that the model is able to predict saturation around day 50 already two weeks before. This suggests that this model can be used to monitor the progress of the COVID-19 outbreak in Italy.

To this end, data collected daily by Department of Civil Protection[3] (DPC) are being used to monitor the behavior of the epidemics in Italy. The same independent fitting is applied for each single Region of Italy and the model parameters are extracted. In addition, by using formula (6), the expected saturation day has been estimated on the basis of contagion data. Figure 2 reports the situation of the 9 Regions with higher number of contagions as of March 12th, 2020.

Remarkably, the prediction for the whole nation (lower left corner of fig.2) reports an expected saturation around the beginning of April, slightly more than three weeks from now, consistently with the statement currently made by the DPC. It is also worth noting that the rate b 0.25 is very similar to that observed for China. Thus, following eq. (18) and considering a mean recovery time Tγ 10 days (γ 0.1), one would derive a basic reproduction rate R0 3 which is compatible with that suggested by Chinese epidemiologists[4] for COVID-19 outbreak in China.

5 Latest updates and forecasts as of May 15, 2020

NEW: Automated forecast tool (daily updated) available at
http://wpage.unina.it/mdaquino/covid19/auto/forecast.html

In the following, the latest updates based on daily collected data are presented for Italian regions and for selected world countries. The quantities listed in table 1 are extracted from data and reported in the figures.




quantity description


I = aexp (br)maximum number of infected people at the saturation of contagions t > tsat, see , eq. (5)
tsat = r + 5 bexpected day of contagion saturation, see eq.(6)
tmax = rexpected day of maximum growth rate (maximum number of positive cases per day), see eq.(8)
ΔImax = bI 4 = ba exp (br) 4 maximum contagions per day at day tmax, see eq.(8)



Table 1: Summary of useful quantities as function of model parameters a, b, r to make predictions from fitted data.

5.1 Data for Italian Regions with most contagions


pict

Figure 3: (a) number of people positive to COVID-19 in Italy as of May 15, 2020, from official data[3]. Lower right panel reports the overall situation in Italy. See table 1 for the meaning of symbols. Blue refers to total number of positive cases, red refers to positive cases per day.

5.2 Forecast for peak and saturation of epidemic in Italian Regions with most contagions

In the figure below, the day-by-day forecast for the dates tmax (red) and tsat (blue) associated with the peak (maximum number of positive cases per day) and saturation (maximum number of total positive), respectively.

The standard deviation σ7d over the last 7 days is also computed for each Region. Values of σ7d 1 (and smaller) indicate more stable forecast.

For example, the lower right panel shows that the growth rate peak in Italy would be located around end of March and the saturation of contagions is expected in the beginning of May. Regione Campania, where my hometown Napoli is located, after going through some oscillation, is also expected to reach the saturation in the first decade of May.


pict

Figure 4: (a) Forecast for the contagion saturation and peak dates in Italian Regions as of May 15, 2020, from official data[3]. The standard deviation σ7d over the last 7 days is also computed. Values of σ7d 1 (and smaller) indicate more stable forecast. See table 1 for the meaning of symbols. Blue refers to total number of positive cases, red refers to positive cases per day.

5.3 Data for selected other countries

In the figure below, the day-by-day situation of positive cases in Italy and other countries (Spain, France, Germany, United States of America, United Kingdom) is reported for comparison. The forecasts for peak tmax and saturation tsat as well as the asymptotic number of total positive cases I are reported on the figure panels.


pict

Figure 5: (a) number of people positive to COVID-19 in Italy, Spain, France, Germany, United States of America, United Kingdom as of May 15, 2020, from official data[1] represented in log scale. Upper left panel reports the overall situation in Italy. See table 1 for the meaning of symbols. Notice that 1 day difference with figs. 3-4 is due to misalignment of time between official Italian[3] and EU[1] datasets. Blue refers to total number of positive cases, red refers to positive cases per day.

In the figure below, the day-by-day forecast for the dates tmax (red) and tsat (blue) associated with the peak (maximum number of positive cases per day) and saturation (maximum number of total positive), respectively.

The standard deviation σ7d over the last 7 days is also computed for each country. Values of σ7d 1 (and smaller) indicate more stable forecast.


pict

Figure 6: (a) Forecast for the contagion saturation and peak dates in Italy, Spain, France, Germany, United States of America, United Kingdom as of May 15, 2020. The standard deviation σ7d over the last 7 days is also computed. Values of σ7d 1 (and smaller) indicate more stable forecast. See table 1 for the meaning of symbols. Notice that 1 day difference with figs. 3-4 is due to misalignment of time between official Italian[3] and EU[1] datasets. Blue refers to total number of positive cases, red refers to positive cases per day. Blue refers to total number of positive cases, red refers to positive cases per day.

6 More advanced forecast based on SIRD model as of May 15, 2020

In this section we present data analysis performed with appropriate fitting of the SIRD (Susceptible-Infected-Recovered-Deceased) model, which is an extension of model (13)-(15). By using this model, one can obtain more accurate forecast which overcome the drift of dates reported in figures 4 and 6.

To this end, we report in the following the time evolution of :total number of positive, referred to as C(t), number of currently positive I(t), number of recovered people R(t), number of deaths D(t).

Moreover, the quantities listed in table 2 are extracted from data and reported in the figures.




quantity description


Cmaximum number of infected people at the saturation of contagions t > tsat,C.
tsat,Cexpected day of contagion saturation when C(tsat,C) = 99%C.
tmax,Cexpected day of maximum growth rate (maximum number of positive cases per day).
Rmaximum number of recovered people at the saturation time t > tsat,R.
tsat,Rexpected day of recovered saturation when R(tsat,R) = 99%R.
tmax,Rexpected day of maximum growth rate (maximum number of recovered cases per day).
Dmaximum number of deaths at the saturation time t > tsat,D.
tsat,Dexpected day of deaths saturation when D(tsat,D) = 99%D.
tmax,Dexpected day of maximum growth rate (maximum number of deaths per day).
Imaxmaximum number of positive at day tmax,I
tmax,Iexpected day of maximum positive people (excluding recovered and deceased).
tzero,Iexpected day of positive cases when 99% of recovered is reached.



Table 2: Summary of useful quantities computed from fitted data.

6.1 Data for Italian Regions with most contagions


pict

Figure 7: Number of total positive C(t), positive I(t) at day t, recovered R(t), deceased D(t) in Italy as of May 15, 2020, from official data[3]. Lower right panel reports the overall situation in Italy. See table 2 for the meaning of symbols.

6.2 Forecast for effective reproduction number Rt in Italian Regions with most contagions


pict

Figure 8: Estimation of effective reproduction number Rt in Italian Regions as of May 15, 2020, from official data[3]. Top text in each panel reports the forecast for the date tRt =1 when Rt = 1 (horizontal green line in figure). Lower right panel reports the overall situation in Italy. See table 2 for the meaning of symbols.

6.3 Data for US and selected other EU countries

In the figure below, the day-by-day situation of positive cases in United States and other EU countries (Spain, France, Germany, United Kingdom, Austria, Sweden, Netherlands, Belgium) is reported for comparison. The forecasts for quantities in table 2 are reported on the figure panels.


pict

Figure 9: Number of total positive C(t), positive I(t) at day t, recovered R(t), deceased D(t) in US, Spain, France, Germany, United Kingdom, Austria, Sweden, Netherlands, Belgium as of May 15, 2020, from data made available by JHU CSSE[5]. Lower right panel reports the overall situation in Italy. See table 2 for the meaning of symbols.

6.4 Forecast for effective reproduction number Rt in US and some EU countries


pict

Figure 10: Estimation of effective reproduction number Rt in US and EU countries as of May 15, 2020, from JHU CSSE data[5]. Top text in each panel reports the forecast for the date tRt =1 when Rt = 1 (horizontal green line in figure). See table 2 for the meaning of symbols.

References

[1]   European Center for Disease Control, https://www.ecdc.europa.eu/en/publications-data/download-todays-data-geographic-distribution-covid-19-cases-worldwide, March 12th.

[2]   Kermack W.O., McKendrick A.G. (August 1, 1927). ”A Contribution to the Mathematical Theory of Epidemics”. Proceedings of the Royal Society A. 115 (772): 700-721. https://doi.org/10.1098/rspa.1927.0118.

[3]   Department of Civil Protection, official data repository for COVID-19 outbreak, https://github.com/pcm-dpc/COVID-19/, March 12th.

[4]   Li Q., Guan X., Wu P. et al., Early transmission dynamics in Wuhan, China, of novel coronavirus–infected pneumonia. N Engl J Med. 2020; (published online Jan 29.), https://doi.org/10.1056/NEJMoa2001316.

[5]   Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect Dis; published online Feb 19, https://doi.org/10.1016/S1473-3099(20)30120-1. Data available at https://github.com/CSSEGISandData/COVID-19.