
Model definition: estimate_truncation()
Source:vignettes/estimate_truncation.Rmd
estimate_truncation.RmdThis model deals with the problem of nowcasting, or
adjusting for right-truncation in reported count data. This occurs when
the quantity being observed, for example cases, hospitalisations or
deaths, is reported with a delay, resulting in an underestimation of
recent counts. The estimate_truncation() model infers
parameters of the underlying delay distribution from multiple snapshots
of past data. This can be thought of as a Bayesian form of the
chain-ladder nowcasting approach implemented in the baselinenowcast
package, with the added benefit of joint uncertainty quantification and
delay estimation. For settings requiring time-varying delays or more
detailed reporting structure, see the epinowcast package.
Both estimate_truncation() and
estimate_dist() return a delay distribution that downstream
functions such as estimate_infections(),
estimate_secondary(), or a further call to
estimate_truncation() can consume. The main difference is
the data they expect: estimate_truncation() takes
successive snapshots of the same aggregate counts (a reporting
triangle), while estimate_dist() takes individual-level
(linelist) data with primary and secondary event dates. Because it works
from aggregate counts rather than individual records,
estimate_truncation() also fits an observation model for
the counts on top of the delay, whereas estimate_dist()
estimates the delay distribution alone. As a rough decision rule, use
estimate_dist() when you have a linelist and
estimate_truncation() when you have repeated snapshots of
aggregate counts.
Model
Given snapshots
reflecting reported counts for time
where
is in order of recency (earliest snapshots first) and
is the number of past snapshots used for estimation, we infer the
parameters
of a discrete truncation distribution with cumulative mass function
.
The truncation distribution can be any family supported by
dist_spec (e.g. log-normal, gamma).
The model assumes that final counts are related to observed snapshots via the truncation distribution such that
where
is the date of the final observation in snapshot
,
is defined to be zero for negative values of
,
is an additive noise term (controlled via the noise
argument), and
is the observation model (Poisson or negative binomial, controlled via
obs_opts()).
The final counts are estimated from the most recent snapshot as