covdata package aims to make data related to the COVID-19 pandemic easily accessible to users of R. Once the package is installed, load it in the usual way:
library(tidyverse) #> ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.1 ── #> ✔ ggplot2 3.3.3 ✔ purrr 0.3.4 #> ✔ tibble 3.1.1 ✔ dplyr 1.0.5 #> ✔ tidyr 1.1.3 ✔ stringr 1.4.0 #> ✔ readr 1.4.0 ✔ forcats 0.5.1 #> ── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ── #> ✖ dplyr::filter() masks stats::filter() #> ✖ dplyr::lag() masks stats::lag() library(covdata) #> #> Attaching package: 'covdata' #> The following object is masked from 'package:datasets': #> #> uspop
Loading the package makes a variety of datasets available for use. Because the data are in tibbles, the use of the
tidyverse suite of packages is strongly recommended, though it is not required. If use the data objects as dataframes (i.e., without loading the
tidyverse packages) you will need to re-encode some variables, most importantly
date-time columns, for them to behave as expected.
The data are provided as-is. More information about collection methods, scope, limits, and possible sources of error in the data can be found in the documentation provided by their respective sources. Follow the links above, and see the vignettes in the package. The collection and effective reporting of case and mortality data by national governments has technical and political aspects influenced by, amongst other things, the varying capacity of states to test, track and measure events in a timely fashion, the varying definitions, criteria, and methods employed by states in registering cases and deaths, and the role of politics in the exercise of capacity and the reporting of unflattering news. Researchers should take care to familiarize themselves with these issues prior to making strong claims based on these data.