COVID-19 Case Surveillance Public Use Data Profile (2024-05-02 version)

e Basic Statistics e Missing Data Profile e Univariate Distribution

o Bar Chart (with frequency)

Basic Statistics

Row Counts Item Counts String Long 1 Rows 105869141 2 Columns | 12 | 3. Rows With Null or Missing Values | 101130306 | 4 Rows With No Null or Missing Values | 4738835 |

Missing Data Profile

Missing/null Data Profile

sex race_ethnicity_combined pos_spec_dt

onset_dt

medcond_yn

icu_yn

hosp_yn

Features

death_yn 54.02% current_status

cdc_report_dt

cdc_case_earliest_dt -0

age_group 71.07%

0% 20% 40% 60% 80% 100% % of Total Rows

Univariate Distributions

age_group

16,000,000 12,000,000 ce =) {o) Oo 4 & 8,000,000 4,000,000 é a S S S S S fo \ we we we we Ae . : & 2 &) © A) Ss tS & S ~e age_group cdc_case_earliest_dt 800,000 600,000 & a [o) oO 3 400,000 200,000 0 2020 2022 2023 2024

cdc_case_earliest_dt by exact time

cdc_report_dt

2,000,000

1,600,000

1,200,000

. c 3 jo} ) = fe) ao

800,000

400,000

2022

cdc_report_dt by day

current_status death_yn

90,000,000 60,000,000

80,000,000

50,000,000 70,000,000

60,000,000 40,000,000

50,000,000

30,000,000

Row count Row count

40,000,000

20,000,000

30,000,000

20,000,000 10,000,000

10,000,000

0

Laboratory-confirmed Probable Case case

current_status

Row count

Row count

hosp_yn icu_yn

100,000,000 + 50,000,000 4 80,000,000 4 40,000,000 4 60,000,000 4 30,000,000 4 7 UO = 20,000,000 40,000,000 5 10,000,000 4 20,000,000 4 0 0 ae es Ss Rg SS y w es ys se hosp_yn icu_yn J 60,000,000 80,000,000 4 50,000,000 60,000,000 4 4.» 40,000,000 Ss | {eo} Y 30,000,000 40,000,000 4 z jag J 20,000,000 20,000,000 4 10,000,000 12 0 0 2 o \ A) Ne Ne x cas es & = . & cS s se .< <— * RS » « s *

medcond_yn sex

Row count

Row count

40,000,000

30,000,000

20,000,000

10,000,000

250,000

200,000

150,000

100,000

50,000

race_ethnicity_combined

698,732 194,874

o AS) xy S NO Le xo? ~ RS GREE NC EOS . J Se Swe Sr x ws ww Ss a Ss. 2? x Le So Pas ee ye Swe s se » oO

race_ethnicity_combined

onset_dt

onset_dt by exact time

Row count

500,000

400,000

300,000

200,000

100,000

2020

pos_spec_dt

| 2022

pos_spec_dt by exact time

2023

2024