Public 12 Reporting

COVID-19 Case Surveillance Public Use Data Utility Summary

Users should consider the level of completeness, including suppression levels when planning their analyses and use of public datasets. Privacy protections will suppress field values to reduce reidentification risks. Completeness varies by jurisdiction (i.e., state, local, and territorial) and time period. Variables are consistently coded to the

value “Unknown” when jurisdictions specify in the case data submitted to CDC that the value is unknown, the value “Missing” when jurisdictions do not provide a value, and the value “NA” when the value is suppressed as part of privacy protections.

Dataset version: 5/2/2024

Quick Summary summary all_fields_counts all_fields_pct quasi_fields_counts quasi_fields_pct String Double Double Double Double 1 total_rows 105,869,141 NaN% 105,869,141 NaN% 2. total_columns 12 NaN% S NaN% 3 total_cells 1,270,429,692 100.0% 317,607,423 100.0% 4 suppressed_fields 75 0.0% 75 0.0% 5 missing_fields 290,381,180 22.9% 4,317,834 1.4% 6 unknown_fields 86,247,511 6.8% 32,738,708 10.3% 7 non_blank_fields 893,800,926 70.4% 280,550,806 88.3% Field Level Utility Summary variable suppressed suppressed_pct missing missing_pct unknown unknown_pct String Long String Long String Long String 1} sex 12 0.0% 496,836 0.5% 1,031,761 1.0% 2 age_group 51 0.0% 1,128,733 1.1% 0 0.0% 3. race_ethnicity_combined 12 0.0% 2,692,265 2.5% 31,706,947 29.9% 4 records_with_any_quasi_identifier Si. 0.0% 3,998,415 3.8% 32,025,979 30.3%