Skip to content

Demonstrates the methods of suppressing small cell counts in a provincial surveillance system to prepare data for public release

License

Notifications You must be signed in to change notification settings

IHACRU/suppress-for-release

Repository files navigation

Publications and Presentations

  • Koval, A., Smolina, K., & Leamon, A. (2020). Using Reproducible Data Visualizations to Augment Decision-Making During Suppression of Small Counts. International Journal of Population Data Science, 5(5). ( talk at the 2020 Conference of the International Population Data Linkkage Network, Adelaide, Australia (virtual) slides)

  • Koval, A., Leamon, A., Smolina, K. (2019) Suppressing Small Counts for Public Release: Applications of Reproducible Analytics in Chronic Disease Surveillance. Poster presented at the annual conference of the Canadian Association for Health Services and Policy Research, Halifax, NS. (link)

  • Koval, A., Leamon, A., (2018) Constructing Workflows for Reproducible Analytics: Suppressing small counts for provincial chronic disease dashboard. Invited talk at the colloquium series by the Institute on Aging and Lifelong Health, Victoria, BC. (link)

suppress-for-release

Demonstrates the methods of suppressing small counts in a provincial surveillance system in preparation of data for public release. animation

Background

When a surveillance agency intends to release incidence counts of some health conditions (like in BC Chronic Disease Dashboard), one must take precaution NOT to disclose values considered "too small", which may present a privacy/re-identification risk. Howeever, redacted values sometimes can be re-calculated from the context, so an analyst must detect these patterns and redact more values in order to remove the possiblity of re-calculation. To avoid manual redaction, which is prone to human error and lacks transparency, BC Observatory has developed a suite of R functions to arrive at recommendation for redaction automatically, based on logical tests developed for standard data forms.

For detailed background of the problem this project addresses, please view the slides from the Community of Practice presentation at BCCDC on 2018-03-07 by Brent and Andriy. For the update on the suppression logic (vesion 2) please see slides by Anthony Leamon.

Workflow

The following scripts comprise the workflow of the mechanized redaction of small cells:

workflow

Key functions

dependency_tree

Team & Funders

The automated small cell suppression for public release project is part of the work flow for annual updates and public release of the Chronic Disease Dashboard developed by the BC Observatory for Population & Public Health based on data provided by the BC Ministry of Health, Provincial Health Officer's Office.

Reproduction

If you wish to follow along, please install the latest version of RStudio, clone/download this repository and makes sure the following script can execute without errors:

library(ggplot2)   
library(magrittr)  
library(dplyr)     
library(readr)     
library(testit)    
library(tidyr)     
library(rmarkdown) 

About

Demonstrates the methods of suppressing small cell counts in a provincial surveillance system to prepare data for public release

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published