PuSH - Publication Server of Helmholtz Zentrum München

Caught you: Threats to confidentiality due to the public release of large-scale genetic data sets.

BMC Med. Ethics 11:21 (2010)
Publ. Version/Full Text Volltext DOI PMC
Open Access Gold
Large-scale genetic data sets are frequently shared with other research groups and even released on the Internet to allow for secondary analysis. Study participants are usually not informed about such data sharing because data sets are assumed to be anonymous after stripping off personal identifiers. DISCUSSION: The assumption of anonymity of genetic data sets, however, is tenuous because genetic data are intrinsically self-identifying. Two types of re-identification are possible: the "Netflix" type and the "profiling" type. The "Netflix" type needs another small genetic data set, usually with less than 100 SNPs but including a personal identifier. This second data set might originate from another clinical examination, a study of leftover samples or forensic testing. When merged to the primary, unidentified set it will re-identify all samples of that individual.Even with no second data set at hand, a "profiling" strategy can be developed to extract as much information as possible from a sample collection. Starting with the identification of ethnic subgroups along with predictions of body characteristics and diseases, the asthma kids case as a real-life example is used to illustrate that approach. SUMMARY: Depending on the degree of supplemental information, there is a good chance that at least a few individuals can be identified from an anonymized data set. Any re-identification, however, may potentially harm study participants because it will release individual genetic disease risks to the public.
Impact Factor
Scopus SNIP
Web of Science
Times Cited
Scopus
Cited By
Altmetric
0.000
0.964
26
37
Tags
Annotations
Special Publikation
Hide on homepage

Edit extra information
Edit own tags
Private
Edit own annotation
Private
Hide on publication lists
on hompage
Mark as special
publikation
Publication type Article: Journal article
Document type Scientific Article
Keywords GENOMIC RESEARCH; PRIVACY; INFORMATION
Language english
Publication Year 2010
HGF-reported in Year 2011
e-ISSN 1472-6939
Quellenangaben Volume: 11, Issue: 1, Pages: , Article Number: 21 Supplement: ,
Publisher BioMed Central
Reviewing status Peer reviewed
POF-Topic(s) 30202 - Environmental Health
Research field(s) Lung Research
PSP Element(s) G-505000-003
PubMed ID 21190545
Scopus ID 78650591265
Erfassungsdatum 2011-01-18