Data validation

LF Lucas M. Fleuren
TD Tariq A. Dam
MT Michele Tonutti
DB Daan P. de Bruin
RL Robbert C. A. Lalisang
DG Diederik Gommers
OC Olaf L. Cremer
RB Rob J. Bosman
SR Sander Rigter
EW Evert-Jan Wils
TF Tim Frenzel
DD Dave A. Dongelmans
RJ Remko de Jong
MP Marco Peters
MK Marlijn J. A. Kamps
DR Dharmanand Ramnarain
RN Ralph Nowitzky
FN Fleur G. C. A. Nooteboom
WR Wouter de Ruijter
LU Louise C. Urlings-Strop
ES Ellen G. M. Smit
DM D. Jannet Mehagnoul-Schipper
TD Tom Dormans
CJ Cornelis P. C. de Jager
SH Stefaan H. A. Hendriks
SA Sefanja Achterberg
EO Evelien Oostdijk
AR Auke C. Reidinga
BF Barbara Festen-Spanjer
GB Gert B. Brunnekreef
AC Alexander D. Cornet
WT Walter van den Tempel
AB Age D. Boelens
PK Peter Koetsier
JL Judith Lens
HF Harald J. Faber
AK A. Karakus
RE Robert Entjes
PJ Paul de Jong
TR Thijs C. D. Rettig
SA Sesmu Arbous
SV Sebastiaan J. J. Vonk
MF Mattia Fornasa
TM Tomas Machado
TH Taco Houwert
HH Hidde Hovenkamp
RN Roberto Noorduijn-Londono
DQ Davide Quintarelli
MS Martijn G. Scholtemeijer
AB Aletta A. de Beer
request Request a Protocol
ask Ask a question
Favorite

Data validation and quality control were integrated throughout the project. The internal validity of the data was safeguarded by incorporating data that were validated by the clinical team during routine care, comparing calculated clinical scores against the manually recorded benchmarking scores from national registry data, and by data verification checks with the original hospital. In addition, several checkpoints ensured accurate processing of the data throughout the ETL and data processing pipeline. First, patient tables, headers, and column data were checked for completeness in the ETL pipeline. Secondly, parameter mappings were checked by an intensive care clinician and were therefore independently performed by two clinicians. Next, value distribution plots were continuously generated as part of the processing pipeline. These plots show the distribution of all parameters from all hospitals that were mapped to a certain concept and easily identify aberrant mappings. For all concepts, medically impossible cutoff values were determined by the clinical domain experts. Finally, demographics and any inconsistencies in the distributions or mapping were validated with their original hospital.

Do you have any questions about this protocol?

Post your question to gather feedback from the community. We will also invite the authors of this article to respond.

0/150

tip Tips for asking effective questions

+ Description

Write a detailed description. Include all information that will help others answer your question including experimental processes, conditions, and relevant images.

post Post a Question
0 Q&A