Date: April 05, 2025
Classification: Frontiers
Literature Overview
This article, titled 'Data accuracy in the European Cystic Fibrosis Society Patient Registry: results of an on-site data validation project', was published in the journal Orphanet Journal of Rare Diseases. It reviews and summarizes the ECFSPR data quality validation project, aiming to assess the consistency between registry data and original medical records, and to confirm its reliability for epidemiological studies. Through on-site validation, the study analyzed records of 4,024 patients from 133 centers across 34 countries, revealing high accuracy in key variables such as demographics, transplant status, and annual disease progression data, while also identifying potential areas for optimization, particularly for complex variables such as lung function and genetic information.
Background Knowledge
Cystic Fibrosis (CF) is a rare inherited metabolic disorder caused by mutations in the CFTR gene, affecting the function of multiple organs including the lungs, pancreas, and liver. Patient registries are critical data sources for studying epidemiology, treatment patterns, and disease burden in rare diseases. However, their quality is often affected by the complexity of data sources, the standardization of variable definitions, and the consistency of data entry. As the largest CF registry in Europe, ECFSPR integrates data from 54,000 patients across 40 countries, but its data quality had not been systematically validated prior to this study. This research fills that gap by conducting on-site validation and providing empirical evidence for improving registry quality. The study specifically examined whether ECFSPR data meet the data quality requirements set by EMA and FDA for drug studies, and explored how cross-country standardization and data entry training affect data accuracy. It not only confirms the high accuracy of the registry on key variables but also provides methodological insights for future multinational rare disease registry systems.
Research Methods and Procedures
The research team conducted on-site data validation from November 2018 to April 2024, involving 4,024 CF patients across 133 centers in 34 participating countries. The variables assessed included demographics, diagnosis information, transplant status, and annual disease progression (e.g., height, weight, lung function, infections, complications, medication use). The study used standardized variable definitions, comparing registry data with original medical health records (MHR) to calculate data accuracy rates. The validation process also included analysis of data entry procedures at both center and country levels, and sensitivity analyses were performed to account for variations in patient numbers and validation coverage across countries.
Key Findings and Insights
Significance and Future Directions
This study is the first systematic evaluation of the ECFSPR data quality. It confirms the registry's high accuracy on key variables, supporting its use in epidemiological research, treatment monitoring, and clinical trials. At the same time, it highlights areas for improvement in complex variables such as height, weight, lung function, and genetic data, emphasizing the importance of standardized definitions and verification of original reports. Looking forward, ECFSPR plans to expand validation efforts to the remaining countries and further optimize the data entry process to enhance cross-country data consistency.
Conclusion
Through on-site data validation involving 4,024 cystic fibrosis patients across 133 centers and 34 countries, this study systematically assessed the data quality of the European Cystic Fibrosis Society Patient Registry (ECFSPR). The results indicate that ECFSPR demonstrates very high accuracy in variables such as birth date, gender, transplant status, and annual disease progression data (>94%). However, there remains room for improvement in height, weight, lung function, and genetic information, largely limited by heterogeneous data sources and inconsistent entry standards. The study highlights the importance of standardized definition training, verification of original genetic reports, and cross-country data harmonization. These findings not only provide empirical support for the continuous improvement of ECFSPR, but also offer a valuable quality management framework for other multinational rare disease registries. Going forward, ECFSPR will continue to enhance data validation and improve data consistency across countries, further supporting precision medicine and global clinical trial design.