CDISC CORE Streamlit Validator

CDISC CORE Streamlit Validator

Project Overview

The CDISC CORE Streamlit Validator is an interactive web application designed to simplify the validation of SDTM and ADaM datasets using the official CDISC rules engine. It provides a modern, intuitive interface for uploading datasets, running automated validation, and exploring results visually — making CDISC compliance accessible to programmers and non-programmers alike.

Key Features

  • Multi-format upload: XPT, CSV, and XLSX support.
  • Automatic CSV → XPT conversion for CORE compatibility.
  • Real-time validation pipeline powered by the CDISC CORE engine.
  • Interactive dashboards summarizing errors, warnings, and passed rules.
  • Rule-level filtering by domain, severity, or message text.
  • Excel export for documentation and regulatory workflows.

Technologies Used

  • Python — core logic and data handling.
  • Streamlit — fully interactive UI.
  • CDISC CORE (cdisc-rules-engine) — official rules validation.
  • Pandas & Dask — fast dataset processing.
  • Plotly — real-time visual summaries.
  • Pyreadstat — XPT handling and conversions.

Applications

  • Clinical Programming: Validate SDTM/ADaM datasets before submission.
  • Biostatistics: Ensure dataset integrity before analysis.
  • Data Management: Perform fast QC checks during study development.
  • Internal QA: Create validation reports for documentation.

Conclusion

The CDISC CORE Streamlit Validator transforms complex regulatory validation into a clean, accessible, and highly visual workflow. It enables clinical teams to ensure CDISC compliance quickly and confidently, without relying on command-line tools or manual report parsing.

Project information

  • Category Clinical Data Tools
  • Project date December 2025
  • Technologies Python, Streamlit, CDISC CORE
  • Project Type Application Development