PSAP

Context

Peptides are increasingly investigated in therapeutics, cosmetics, biomaterials, and drug delivery. Yet they are difficult to characterize computationally: they often exhibit high conformational flexibility, several relevant conformations may coexist, and structural predictions depend on environmental conditions. The standard tools have limits of their own: docking scores alone are poor predictors of biological activity, and molecular dynamics results depend on the force field and simulation protocol.

PSAP was built to answer a central R&D question: can a reproducible computational workflow prioritize peptide candidates from public literature based on structural plausibility, stability, and robustness?

Approach

Unlike demonstration projects focused solely on running AlphaFold, PSAP reproduces a realistic early-stage R&D workflow in which computational predictions are treated as hypotheses, not conclusions. The pipeline automates the full path from literature retrieval to scientific reporting, in six stages:

Literature mining and peptide dataset curation from PubMed, PDB, and UniProt
Structure prediction with ESMFold, ColabFold, and AlphaFold, benchmarked against experimental structures
Rosetta relax and scoring, with OpenMM fallback when PyRosetta is unavailable
GROMACS molecular dynamics with optional AMBER/OpenMM cross-validation
Environmental robustness assessment and trajectory analysis
Self-hosted LLM scientific review (Qwen on EU infrastructure) and automated reporting
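
The fallback described for the relax and scoring stage can be illustrated with a small dependency check. This is a minimal sketch, not PSAP's actual code: the function names, the `RELAX_BACKENDS` tuple, and the `"synthetic"` return value are hypothetical, and the only assumption taken from the stage list is the preference order (PyRosetta first, OpenMM second).

```python
import importlib.util
from typing import Callable


def module_available(name: str) -> bool:
    """Return True if a top-level module called `name` can be imported here."""
    return importlib.util.find_spec(name) is not None


# Preference order assumed from the stage list: PyRosetta, then OpenMM.
RELAX_BACKENDS = ("pyrosetta", "openmm")


def choose_relax_backend(
    is_available: Callable[[str], bool] = module_available,
) -> str:
    """Pick the first importable backend, or 'synthetic' if none is installed.

    'synthetic' is a hypothetical label for a placeholder stage; it is not
    a documented PSAP value.
    """
    for name in RELAX_BACKENDS:
        if is_available(name):
            return name
    return "synthetic"


# The result depends on what is installed in the current environment.
print(choose_relax_backend())
```

Passing the availability check as a parameter keeps the selection logic testable on a machine where neither package is installed.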

Solution

PSAP runs in three execution modes: demo (laptop, minutes to hours), standard (GPU workstation), and research (GPU server or HPC). When heavy dependencies are absent, stages emit clearly labeled synthetic artifacts so the full pipeline still completes and produces a report.
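
One way to keep synthetic artifacts clearly labeled is to carry an explicit flag on every stage output, so the final report can never present a placeholder as a real result. The sketch below assumes this design; `Mode`, `Artifact`, `run_stage`, and `report` are hypothetical names, and only the three mode names come from the description above.

```python
import json
from dataclasses import asdict, dataclass
from enum import Enum
from typing import Callable, List, Optional


class Mode(str, Enum):
    DEMO = "demo"
    STANDARD = "standard"
    RESEARCH = "research"


@dataclass
class Artifact:
    stage: str
    mode: str
    synthetic: bool  # True when the stage could not actually run
    payload: dict


def run_stage(
    stage: str, mode: Mode, compute: Optional[Callable[[], dict]] = None
) -> Artifact:
    """Run `compute` if one is given; otherwise emit a labeled placeholder."""
    if compute is None:
        note = {"note": "placeholder values, not a real result"}
        return Artifact(stage, mode.value, True, note)
    return Artifact(stage, mode.value, False, compute())


def report(artifacts: List[Artifact]) -> str:
    """Serialize artifacts; the synthetic flag travels with every entry."""
    return json.dumps([asdict(a) for a in artifacts], indent=2)


# Example: one real stage and one stage whose dependency is missing.
results = [
    run_stage("relax", Mode.DEMO, lambda: {"score": -1.0}),  # made-up value
    run_stage("md", Mode.DEMO),
]
print(report(results))
```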

The framework explicitly rejects oversimplified assumptions: high pLDDT does not imply biological relevance, good docking scores do not imply activity, stable RMSD does not imply correctness, and short simulations do not prove equilibrium. Each computational observation is mapped to a relevant experimental validation (CD, DSC, NMR, SPR, ITC, LC-MS).
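
The mapping from observation to validation can be pictured as a lookup table. In the sketch below, the caveat wording is taken from the paragraph above, but the pairing of each observation with specific assays is an assumption made for illustration only; the source does not say which assay PSAP attaches to which observation, and LC-MS is left out simply because no pairing was obvious. All names (`Caveat`, `CAVEATS`, `annotate`) are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Caveat:
    statement: str           # wording taken from the text above
    assays: Tuple[str, ...]  # hypothetical pairing, for illustration only


CAVEATS: Dict[str, Caveat] = {
    "high_plddt": Caveat(
        "high pLDDT does not imply biological relevance", ("CD", "NMR")
    ),
    "good_docking_score": Caveat(
        "good docking scores do not imply activity", ("SPR", "ITC")
    ),
    "stable_rmsd": Caveat(
        "stable RMSD does not imply correctness", ("CD", "DSC")
    ),
    "short_simulation": Caveat(
        "short simulations do not prove equilibrium", ("NMR",)
    ),
}


def annotate(observations: List[str]) -> List[str]:
    """Attach the caveat and suggested assays to each observation key."""
    notes = []
    for key in observations:
        caveat = CAVEATS[key]  # an unknown key raises KeyError on purpose
        assays = ", ".join(caveat.assays)
        notes.append(f"{caveat.statement}; suggested check: {assays}")
    return notes


for line in annotate(["high_plddt", "good_docking_score"]):
    print(line)
```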

Key outcome

A reproducible framework that combines modern computational tools to generate robust, experimentally testable hypotheses, with uncertainty quantification, model comparison, and validation planning built in.
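
Model comparison, one of the built-in components named above, could be approached by measuring how much two predicted structures of the same peptide disagree. The sketch below uses distance RMSD (dRMSD), which compares intramolecular distance matrices and so needs no superposition. It is a swapped-in illustration, not PSAP's documented metric; the function names and the toy coordinates are hypothetical.

```python
import math
from itertools import combinations
from typing import Dict, List, Sequence, Tuple

Coord = Sequence[float]


def pairwise_distances(coords: List[Coord]) -> List[float]:
    """All i<j distances between atoms (e.g. C-alpha positions)."""
    return [math.dist(a, b) for a, b in combinations(coords, 2)]


def drmsd(coords_a: List[Coord], coords_b: List[Coord]) -> float:
    """Root-mean-square difference between two internal distance matrices."""
    if len(coords_a) != len(coords_b):
        raise ValueError("structures must have the same number of atoms")
    da, db = pairwise_distances(coords_a), pairwise_distances(coords_b)
    if not da:
        raise ValueError("need at least two atoms")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(da, db)) / len(da))


def consistency_table(
    models: Dict[str, List[Coord]],
) -> Dict[Tuple[str, str], float]:
    """dRMSD for every pair of named predictions."""
    return {
        (m, n): drmsd(models[m], models[n])
        for m, n in combinations(sorted(models), 2)
    }


# Toy three-atom "structures"; the coordinates are invented for the example.
extended = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
shifted = [(x + 1.0, y + 2.0, z) for x, y, z in extended]  # same shape
bent = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (3.8, 3.8, 0.0)]

table = consistency_table(
    {"esmfold": extended, "colabfold": shifted, "alphafold": bent}
)
for pair, value in table.items():
    print(pair, round(value, 3))
```

Because dRMSD depends only on internal distances, it is unaffected by translation and rotation, but it also cannot distinguish a structure from its mirror image. A superposition-based RMSD would be needed where chirality matters.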

Applications

Early-stage peptide R&D: prioritize candidates from literature based on structural plausibility and stability
Cosmetics and biomaterials: assess conformational robustness under environmental stress
Model comparison studies: evaluate consistency between ESMFold, ColabFold, and AlphaFold predictions

PSAP supports natural-language queries over pipeline results, for example:

"Which peptides should be prioritized for experimental validation?"
