DNA-Based Data Storage

DNA Data Storage R&D Pipeline

A research pipeline for encoding digital data into synthetic DNA sequences. It combines sequence constraints, noise simulation, and error correction so that the data can still be decoded after realistic degradation.

Key Features

  • Bit-to-nucleotide encoding with GC-content and homopolymer-length constraints.
  • Noise simulation (mutations, indels, sequencing errors).
  • Reed-Solomon error correction for recovery.
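
As a rough illustration of the first two features, the sketch below maps bits to nucleotides, checks GC content and homopolymer runs, and injects random substitutions, insertions, and deletions. It is a minimal sketch, not the project's implementation: the two-bits-per-base mapping, the 40-60% GC window, the run limit of 3, and all function names are assumptions chosen for illustration. It uses only the Python standard library, and it checks the constraints rather than enforcing them. Reed-Solomon coding is left out.

```python
import random

# Hypothetical 2-bits-per-base mapping; the project's actual mapping is not stated.
BITS_TO_BASE = {"00": "A", "01": "C", "10": "G", "11": "T"}
BASE_TO_BITS = {base: bits for bits, base in BITS_TO_BASE.items()}


def encode(data: bytes) -> str:
    """Map each byte to four nucleotides, two bits per base."""
    bits = "".join(f"{byte:08b}" for byte in data)
    return "".join(BITS_TO_BASE[bits[i:i + 2]] for i in range(0, len(bits), 2))


def decode(seq: str) -> bytes:
    """Inverse of encode; assumes an error-free sequence of whole bytes."""
    bits = "".join(BASE_TO_BITS[base] for base in seq)
    return bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    return sum(base in "GC" for base in seq) / len(seq) if seq else 0.0


def max_homopolymer(seq: str) -> int:
    """Length of the longest run of one repeated base."""
    longest = run = 0
    prev = None
    for base in seq:
        run = run + 1 if base == prev else 1
        longest = max(longest, run)
        prev = base
    return longest


def satisfies_constraints(seq: str, gc_min: float = 0.4, gc_max: float = 0.6,
                          max_run: int = 3) -> bool:
    """Check the assumed GC window and homopolymer limit."""
    return gc_min <= gc_content(seq) <= gc_max and max_homopolymer(seq) <= max_run


def add_noise(seq: str, sub_rate: float, ins_rate: float, del_rate: float,
              rng: random.Random) -> str:
    """Apply independent per-base deletions, substitutions, and insertions."""
    out = []
    for base in seq:
        r = rng.random()
        if r < del_rate:
            continue  # deletion: drop this base
        if r < del_rate + sub_rate:
            # substitution: replace with a different base
            out.append(rng.choice([b for b in "ACGT" if b != base]))
        else:
            out.append(base)
        if rng.random() < ins_rate:
            out.append(rng.choice("ACGT"))  # insertion after this base
    return "".join(out)
```

A typical use would encode a payload with encode(), reject or re-encode strands for which satisfies_constraints() is false, and pass accepted strands through add_noise() with a seeded random.Random to produce reproducible corrupted reads for testing the decoder.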

Technologies Used

  • Python, NumPy, error-correcting codes

Project information

  • Category: Research
  • Status: Prototype
  • Project URL: Available on request