Channel Coding for DNA Data Storage

DNA storage refers to the idea of using DNA as data storage, whereby the nucleotides making up the sequence encode for the information. The data is stored by synthesizing the desired DNA sequence and retrieved by means of DNA sequencing. While DNA based data storage has enormous potential with respect to capacity and durability, this potential is accompanied by challenges arising from reading and writing the information from and to the DNA, but also the DNA itself. The DNA inherent challenges are for example mutations, deletions, insertions and constraints on the sequence used. To overcome these challenges, the goal of this project is to develop a coding scheme matching the needs of DNA storage and allowing for robust information storage.

Knowledge on coding theory aids in the design of a channel code specific for DNA data storage. For the implementation and evaluation of channel codes programming skills are important.

Additional Information

Project Capacity Three IREP student
Project available for Spring, Summer and Fall 2024
Credits 18
Available via Remote No
Project Supervisor Erik Kubaczka