论文标题
基于索引的串联代码多绘制DNA存储通道
Index-Based Concatenated Codes for the Multi-Draw DNA Storage Channel
论文作者
论文摘要
我们考虑用于基于DNA的存储的错误校正编码。我们将DNA存储通道建模为多绘图IDS通道,其中输入数据分为$ M $短DNA链,并将其复制到随机数量,并且该通道输出随机选择$ n $ noisy DNA链。检索到的DNA链容易插入,缺失和替换(IDS)错误。我们提出了一个基于索引的串联编码方案,该方案包括外部代码的串联,索引代码和内部同步代码,后两个铲球IDS错误。我们进一步提出了一个不匹配的关节指数同步代码的最大后验概率解码器,可选聚类以推断外解解码器的后验概率。我们分别计算出外部代码的可实现的信息率,并分别针对合成和实验数据的蒙特卡罗模拟进行了蒙特卡洛模拟。
We consider error-correcting coding for DNA-based storage. We model the DNA storage channel as a multi-draw IDS channel where the input data is chunked into $M$ short DNA strands, which are copied a random number of times, and the channel outputs a random selection of $N$ noisy DNA strands. The retrieved DNA strands are prone to insertion, deletion, and substitution (IDS) errors. We propose an index-based concatenated coding scheme consisting of the concatenation of an outer code, an index code, and an inner synchronization code, where the latter two tackle IDS errors. We further propose a mismatched joint index-synchronization code maximum a posteriori probability decoder with optional clustering to infer symbolwise a posterior probabilities for the outer decoder. We compute achievable information rates for the outer code and present Monte-Carlo simulations for information-outage probabilities and frame error rates on synthetic and experimental data, respectively.