论文标题
通过:短肽序列的从头汇编器
PASS: De novo assembler for short peptide sequences
论文作者
论文摘要
以序列水平分辨率表征蛋白质的能力对于生物学研究至关重要。目前,蛋白质测序的主要方法是通过液相色谱质谱法(LC-MS),而蛋白质通过酶促消化物还原为其组成肽,然后在LC-MS仪器上进行分析。该分析产生的短肽序列用于表征样品的原始蛋白质含量。在这里,我们介绍了短肽序列的从头组装程序,可用于重建大部分蛋白质靶标,这一步骤可以促进下游样品表征工作。我们展示了如何通过足够的肽序列覆盖范围和几乎没有额外的序列处理,将蛋白质序列重建成具有高(93.1-99.1%)序列身份的相对较大(100氨基酸或更长的)重叠群,以使参考抗体光和重链蛋白质。可用性:通过GNU通用公共许可证版本3(GPLV3)发布,可从https://github.com/warrenlr/pass公开获得
The ability to characterize proteins at sequence-level resolution is vital to biological research. Currently, the leading method for protein sequencing is by liquid chromatography mass spectrometry (LC-MS) whereas proteins are reduced to their constituent peptides by enzymatic digest and subsequently analyzed on an LC-MS instrument. The short peptide sequences that result from this analysis are used to characterize the original protein content of the sample. Here we present PASS, a de novo assembler for short peptide sequences that can be used to reconstruct large portions of protein targets, a step that can facilitate downstream sample characterization efforts. We show how, with adequate peptide sequence coverage and little-to-no additional sequence processing, PASS reconstructs protein sequences into relatively large (100 amino acid or longer) contigs having high (93.1 - 99.1%) sequence identity to reference antibody light and heavy chain proteins. Availability: PASS is released under the GNU General Public License Version 3 (GPLv3) and is publicly available from https://github.com/warrenlr/PASS