论文标题
关于拆卸地面真理的产生和拆卸器的评估
On the Generation of Disassembly Ground Truth and the Evaluation of Disassemblers
论文作者
论文摘要
当软件转换或软件安全任务需要分析给定的程序二进制时,第一步通常是拆卸的。由于许多现代拆卸器在许多二进制中都变得非常准确,因此我们认为可靠的拆卸基准测试需要标准化所使用的二进制组以及这些二进制文件的拆卸基础真相。本文介绍了(i)我们正在制作的拆卸基准套件的第一版,该套件由879个二进制文件组成,这些二进制文件来自各种项目,其中包含多个编译器和优化设置,以及(ii)一个新颖的拆卸真理生成器,利用了“清单文件”的概念,该列表的概念由Clang,GCC,GCC,ICC,ICC,ICC和MSSVC。另外,它还介绍了我们使用此基准套件和一个自定义评估系统对四个突出的开源拆卸器的评估。我们的整个系统和所有生成的数据均在GitHub上公开维护,以鼓励社区采用。
When a software transformation or software security task needs to analyze a given program binary, the first step is often disassembly. Since many modern disassemblers have become highly accurate on many binaries, we believe reliable disassembler benchmarking requires standardizing the set of binaries used and the disassembly ground truth about these binaries. This paper presents (i) a first version of our work-in-progress disassembly benchmark suite, which comprises 879 binaries from diverse projects compiled with multiple compilers and optimization settings, and (ii) a novel disassembly ground truth generator leveraging the notion of "listing files", which has broad support by Clang, GCC, ICC, and MSVC. In additional, it presents our evaluation of four prominent open-source disassemblers using this benchmark suite and a custom evaluation system. Our entire system and all generated data are maintained openly on GitHub to encourage community adoption.