论文标题
网卡:网络数据的简洁,可读的摘要
Network Cards: concise, readable summaries of network data
论文作者
论文摘要
网络数据集的泛滥需要一种标准方法,以有效,简洁地总结网络数据集。基于类似的努力来标准化机器学习中模型和数据集的文档,我们在这里提出网络卡,网络数据集的简短摘要,这些摘要不仅可以捕获网络的基本统计数据,还可以捕获有关数据构建过程,出处,道德考虑以及其他元数据的信息。在本文中,我们列出了(1)应包含在网卡中的关键元素的理由和目标,以及(3)示例网卡,以强调在各种研究领域的好处。我们还提供了用于生成网卡的模式,模板和软件包。
The deluge of network datasets demands a standard way to effectively and succinctly summarize network datasets. Building on similar efforts to standardize the documentation of models and datasets in machine learning, here we propose network cards, short summaries of network datasets that can capture not only the basic statistics of the network but also information about the data construction process, provenance, ethical considerations, and other metadata. In this paper, we lay out (1) the rationales and objectives for network cards, (2) key elements that should be included in network cards, and (3) example network cards to underscore their benefits across a variety of research domains. We also provide a schema, templates, and a software package for generating network cards.