论文标题
数据科学工作流的云计算平台的比较综述
Comparative Review of Cloud Computing Platforms for Data Science Workflows
论文作者
论文摘要
云计算以服务为服务,软件作为服务以及作为服务的基础架构提供的优点,数据工程师和数据科学家能够为其ETL/ELT(提取,转换和负载)以及ML(机器学习)要求和部署利用云计算。 The proposed framework for the comparative review of cloud computing platforms for data science workflows uses an amalgamation of the analytical hierarchy process, Saaty's fundamental scale of absolute numbers, and a selection of relevant evaluation criteria (namely: automation, error handling, fault tolerance, performance quality, unit testing, data encryption, monitoring, role based access, security, availability, ease of use, integration and互操作性)。该框架使用户能够评估与数据科学工作流有关云平台有关的条件,并且还可以根据上述标准的相对重要性建议哪种云平台适合用户。表明对标准的评估是一致的,因此标准对云服务提供商或云平台选择的目标的加权是明智的。提议的框架足以适应标准和替代方案的变化,具体取决于用户云平台的要求和云平台选择的范围。
With the advantages that cloud computing offers in terms of platform as a service, software as a service, and infrastructure as a service, data engineers and data scientists are able to leverage cloud computing for their ETL/ELT (extract, transform and load) and ML (machine learning) requirements and deployments. The proposed framework for the comparative review of cloud computing platforms for data science workflows uses an amalgamation of the analytical hierarchy process, Saaty's fundamental scale of absolute numbers, and a selection of relevant evaluation criteria (namely: automation, error handling, fault tolerance, performance quality, unit testing, data encryption, monitoring, role based access, security, availability, ease of use, integration and interoperability). The framework enables users to evaluate criteria pertaining to cloud platforms for data science workflows, and additionally is able to recommend which cloud platform would be suitable for the user based on the relative importance of the above criteria. Evaluations of the criteria are shown to be consistent and thus the weighting of criteria against the goal of cloud service provider or cloud platform selection are sensible. The proposed framework is robust enough to accommodate for changes in criteria and alternatives, depending on user cloud platform requirements and scope of cloud platform selection.