Research of Hybrid Resource Scheduling Framework of Heterogeneous Clusters for Dataflow

doi:10.13328/j.cnki.jos.006356

微信服务号

微信订阅号

2025-6-2- 21

Home > Archive>Volume 33, Issue 12, 2022 >4704-4726. DOI:10.13328/j.cnki.jos.006356

PDF HTML XML Export Cite reminder

Research of Hybrid Resource Scheduling Framework of Heterogeneous Clusters for Dataflow
DOI:
                        10.13328/j.cnki.jos.006356
                    
Author:
                        TANG Xiao-ChunTANG Xiao-Chun
School of Computer Science, Northwestern Polytechnical University, Xian 710129, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
ZHAO QuanZHAO Quan
School of Computer Science, Northwestern Polytechnical University, Xian 710129, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
FU YingFU Ying
School of Computer Science, Northwestern Polytechnical University, Xian 710129, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
ZHU Zi-YuZHU Zi-Yu
School of Computer Science, Northwestern Polytechnical University, Xian 710129, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
DING ZhaoDING Zhao
School of Computer Science, Northwestern Polytechnical University, Xian 710129, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
HU Xiao-XueHU Xiao-Xue
School of Computer Science, Northwestern Polytechnical University, Xian 710129, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
LI Zhan-HuaiLI Zhan-Huai
School of Computer Science, Northwestern Polytechnical University, Xian 710129, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:TP311
Fund Project:National Key Research & Development Program of China(2018YFB1003400)

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

The use of the Dataflow model integrates the batch processing and stream processing of big data computing. Nevertheless, the existing cluster resource scheduling frameworks for big data computing are oriented either to stream processing or to batch processing, which are not suitable for batch processing and stream processing jobs to share cluster resources. In addition, when GPUs are used for big data analysis and calculations, resource usage efficiency is reduced due to the lack of effective CPU-GPU resource decoupling methods. Based on the analysis of existing cluster scheduling frameworks, a hybrid resource scheduling framework called HRM is designed and implemented that can perceive batch/stream processing applications. Based on a shared state architecture, HRM uses a combination of optimistic blocking protocols and pessimistic blocking protocols to ensure different resource requirements for stream processing jobs and batch processing jobs. On computing nodes, it provides flexible binding of CPU-GPU resources, and adopts queue stacking technology, which not only meets the real-time requirements of stream processing jobs, but also reduces feedback delays and realizes the sharing of GPU resources. By simulating the scheduling of large-scale jobs, the scheduling delay of HRM is only about 75% of the centralized scheduling framework; by using actual load testing, the CPU resource utilization is increased by more than 25% when batch processing and stream processing share clusters; by using the fine-grained job scheduling method, not only the GPU utilization rate is increased by more than 2 times, the job completion time can also be reduced by about 50%.

Key words:dataflow model;batch process;streaming process;application aware;CPU-GPU;queue overlap

Get Citation

汤小春,赵全,符莹,朱紫钰,丁朝,胡小雪,李战怀.面向Dataflow的异构集群混合式资源调度框架研究.软件学报,2022,33(12):4704-4726

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:November 23,2020
Revised:January 25,2021
Adopted:
Online: May 21,2021
Published: December 06,2022

You are the first2049925Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History