Evolutionary scheduling of dynamic multitasking workloads for big-data analytics in elastic cloud

Fan Zhang; Junwei Cao; Wei Tan; Samee U. Khan; Keqin Li; Albert Y. Zomaya

doi:10.1109/TETC.2014.2348196

Publication

IEEE TETC

Paper

Evolutionary scheduling of dynamic multitasking workloads for big-data analytics in elastic cloud

IEEE TETC

Download paper

Abstract

Scheduling of dynamic and multitasking workloads for big-data analytics is a challenging issue, as it requires a significant amount of parameter sweeping and iterations. Therefore, real-time scheduling becomes essential to increase the throughput of many-task computing. The difficulty lies in obtaining a series of optimal yet responsive schedules. In dynamic scenarios, such as virtual clusters in cloud, scheduling must be processed fast enough to keep pace with the unpredictable fluctuations in the workloads to optimize the overall system performance. In this paper, ordinal optimization using rough models and fast simulation is introduced to obtain suboptimal solutions in a much shorter timeframe. While the scheduling solution for each period may not be the best, ordinal optimization can be processed fast in an iterative and evolutionary way to capture the details of big-data workload dynamism. Experimental results show that our evolutionary approach compared with existing methods, such as Monte Carlo and Blind Pick, can achieve higher overall average scheduling performance, such as throughput, in real-world applications with dynamic workloads. Furthermore, performance improvement is seen by implementing an optimal computing budget allocating method that smartly allocates computing cycles to the most promising schedules.

Date

01 Sep 2014

Publication

IEEE TETC

Authors

IBM-affiliated at time of publication

Topics

Resources

Publication

Abstract

Date

Publication

Authors

Topics

Resources

Share