Reading notes: Fuxi: a Fault-Tolerant Resource Management and Job Scheduling System at Internet Scale...

    xiaoxiao2021-04-17  210

    This post covers the paper on Alibaba Cloud's Fuxi platform. Some points I found interesting:

    Fuxi: a resource management and job scheduling system. (My impression is that it is built on YARN; it looks a lot like YARN.)
        1. An incremental resource management protocol
        2. User-transparent failure recovery
        3. An effective faulty-node detection mechanism and a multi-level blacklisting scheme

    Fuxi (FuxiMaster, AppMaster, Tubo) <> YARN (ResourceManager, AppMaster, NodeManager)

    Differences between Fuxi and YARN:
        1. Fuxi separates the notion of a task (the application process that performs the actual work) from that of a container (the unit of resource grant). Once an application master receives a grant, it explicitly controls the container's life cycle and may reuse the container to run multiple tasks.
        2. Locality-tree-based scheduling.
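    To make the task/container separation concrete, here is a minimal sketch of an application master that receives one grant and reuses it for several tasks before releasing it. This is only an illustration under my own assumptions: `Container`, `AppMaster`, `on_grant`, and `run_demo` are hypothetical names, not Fuxi's actual API, and the tasks are "run" by simply recording them.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Container:
    """A unit of granted resources (hypothetical fields, not Fuxi's real type)."""
    container_id: int
    node: str
    tasks_run: list = field(default_factory=list)


class AppMaster:
    """Toy application master that owns the life cycle of its containers."""

    def __init__(self, tasks):
        self.pending = deque(tasks)   # tasks still waiting to run
        self.released = []            # ids of containers handed back

    def on_grant(self, container):
        # Reuse the same container for task after task instead of
        # requesting a fresh grant for every task.
        while self.pending:
            task = self.pending.popleft()
            container.tasks_run.append(task)  # stand-in for real execution
        # No work left: the application master explicitly returns the container.
        self.released.append(container.container_id)


def run_demo():
    master = AppMaster(["map-0", "map-1", "map-2"])
    container = Container(container_id=1, node="node-a")
    master.on_grant(container)
    return container.tasks_run, master.released
```

    In this sketch a single grant serves all three tasks, so the scheduler is contacted once rather than once per task, which is the point of decoupling the two notions.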
