此文是关于阿里云伏羲平台的论文,一些感兴趣的点:
Fuxi:a resouce management and job scheduling system. (我感觉是基于YARN做的,很像YARN)
1, An incremental resource management protocol
2, A user-transparent failure recovery
3, A effective (faulty-node) detection mechanism and a mlti-level blacklisting schema
Fuxi (FuxiMaster, AppMaster, Tubo) <>YARN(ResourceManager, AppMaster, NodeManager)
Fuxi 与 YARN区别:
1,Fuxi seperates the notion of task(the application process that performs the actual work) and container(the unit of resource grant). Once an application master receives an grant , it explicitly controls its life-cycle and may reuse the container to run multiple tasks.
2,Lcality tree based scheduling.