How I Learned to Stop Worrying and Love the Exascale

Published September 6, 2018, 17:49
Our ability to produce data continues to grow exponentially, and so does the computational power available in the world. By 2021, zettabytes of data are expected to be stored in the cloud, and the U.S. Department of Energy (DoE) is expected to have deployed heterogeneous exascale clusters with millions of CPU cores and tens of thousands of nodes. Resources at this scale have the potential to accelerate scientific breakthroughs, but for that to happen we need to build scalable software systems that utilize them efficiently.

Today, these clusters confine applications to a non-privileged environment without feedback. I will discuss our two-pronged effort to improve operational efficiency in this setting by (a) allowing applications to control mechanisms and observe information traditionally accessible only to the operating system, and (b) using machine learning models to inform users of expected job behavior. This work is a collaboration with the Los Alamos and Argonne National Labs.

See more at microsoft.com/en-us/research/v...