Executing and Pausing Distributed Applications Running on Desktop Clouds by Global Snapshots

 

Wedi'i Gadw mewn:
Manylion Llyfryddiaeth
Awduron: Gómez, Carlos E., Chavarriaga, Jaime, Bonilla, David C., Castro, Harold E.
Fformat: artículo original
Statws:Versión publicada
Dyddiad Cyhoeddi:2020
Disgrifiad:Desktop Clouds rely on volatile computing resources. For instance, platforms such as cuCloud and UnaCloud run scientific applications in virtual machines exploiting idle resources harvested in computer labs. Regretfully, these resources can be claimed by users, turned off and faulted at any time. The application running on these platforms suffer interference and interruptions that do not occur in dedicated platforms. We have been researching how to deal with these interruptions to increase the platform reliability and support applications running for large periods of time. This paper describes an application of our Global Snapshot Protocol, which can be employed for executing and pausing distributed applications running on desktop clouds. We found that, in these environments, the number of failures caused by desktop users is greater than the caused by hardware and communications. There, when a distributed system running in the virtual machines of a desktop cloud is paused, it can be restored in the same desktops, and successfully finish the application execution.
Gwlad:Portal de Revistas TEC
Sefydliad:Instituto Tecnológico de Costa Rica
Repositorio:Portal de Revistas TEC
Iaith:Inglés
OAI Identifier:oai:ojs.pkp.sfu.ca:article/5074
Mynediad Ar-lein:https://revistas.tec.ac.cr/index.php/tec_marcha/article/view/5074
Allweddair:Reliability
Fault tolerance
Checkpointing
Global snapshot
Desktop clouds
Confiabilidad
Tolerancia a fallas
Snapshot Global