Accelerating machine learning at the edge with approximate computing on FPGAs

محفوظ في:

التفاصيل البيبلوغرافية
المؤلفون:	León-Vega, Luis Gerardo, Salazar-Villalobos, Eduardo, Castro-Godínez, Jorge
التنسيق:	artículo original
الحالة:	Versión publicada
تاريخ النشر:	2022
الوصف:	Performing inference of complex machine learning (ML) algorithms at the edge is becoming important to unlink the system functionality from the cloud. However, the ML models increase complexity faster than the available hardware resources. This research aims to accelerate machine learning by offloading the computation to low-end FPGAs and using approximate computing techniques to optimise resource usage, taking advantage of the inaccurate nature of machine learning models. In this paper, we propose a generic matrix multiply-add processing element design, parameterised in datatype, matrix size, and data width. We evaluate the resource consumption and error behaviour while varying the matrix size and the data width given a fixed-point data type. We determine that the error scales with the matrix size, but it can be compensated by increasing the data width, posing a trade-off between data width and matrix size with respect to the error.
البلد:	Portal de Revistas TEC
المؤسسة:	Instituto Tecnológico de Costa Rica
Repositorio:	Portal de Revistas TEC
اللغة:	Inglés Español
OAI Identifier:	oai:ojs.pkp.sfu.ca:article/6491
الوصول للمادة أونلاين:	https://revistas.tec.ac.cr/index.php/tec_marcha/article/view/6491
كلمة مفتاحية:	Approximate computing edge computing machine learning neural networks linear algebra Computación aproximada computación periférica aprendizaje por computador redes neuronales álgebra lineal

Accelerating machine learning at the edge with approximate computing on FPGAs

مواد مشابهة