OpenCNN: A Winograd Minimal Filtering Algorithm Implementation in CUDA
Authors | Roberto L. Castro, Diego Andrade Canosa, Basilio B. Fraguela |
Journal | Mathematics Vol. 9 Num. 17 |
DOI | https://doi.org/10.3390/math9172033 |
Department | Computer Engineering |
Knowledgment area | Computer Architecture and Technology |
Research | Research group Grupo de Arquitectura de Ordenadores |
Research lines | No data available from Curriculum Management System at UDC. (SUXI). |
Contacto | UDC directory |
Orcid id0000-0001-5493-0287 Scopus57219633478 |
This section shows the teaching given in degrees, masters and other officers studies in last 6 years.
Subject and involved studies | Type | Distance hours | Total hours |
---|---|---|---|
Integrative Programming
Degree in Computer Engineering
|
Optional | 0 | 60 |
Subject and involved studies | Type | Distance hours | Total hours |
---|---|---|---|
Integrative Programming
Degree in Computer Engineering
|
Optional | 0 | 60 |
Subject and involved studies | Type | Distance hours | Total hours |
---|---|---|---|
Integrative Programming
Degree in Computer Engineering
|
Optional | 0 | 60 |
No available EOG works or final master thesis directed by current teacher since 2013 year.
Select merit type and year to query research merits.
OpenCNN: A Winograd Minimal Filtering Algorithm Implementation in CUDA
Authors | Roberto L. Castro, Diego Andrade Canosa, Basilio B. Fraguela |
Journal | Mathematics Vol. 9 Num. 17 |
DOI | https://doi.org/10.3390/math9172033 |
Using Artificial Vision Techniques for Individual Player Tracking in Sport Events
Authors | Roberto L. Castro, Diego Andrade Canosa |
Journal | Proceedings Vol. 21 Num. 1 |
DOI | https://doi.org/10.3390/proceedings2019021021 |
VENOM: A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
2023, International Conference for High Performance Computing, Networking, Storage and Analysis, SC'23
International
Authors | Roberto L. Castro, Andrei Ivanov, Diego Andrade Canosa, Tal Ben-Nun, Basilio B. Fraguela, Torsten Hoefler |
Place | Denver, CO (Estados Unidos) |
DOI | https://doi.org/10.1145/3581784.3607087 |
Accelerating Machine Learning Computational Kernels on the GPU
18th International Summer School on Advanced Computer Architecture and Compilation for High-performance Embedded Systems, ACACES 22
International
Authors | Roberto L. Castro, Diego Andrade Canosa, Basilio B. Fraguela |
Organization | European Network of Excellence on High Performance and Embedded Architecture and Compilation (HiPEAC) |
Place | Fiuggi (Italia) |
Probing the Efficacy of Hardware-Aware Weight Pruning to Optimize the SpMM routine on Ampere GPUs
2022 International Conference on Parallel Architectures and Compilation Techniques, PACT
International
Authors | Roberto L. Castro, Diego Andrade Canosa, Basilio B. Fraguela |
Organization | University of Illinois Chicago |
Place | Chicago, IL (Estados Unidos) |
DOI | https://doi.org/10.1145/3559009.3569691 |
Academic or management positions held by teacher.