Name: 张俸玺  Student ID: 20012100022  College: 竹园三号书院
Reposted from https://blog.csdn.net/qq_38798425/article/details/109124572
【嵌牛 Overview】The FPGA (field-programmable gate array) is a relatively new technology that remains unfamiliar to most people, yet it has become a hot topic, and implementing neural networks on FPGAs is one of today's most active technical subjects. This article organizes FPGA+CNN papers into categories.
【嵌牛 Keywords】FPGA, CNN, paper classification
【嵌牛 Question】How should FPGA+CNN papers be categorized?
【嵌牛 Body】
Fast computation
Fast computation comes in two flavors. The first relies on fast algorithms, mainly the FFT and the Winograd algorithm, both of which speed up convolution: the FFT is better suited to larger kernels, while Winograd is better suited to deployment on hardware platforms (a more detailed analysis can be found on Zhihu). The second makes better use of the available compute resources (the DSP units). A minimal sketch of the two fast algorithms is given right below.
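To make the two fast algorithms concrete, here is a minimal NumPy sketch (not taken from any of the papers below; function and variable names are illustrative only) that computes a 1-D valid convolution directly, via the FFT, and via Winograd F(2,3):

```python
import numpy as np

def conv1d_direct(d, g):
    """Direct 1-D valid convolution (cross-correlation, as used in CNNs)."""
    n = len(d) - len(g) + 1
    return np.array([np.dot(d[i:i + len(g)], g) for i in range(n)])

def conv1d_fft(d, g, out_len):
    """Linear convolution via the FFT: zero-pad, multiply spectra, invert.
    The kernel is reversed so the result matches CNN-style cross-correlation."""
    L = len(d) + len(g) - 1
    D = np.fft.rfft(d, L)
    G = np.fft.rfft(g[::-1], L)
    full = np.fft.irfft(D * G, L)
    start = len(g) - 1                      # drop the boundary samples
    return full[start:start + out_len]

def winograd_f23(d, g):
    """Winograd F(2,3): two outputs of a 3-tap filter using 4 multiplications
    instead of the 6 required by the direct method."""
    m1 = (d[0] - d[2]) * g[0]
    m2 = (d[1] + d[2]) * (g[0] + g[1] + g[2]) / 2
    m3 = (d[2] - d[1]) * (g[0] - g[1] + g[2]) / 2
    m4 = (d[1] - d[3]) * g[2]
    return np.array([m1 + m2 + m3, m2 - m3 - m4])

d, g = np.random.randn(4), np.random.randn(3)
ref = conv1d_direct(d, g)
assert np.allclose(conv1d_fft(d, g, len(ref)), ref)
assert np.allclose(winograd_f23(d, g), ref)
```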
FFT:
[2013]-Fast Training of Convolutional Networks through FFTs
[2016]-Very Efficient Training of Convolutional Neural Networks using Fast Fourier Transform and Overlap-and-Add
[2017]-Frequency Domain Acceleration of Convolutional Neural Networks on CPU-FPGA Shared Memory System
[2018]-A Framework for Generating High Throughput CNN
[2017]-Evaluating Fast Algorithms for Convolutional Neural Networks on FPGAs (this one covers both Winograd and FFT)
Winograd:
[1980]- Multiplication of Polynomials Modulo a Polynomial
[2016]-Fast Algorithms for Convolutional Neural Networks
[2018]-Towards a Uniform Template-based Architecture for Accelerating 2D and 3D CNNs on FPGA
[2018]-A High-efficiency FPGA-based Accelerator for Convolutional Neural Networks using Winograd Algorithm
[2018]-A Novel Low-Communication Energy-Efficient Reconfigurable CNN Acceleration Architecture
[2019]-Towards an Efficient Deep Pipelined Template-Based Architecture for Accelerating the Entire 2D and 3D CNNs on FPGA (this is the journal extension of the conference paper above)
[2019]-Accelerating 3D CNN-based Lung Nodule Segmentation on a Multi-FPGA System
[2020]-Stride 2 1-D, 2-D, and 3-D Winograd for Convolutional Neural Network
[2020]-A Power-Efficient Optimizing Framework FPGA Accelerator Based on Winograd for YOLO
DSP reuse
[2017]-Double MAC Doubling the Performance of Convolutional
plus the journal extension of the paper above
[2019]-A High Throughput Acceleration for Hybrid Neural Networks With Efficient Resource Management on FPGA
Replacing multiplications with shifts
[2017]-Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions
[2019]-Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs
Spectral convolution
[2019]-A Flexible Design Automation Tool for Accelerating Quantized Spectral CNNs
[2020]-Reuse Kernels or Activations? A Flexible Dataflow for Low-latency Spectral CNN Acceleration
GEMM
[2016]-Fast Algorithms for Convolutional Neural Networks
[2019]-High-Performance CNN Accelerator on FPGA Using Unified Winograd-GEMM Architecture (this paper combines GEMM with Winograd)
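For context, "GEMM" here refers to lowering convolution to a single large matrix multiply via im2col, which is the form such accelerators map onto their MAC arrays. A minimal illustrative sketch, not drawn from any specific paper above (the function name is my own):

```python
import numpy as np

def conv2d_im2col(x, w):
    """Lower a 2-D convolution (stride 1, no padding) to one GEMM:
    every receptive field of the input becomes a column of a matrix,
    and the kernel becomes a row vector multiplied against it."""
    H, W = x.shape
    K = w.shape[0]
    OH, OW = H - K + 1, W - K + 1
    cols = np.empty((K * K, OH * OW))           # the "im2col" buffer
    for i in range(OH):
        for j in range(OW):
            cols[:, i * OW + j] = x[i:i + K, j:j + K].ravel()
    return (w.ravel() @ cols).reshape(OH, OW)   # GEMM, then reshape

x, w = np.random.randn(6, 6), np.random.randn(3, 3)
ref = np.array([[np.sum(x[i:i + 3, j:j + 3] * w) for j in range(4)] for i in range(4)])
assert np.allclose(conv2d_im2col(x, w), ref)
```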
Others
Changing the order of multiply-accumulate operations to reduce computation
[2020]-Sparse-YOLO: Hardware/Software Co-Design of an FPGA Accelerator for YOLOv2
Block floating-point computation
[2018]-Reconfigurable Acceleration of 3D-CNNs for Human Action Recognition with Block Floating-Point Representation
Engineering applications
There are quite a few papers on engineering applications; listed here are some of the more recent ones.
[2019]-A Simplified Speaker Recognition System Based on FPGA Platform (blind source separation)
[2019]-A Real-Time Convolutional Neural Network for Super-Resolution on FPGA With Applications to 4K UHD 60 fps Video Services (video)
[2019]-Acceleration of FPGA Based Convolutional Neural Network for Human Activity Classification Using Millimeter-Wave Radar (millimeter-wave radar)
[2019]-Towards an Efficient Accelerator for DNN-based Remote Sensing Image Segmentation on FPGAs (segmentation)
[2020]-Deep Learning Approach for Epileptic Focus Localization (epileptic focus localization)
[2020]-On the Use of FPGAs to Implement CNNs: A Brief Review (this review lists quite a few applications)
Compilers
A major contribution of many recent papers is a compiler. FPGA programming is, frankly, hard, especially since the many optimizations that are sometimes required cannot be done with HLS, which makes compilers all the more important.
[2016]-Scalable and modularized RTL compilation of Convolutional Neural Networks onto FPGA
[2019]-TensorFlow to Cloud FPGAs: Tradeoffs for Accelerating Deep Neural Networks
[2020]-Automatic Compilation of Diverse CNNs Onto High-Performance FPGA Accelerators
[2020]-A Novel FPGA Accelerator Design for Real-Time and Ultra-Low Power Deep Convolutional Neural Networks Compared With Titan X GPU
[2020]-End-to-End Optimization of Deep Learning Applications
Model structure
[2017]-Accelerating Binarized Convolutional Neural Networks with Software-Programmable FPGAs
[2018]-FBNA: A Fully Binarized Neural Network Accelerator
[2018]-Towards Efficient Convolutional Neural Network for Domain-Specific Applications on FPGA
[2019]-Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs
[2019]-LUTNet: Rethinking Inference in FPGA Soft Logic (the ideas in this paper are well worth reading)
Hardware architecture
[2017]-Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks
[2018]-CascadeCNN: Pushing the Performance Limits of Quantisation in Convolutional Neural Networks
[2018]-RNA: An Accurate Residual Network Accelerator for Quantized and Reconstructed Deep Neural Networks
[2018]-A CNN Accelerator on FPGA Using Depthwise Separable Convolution
[2018]-Angel-Eye: A Complete Design Flow for Mapping CNN onto Embedded FPGA
[2019]-MulNet: A Flexible CNN Processor With Higher Resource Utilization Efficiency for Constrained Devices
[2019]-A High Throughput Acceleration for Hybrid Neural Networks With Efficient Resource Management on FPGA
[2020]-A Novel FPGA Accelerator Design for Real-Time and Ultra-Low Power Deep Convolutional Neural Networks Compared With Titan X GPU
[2020]-LACS: A High-Computational-Efficiency Accelerator for CNNs
[2020]-Light-OPU: An FPGA-based Overlay Processor for Lightweight Convolutional Neural Networks
[2020]-Reuse Kernels or Activations? A Flexible Dataflow for Low-latency Spectral CNN Acceleration
[2020]-End-to-End Optimization of Deep Learning Applications
[2020]-Automatic Compilation of Diverse CNNs Onto High-Performance FPGA Accelerators
[2020]-Performance Modeling for CNN Inference Accelerators on FPGA
Training
A few papers address training, but such work is very rare. Training itself is not especially complex computationally (it is still matrix multiplication), but it consumes far more resources. In particular, many forward-pass optimizations free the feature maps in order to reuse storage, and a design like that clearly cannot be used for training any more (which is why these are really just tricks); a minimal sketch of why is given right below.
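Here is a minimal NumPy sketch of why the activations must be kept, using a fully-connected layer as a stand-in for a convolution (illustrative only, not taken from the papers below): the backward pass reuses the layer's forward input, so that feature map cannot be freed.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal((8, 16))   # input activations (the "feature map")
W = rng.standard_normal((16, 4))   # layer weights

# Forward pass: an inference-only accelerator could free x right after this.
y = x @ W

# Backward pass: the weight gradient is built from the *saved* input x,
# so training requires keeping the activation resident in memory.
dy = rng.standard_normal(y.shape)  # gradient arriving from the next layer
dW = x.T @ dy                      # needs x from the forward pass
dx = dy @ W.T                      # gradient propagated to the previous layer
```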
[2019]-Automatic Compiler Based FPGA Accelerator for CNN Training
[2019]-FPGA-based Training Accelerator Utilizing Sparseness of Convolutional Neural Network