Presentation

Enabling Fast Uncertainty Estimation: Accelerating Bayesian Transformers via Algorithmic and Hardware Optimizations
Time
Tuesday, July 12th, 4:10pm - 4:30pm PDT
Location
3005, Level 3
Event Type
Research Manuscript
Keywords
SoC, Heterogeneous, and Reconfigurable Architectures
Topics
Design
Description
Transformers have demonstrated high accuracy in various applications.....
However, they are not able to capture uncertainty...
Bayesian transformer, which .....
Nevertheless,
they require repeated MC sampling.....
In this paper, we explore the sparsity in Bayesian transformers .....
On CPU and GPU,
our framework improves the latency by ......
On a dedicated FPGA design,
we achieve nearly speedup.