[HN Gopher] Nystromformer: A Nystrom-Based Algorithm for Approxi... ___________________________________________________________________ Nystromformer: A Nystrom-Based Algorithm for Approximating Self- Attention Author : tmfi Score : 46 points Date : 2021-02-11 18:42 UTC (4 hours ago) (HTM) web link (arxiv.org) (TXT) w3m dump (arxiv.org) | elcomet wrote: | Here's a nice video by Yannick Kilcher explaning the | Nystromformer: https://www.youtube.com/watch?v=m-zrcmRd7E4 | | The benefits over regular transformers is that it is more | efficient (does less operations), as the original transformer has | a quadratic complexity in the number of input tokens. ___________________________________________________________________ (page generated 2021-02-11 23:01 UTC)