Time Complexity Linear

Contextual priority attention enables linear time sequence modeling in transformers

The quadratic computational complexity of the self-attention mechanism in Transformer models severely constrains their applicability to long sequence inputs. We propose Contextual Priority Attention ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Contextual priority attention enables linear time sequence modeling in transformers

Trending now