Understanding Transformer
Contents
Introduction
Standard Convolution
Continuous Convolution
Introduction
Let $X = (x_1, \dots, x_n)$ be a sentence with $n$ tokens embedded in $d$-dimensional vectors, and let $x_i \in \mathbb{R}^d$ be the $i$-th token vector.
Standard Convolution
Based on the definition of convolution, the output vector at position $i$ with kernel size $k$ is computed as:
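One common way to write such a convolution, assuming a centered window, zero padding at the sentence boundaries, and one weight matrix $W_j$ per offset, is $o_i = \sum_{j=1}^{k} W_j \, x_{i + j - \lceil k/2 \rceil}$. The names $o_i$ and $W_j$, the padding scheme, and the odd kernel size are assumptions made for illustration and may differ from the notation intended here. A minimal NumPy sketch of that form:

```python
import numpy as np


def standard_conv(X, W):
    """Sketch of a 1D convolution over a token sequence.

    X: array of shape (n, d), one d-dimensional vector per token.
    W: array of shape (k, d_out, d), one weight matrix per window offset
       (hypothetical parameterization, assumed for this example).
    Assumes k is odd so the window is centered on each position.
    """
    n, _ = X.shape
    k = W.shape[0]
    half = k // 2

    # Zero-pad the sequence so every position has a full window.
    X_padded = np.pad(X, ((half, half), (0, 0)))

    out = np.zeros((n, W.shape[1]))
    for i in range(n):
        for j in range(k):
            # X_padded[i + j] is the token at offset (j - half) from position i.
            out[i] += W[j] @ X_padded[i + j]
    return out


# Usage: with identity weights and k = 3, each output is the sum of a
# token vector and its immediate neighbours.
X = np.arange(8, dtype=float).reshape(4, 2)
W = np.stack([np.eye(2)] * 3)
print(standard_conv(X, W))
```

The key property this makes visible is that the weights depend only on the relative offset within the window, not on the absolute position or on the content of the tokens.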