Efficient Attention using a Fixed-Size Memory Representation

  Abstract