demonstrate how they can be composed to yield flexible and performant transformer \ layers with improved user experience. One may observe that the ``torch.nn`` module currently provides various ...