Scaling Recommender Transformers to a Billion Parameters

How to implement a new generation of transformer recommenders