I work in the Distributed Data Lab at Huawei at their Toronto based R&D installation. My work here has spanned from automatic performance tuning (AutoTune) for data loading pipelines, used for Deep learning model traning - to creating high performance GPU CUDA kernels for ML computation...
Welcome to the landing page