Machine Learning, GPU Performance
I am a Machine Learning Engineer specializing in optimizing large-scale models for efficient deployment, with a focus on GPU performance and inferencing. Currently, I work at Microsoft, where I optimize OpenAI models for inferencing in Azure, leveraging kernel optimization, performance analysis, and distributed computing.
Jun 2022 — Present | Redmond, WA
Technologies: Python, C++, CUDA, Triton, ROCm (HIP), ONNXRuntime, Pytorch, Deepspeed.
May 2020 — Jun 2022 | Iowa City, IA
Technologies: Python, Go, VueJS, Flask, Docker, NLP, XGBoost, AWS (EC2, S3, SageMaker, CloudWatch, EB, SQS)
Jan 2019 — Aug 2023 | Casablanca, Morocco
Technologies: Python, Flask, Docker, NLP, Keras, Tensorflow
Maharishi International University | Aug 2019 — Jan 2021
Data Science
Abdelmalek Essaâdi University, National School of Applied Sciences | Aug 2016 — Aug 2019
Business Intelligence
Abdelmalek Essaâdi University | Aug 2014 — Jul 2016
Mathematics and Computer Science