Mohammad Donyavi
Hi, I’m Mohammad — A Passionate and versatile AI researcher with a strong foundation in deep learning, LLMs, and distributed ML systems. Experienced in optimizing model performance and building scalable data/ML pipelines. Adept at bridging academic research and practical deployment in NLP systems..
My Journey
My academic and technical journey began with a foundation in engineering
problem-solving during my undergraduate studies in Petroleum Engineering
at Sharif University of Technology. Fascinated by data-driven approaches, I focused my
bachelor thesis on applying machine learning to analyze wettability changes in smart
water injection — an early sign of my growing interest in AI and computational methods.
Driven by a passion to bridge engineering challenges with intelligent systems, I
transitioned into the world of computer engineering for my graduate studies. I was
accepted into the prestigious University of Tehran, where I began my M.Sc. in
Computer Engineering with a concentration in Computer Architecture and AI
systems. During this time, I immersed myself in advanced coursework spanning deep
learning, distributed systems, and statistical inference, all of which laid the groundwork
for high-performance AI research.
My master's thesis, titled “Accelerating Deep Neural Networks Training for Large
Language Models,” focuses on optimizing large-scale model training through
multi-constrained computational graph partitioning and distributed training
techniques. I worked extensively with frameworks like PyTorch Distributed,
DeepSpeed, and FSDP, reducing communication overhead by 23% and enhancing GPU efficiency by
up to 33%.
In parallel, I gained hands-on experience with cutting-edge large language models
(LLMs) such as LLaMA 3.1, Phi-3, and Gemma-2, fine-tuning them on curated
datasets and evaluating them using rigorous NLP benchmarks like MMLU and COMET. I also
contributed to data preprocessing pipelines for low-resource Persian
datasets, improving classification performance by 39%, and explored
Retrieval-Augmented Generation (RAG) for enhancing text generation systems.
My commitment to expanding my global academic perspective led me to Mälardalen University in
Sweden as an Erasmus+ exchange student, where I studied foundation
models, intelligent systems, and advanced algorithms —
enriching my understanding of cross-disciplinary AI applications.
Experiences
Research Assistant at NLP Lab [University of Tehran] - (Sep. 2021 – Aug. 2025)
Django Web Developer Intern [Mapsa] - (May. 2020 – Sep. 2020)
Teaching Experience
Skills
- Programming: Python (PyTorch, TensorFlow, Keras), C/C++, R, MATLAB
- Big Data & Cloud: HDFS, Kafka, Spark, Presto, Docker
- Databases: SQL, NoSQL (MongoDB, Cassandra)
- DevOps & Tools: Git, Linux, Shell, Docker, VS Code, Latex
- Languages: Persian (Native), English (TOEFL 86 – R:19, L:25, S:21, W:21)
Honors
Example: Ranked 25th among 40,000 participants in Iran’s national Computer Engineering entrance exam — top 0.06%Get In Touch
Whether you're hiring, collaborating, or just curious — feel free to reach out. I'm always open to exciting opportunities in the world of data and AI.