← Back to list
Registration: 20.02.2025

Mohammed Sedeg

Specialization: Deep Learning Engineer
— Intelligent Systems Engineer specializing in Computer Vision, NLP, and Generative AI. Currently pursuing a PhD in Embodied AI, with a Master's in Mechatronics, combining deep learning, automation, and robotics to push the frontiers of intelligent systems. — Passionate about state-of-the-art (SOTA) models, with a strong focus on understanding AI systems from first principles—breaking down architectures, optimizing performance, and rethinking foundational concepts. — Experienced in developing and deploying cutting-edge transformer-based models, multimodal AI, and deep generative networks for real-world applications. — Dedicated to bridging the gap between theoretical AI research and practical, scalable solutions in perception, reasoning, and automation.
— Intelligent Systems Engineer specializing in Computer Vision, NLP, and Generative AI. Currently pursuing a PhD in Embodied AI, with a Master's in Mechatronics, combining deep learning, automation, and robotics to push the frontiers of intelligent systems. — Passionate about state-of-the-art (SOTA) models, with a strong focus on understanding AI systems from first principles—breaking down architectures, optimizing performance, and rethinking foundational concepts. — Experienced in developing and deploying cutting-edge transformer-based models, multimodal AI, and deep generative networks for real-world applications. — Dedicated to bridging the gap between theoretical AI research and practical, scalable solutions in perception, reasoning, and automation.

Portfolio

MyLLM

• Developed a scalable LLM pipeline in PyTorch, covering data preprocessing, tokenization, training using multi-GPU training, , and fine-tuning with SFT and RLHF (PPO/DPO). • Optimized for real-time inference quantization, and KV_cache while maintaining low-level control for efficiency. • Currently building Meta_Bot, a chatbot that explains LLM concepts, integrating Gradio for real time interaction.

PAPER2CODE

PAPER2CODE repository! This repository is dedicated to the implementation of state-of-the-art research papers in machine learning and deep learning from scratch. Our goal is to bridge the gap between theoretical research and practical implementation, providing clear, educational, and reproducible implementations of the models and methods described in these papers. We implement these models using TensorFlow and PyTorch, depending on the specific requirements of each project.

Automated Object Detection

This project provides an **automated pipeline** for **inference** and **fine-tuning** using TensorFlow 2's **Object Detection API**, aimed at simplifying and speeding up the object detection process. ### **Inference Pipeline**: 1. **Download Pre-trained Model**: Quickly access pre-trained models to start inference. 2. **Run Detection**: - **Images**: Detect objects in static images. - **Videos**: Perform detection on video files. - **Webcam**: Enable real-time detection via webcam feed. ### **Fine-Tuning Pipeline**: 1. **Download Pre-trained Model**: Start with a pre-trained model for fine-tuning. 2. **Data Pipeline**: - **Load Data**: Import images and annotations. - **Annotate**: Label images with bounding boxes and class info. - **Convert Annotations**: Transform annotations from XML to CSV. - **Create TFRecord**: Generate TFRecord files for large datasets. 3. **Retraining**: - **Load Data**: Convert data into NumPy arrays for model training. - **Model Config**: Adjust model configuration for fine-tuning. - **Device Config**: Select CPU, GPU, or TPU for training. - **Training Loops**: Fine-tune the model with iterative training. ### **Application Development**: Utilize the fine-tuned model to build custom applications in the **App** directory, enhancing workflow efficiency. ### **Efficiency and Automation**: This project automates key processes of object detection and model training, minimizing manual effort and maximizing efficiency, ultimately providing a seamless workflow for better results.

SilvaNet

**SilvaNet** is a lightweight Python library designed to simplify deep learning concepts and model building. It features **autograd-enabled tensors** for seamless backpropagation, an **intuitive API** for easy neural network construction, and support for **element-wise operations, activation functions (ReLU, sigmoid, tanh), and loss functions (softmax cross-entropy).** SilvaNet is **flexible, extensible, and supports model saving/loading**, making it an ideal tool for both beginners and educators. 🚀

Skills

C++
C
Matlab
TensorFlow
PyTorch
Keras
OpenCV
NumPy
Pandas
Git

Work experience

Instructor / Team Lead
Till the present day |NDA
LLMs, Computer Vision, Technofest, Robotics, ROS2
Private Engineering Courses / Technofest Competition. ● Designed and delivered AI-focused courses covering mathematics, programming, and specialized topics like LLMs, Computer Vision. ● Trained students in engineering and AI concepts for Technofest’s AI competition. ● Leading a team in Technofest’s AI sector, applying course concepts to develop innovative AI and robotics solutions. ● Managed team progress, optimized models, and promoted collaboration for a competitive edge.
AI Engineer / Computer Vision Specialist
since 01.2025 - Till the present day |VisionCore
Pytorch , TFLite,OpenCV , YOLO , ultralytics , Roboflow, TensorRT
● Develop and deploy real-time AI models for industrial applications, optimizing for low-latency performance using advanced techniques such as quantization, pruning, and model compression. ● Design and fine-tune end-to-end training pipelines, including data augmentation, image preprocessing, and transfer learning, ensuring maximum model performance and faster convergence. ● Continuously optimize models for real-time deployment, leveraging techniques like model distillation and edge computing to reduce resource usage and enhance scalability in production environments.
Deep Learning Engineer
since 06.2023 - Till the present day |NDA
PyTorch, TensorFlow, Hugging Face Transformers, LoRA, DeepSpeed, bitsandbytes, vLLM, GAN,, OpenCV, GPT, LLaMA, Gradio
● Developed generative models for applications like image generation, style transfer, and super-resolution using GANs and VAEs. ● Fine-tuned GPT and LLaMA-based models on custom datasets, leveraging domain-specific knowledge to improve performance while using minimum computational resources. ● Implemented advanced NLP solutions for text generation, classification, and summarization, achieving high-quality, tailored results with optimized resource usage.

Educational background

Mechatronics (Doctor of Science)
since 2024 - Till the present day
Karabuk University
Mechatronics (Masters Degree)
2020 - 2023
Karabuk University
Electrical Engineer (Bachelor’s Degree)
2011 - 2016
Sudan University of Science and Technologies

Languages

ArabicNativeEnglishProficientTurkishIntermediate