Rakib Ullah

AI/ML Developer & Researcher
Specializing in Generative AI, RAG, and Low-Resource Bengali NLP
Rakib Ullah

About Me

Building AI systems that bridge research and production

AI/ML Researcher and Developer working at the intersection of cutting-edge research and production-grade AI systems. I specialize in building end-to-end AI solutions, from dataset creation and model experimentation to API development, containerization, and cloud deployment.

My work spans medical NLP, multimodal content understanding, and safety-focused AI, often under data and compute constraints. Currently seeking Junior AI/ML Engineer roles focused on scalable, real-world AI solutions.

Technical Arsenal

Tools and technologies I work with

πŸ€–

ML/DL Frameworks

PyTorch TensorFlow Scikit-Learn Hugging Face
πŸ”—

LLM & Gen AI

LangChain RAG Fine-tuning PEFT (LoRA)
πŸ“Š

Data Science

Pandas NumPy Matplotlib FAISS
πŸš€

Deployment

FastAPI Docker Streamlit Render
🎯

Specializations

Bengali NLP Medical NLP Multimodal AI Model Compression
πŸ”§

Tools & More

Git Python LaTeX Label Studio

Research & Publications

Contributing to the advancement of AI research

πŸ“ Research Paper (Submitted)

Clinical Outcome Prediction Framework for Low-Resource Settings

πŸ“… Jan 2024 – May 2025 🏒 Mahdy Research Academy

Status: Submitted to Computer Methods and Programs in Biomedicine

Key Contributions:

  • Led pioneering research on clinical outcome prediction for low-resource medical text settings without native language corpora
  • Fine-tuned transformer-based models achieving 65.3% ROC-AUC for length-of-stay prediction and 64.9% ROC-AUC for mortality prediction on 33,000+ clinical records
  • Developed end-to-end NLP pipeline for dataset preprocessing
  • Co-authored manuscript with publication-ready figures, statistical analyses, and performance tables
πŸ“ Research Paper (Submitted)

Classifying Malicious Content in Bengali and Code-Switched Memes

πŸ“… Nov 2024 – Jun 2025 🏒 SafeNet AI

Status: Submitted to IC3IT 2026

Key Contributions:

  • Conceptualized and led the first research on low-resource Bengali and Bengali-English code-mixed memes
  • Developed novel dataset of 3,247 instances with detailed annotation guidelines achieving 79% inter-annotator agreement
  • Designed custom fusion model achieving 77.65% Macro F1-score for classifying inflammatory, hateful, and benign memes
  • Co-authored manuscript with publication-ready visualizations and detailed result analyses
πŸ”¬ Ongoing Research

Parameter-Efficient Modeling for Low-Resource Bengali NLP

πŸ“… Aug 2025 – Present 🏒 AI/ML Professional Community

Focus: Benchmarking PEFT methods and model compression for Bengali NLP

Key Contributions:

  • Benchmarking parameter-efficient architectures (LoRA, QLoRA) for Bengali NLP tasks
  • Analyzing performance–efficiency trade-offs for real-time production deployment
  • Conducting statistical analysis and visualization of model compression techniques
  • Authoring research manuscript including methodology, evaluation, and result analysis

Technical Projects

Production-ready AI systems and applications

πŸš€ Production System

Context-Aware Scientific Document Q&A System

πŸ“… Aug 2025 – Sep 2025 πŸ”— GitHub

Overview: Production-ready RAG system enabling natural language querying of long scientific PDFs

Key Features:

  • Built scientific QA chatbot with LangChain leveraging Gemini Pro's long context window
  • Enhanced system to interpret figures and tables for accurate, context-aware answers from 20+ page research papers
  • Developed microservices architecture (FastAPI + Streamlit) with containerization using Docker
  • Deployed on Render with comprehensive testing for reliability and maintainability

Impact: Addresses Gemini's limited usability for scientific QA in native app

Competitions & Datathons

Applied ML under real-world constraints

πŸ† National Datathon

CUET National Datathon 2025 Top 25%

πŸ“… Dec 2025 πŸŽ“ Chittagong University of Engineering & Technology

Ranking: 37th / 151 teams

Score: 81% Macro F1 (Private Leaderboard)

Approach & Contributions:

  • Instruction-tuned all layers of Qwen3-VL 8B using LoRA for multimodal classification
  • Applied 4-bit quantization via Unsloth to enable training under strict compute constraints
  • Handled the complete ML pipeline independently, including preprocessing, training, validation, and evaluation
  • Addressed significant distribution shift between training and test data to improve generalization

Affiliation: Sylhet Engineering College

Education

Academic background

Bachelor of Science in Computer Science and Engineering

πŸŽ“ Sylhet Engineering College, Sylhet, Bangladesh

Experience Journey

Building expertise through diverse research roles

Researcher - AI/ML Professional Community Bangladesh

πŸ“ Remote | Aug 2025 – Present

Researcher & Team Lead - SafeNet AI

πŸ“ Remote | Nov 2024 – Jun 2025

Research Intern & Team Lead - Mahdy Research Academy

πŸ“ Remote | Jan 2024 – May 2025

Get In Touch

Let's connect and build something amazing together

πŸ“§

Email

secrakibullah@gmail.com
πŸ™

GitHub

github.com/secrakib
πŸ“±

Phone

+880-1630-208517