Hi, I'm

Delphin Kaduli

I didn't do a Master's degree to study data. I did it to build things that work in the real world. Two years of graduate-level projects, churn models, automated pipelines, and SQL systems built to the standard of would this hold up in production? Now I'm bringing that into industry. Ready.

View My WorkMy Story
Delphin Kaduli - Data Scientist Portfolio
Actively Available

The Person Behind the Data

I don't just build models. I build business answers.

What Drives Me

Off the screen, I'm grounded by faith, family, and movement. As a Christian, integrity isn't a value I list, it's how I make decisions when no one's watching. As a Barcelona fan, I understand that the best teams don't just have talented individuals, they have a system. I bring that same thinking to how I build pipelines and collaborate with teams. When I'm not in the data, I'm on a trail, or surrounded by people I love. That balance is what keeps the work sharp.

A model is only as good as the decision it changes. I just completed my M.S. in Data Analytics at Catholic University, and throughout that journey I built churn models at 80%+ accuracy, automated pipelines that cut reporting errors by 15%, and migrated fragmented records into SQL systems running at 99.9% uptime. None of that matters to me in isolation, what matters is whether someone made a better call because of it.

My core stack: Python · SQL · AWS · GCP · Power BI · Tableau · XGBoost · Scikit-Learn · MLflow · Docker, the full engine from raw data to boardroom decision.

See My ProjectsLet's Talk

Projects

My work focuses on the intersection of Predictive Systems, Revenue Optimization, and Business Intelligence.

Demand Forecasting System

Demand Forecasting System

Problem

Bike-sharing platforms lose revenue daily from a core imbalance: fleets sit idle during off-peak hours while surge periods go underserved. Without forward-looking demand signals, pricing and fleet decisions are reactive, always one step behind the customer.

Solution

Built an end-to-end forecasting pipeline to production standards, predicting city-wide demand 12 hours ahead. Engineered a 3-stage modeling approach: baseline Random Forest → lag feature engineering → CatBoost with Bayesian Optimization via Optuna. Containerized with Docker and deployed to cloud infrastructure, giving operations teams real-time inference to inform dynamic pricing and proactive fleet distribution.

Key Outcome

51% reduction in MAE (84.78 → 41.17), 44% reduction in RMSE (116.13 → 64.80), and 22% reduction in MAPE (34.63 → 27.10) over a no-feature-engineering baseline, achieved through iterative lag feature engineering and CatBoost hyperparameter tuning with Bayesian Optimization. Delivered as a Dockerized inference pipeline ready for real-world deployment.

PythonCatBoostBayesian Optimization (Optuna)Time-Series Feature EngineeringMLOpsDockerCloud DeploymentPredictive Analytics
View Project
Customer Churn Prediction

Customer Churn Prediction

Problem

Telecom providers lose customers they could have kept, because they only identify at-risk users after a cancellation request. By then, the decision is already made. The real opportunity is predicting churn before it happens, during the window when intervention still works.

Solution

Built a full classification pipeline using CatBoost and Optuna on a real-world Vodafone dataset. Addressed a severe 73/27 class imbalance through minority upsampling. Engineered features around customer tenure and usage patterns, then used Optuna for systematic hyperparameter tuning. Validated model behavior through SHAP analysis to identify which signals actually drive churn risk.

Key Outcome

Identified the Tenure Effect as the primary churn driver, customers in their first 12 months represent the highest-risk, highest-ROI intervention window. This finding reframes retention strategy from reactive damage control to a structured early-engagement program. Achieved 0.64 precision on churn class with a heavily imbalanced dataset.

PythonScikit-LearnXGBoostCatBoostOptunaSHAPPower BI (DAX)YData Profiling
View Project
Fraud Detection System

Fraud Detection System

Problem

Financial institutions lose billions to undetected fraud, but the real challenge isn't catching fraud, it's catching it without drowning analysts in false alarms. With fraud representing only 0.17% of transactions, standard models either miss anomalies entirely or flag so many legitimate transactions they become useless.

Solution

Engineered a high-precision scoring engine using CatBoost and Optuna. After an empirical ablation study revealed that extreme outliers in features V1–V28 were critical fraud signals, not noise, revised the preprocessing strategy to retain them rather than clip them. This single insight was the turning point for the model's precision.

Key Outcome

0.93 Precision and 0.82 Recall on the fraud class, meaning 93% of flagged transactions are genuine fraud cases. Prevented a total recall collapse by retaining critical outliers, a finding that came directly from systematic ablation testing rather than assumption.

PythonCatBoostOptunaRandom ForestYData ProfilingAnomaly DetectionImbalanced Data
View Project

My Skills

Python, SQL, AWS & Machine Learning

Python (Pandas, NumPy)
SQL (Advanced Querying, CTEs)
Data Warehousing (Snowflake, SQL Server)
ETL Pipeline Automation
Data Visualization (Tableau, Power BI)
A/B Testing & Statistical Analysis
Business Intelligence (KPI Design)

Core Tech Stack

Python
Snowflake
AWS
PostgreSQL
Docker
Pandas
Scikit-learn
MLflow
Streamlit
Tableau
Power BI
Jupyter
Excel
GitHub
Jira
FastAPI

Experience

Professional Impact & Results

Data Scientist & Research Analyst
The Catholic University of America
Jan 2024 - Sep 2025
Contract / Research Fellow

Experience.experienceDesc1

Data Infrastructure Engineer
Women of Faith
Jul 2021 - Dec 2023
Full-time

Experience.experienceDesc2

Education

Education.subtitle

M.S. in Data Analytics
The Catholic University of America

Education.educationDesc1

M.S. in Internet & Systems Engineering
Kigali Independent University

Education.educationDesc2

B.S. in Computer Science
Kigali Independent University

Education.educationDesc3

Contact Me

Let's turn your Data into Business Decisions

Get In Touch
I am currently accepting full-time opportunities in Data Science and Analytics.

Email

delphin.kaduli@gmail.com

Location

Washington, Dc | Open to relocate

Connect with me
Professional Profiles
Send Me a Message
Have a challenge you need solved? I'd love to hear about it.

© 2026 All rights reserved.

Privacy PolicyTerms of ServiceSitemapQuick Links