Home About Case Studies Projects Education Skills Certifications Contact
Open to Consulting Internships & Full-Time Roles

Turning messy data into
decisions that move
businesses forward

Data & AI Engineering student at ENSA Al Hoceima  ·  Analytics Consulting track

Open to Analytics & AI Consulting roles  ·  Available 2026

About

Engineer who thinks
like a consultant

I'm a final-year Data Engineering student at ENSA Al Hoceima (Morocco), obsessed with one question: what does this data actually tell us — and what should the business do about it?

Four years translating data into decisions: churn diagnostics for energy utilities, financial health scoring for 9,600 African SMEs, national connectivity demand modeling for Togo. Behind every insight — production pipelines on Azure Databricks, cloud-native platforms, and executive dashboards built to communicate, not just visualize. Two internships in applied AI confirmed that the hardest part is never the model — it's framing the right problem.

What makes me different: I bring engineering precision to consulting questions. I don't just build the analysis — I deliver the recommendation.

Essohanam Josué KPATCHA — Data & AI Engineer
4 Consulting-grade case studies delivered end-to-end
0.86 ROC-AUC — Zindi Africa Togo Internet Uptake Prediction
0.96 Precision — BCG X PowerCo Churn Analysis (XGBoost)
0.82 Weighted F1 — African SME Financial Health Prediction
Experience
AI & Data Engineering Intern
ISICOD
Jun 2025 – Sep 2025  ·  4 months
Rabat, Morocco

Co-built an end-to-end biometric AI platform for fingerprint classification & ridge counting across 12 classes. Sole implementer of the Human-in-the-Loop retraining system enabling continuous model improvement in production.

PyTorch FastAPI React Computer Vision
Machine Learning Engineer Intern
3D Smart Factory
Jun 2024 – Aug 2024  ·  2 months
Mohammedia, Morocco

Applied research on 3D medical image segmentation. Implemented custom KAN layers integrated into a 3D U-Net — achieving +3% segmentation accuracy gain over the standard baseline on MRI datasets.

PyTorch TensorFlow 3D Deep Learning Medical AI
Case Studies

Work that speaks
in business outcomes

Each project structured as a consulting engagement: problem framed, approach rigorous, impact quantified.

★ Featured Case Study BCG X Analytics Simulation

Why PowerCo's Customers Churn — and What To Do About It

Energy utility · 14,600 customers · 193,000 pricing records · One bold hypothesis reframed.

View on GitHub
Problem

PowerCo (energy utility) believed price sensitivity was the primary churn driver. BCG was asked to investigate and recommend a targeted retention strategy.

Approach

Reframed the hypothesis. Analyzed 14,600 customers + 193,000 pricing records. Engineered 50+ features. Benchmarked Random Forest, Gradient Boosting, XGBoost with rigorous CV.

Impact

Price is NOT the main driver — margin and customer segment are. Prioritize retention on low-margin, high-churn segments. XGBoost: ROC-AUC = 0.68, Precision = 0.96.

BCG GenAI Consulting Simulation

AI Financial Analyst for Global Finance Corp.

Problem

GFC needed to analyze hundreds of 10-K filings faster and extract strategic insights without manual reading.

Approach

Built a hybrid RAG engine (ChromaDB + SQLite). Extracted 2,000+ records from 34 companies × 3 years via SEC EDGAR. Engineered 20+ industry-benchmarked KPIs.

Impact

Production-ready system answering complex financial questions in seconds. Integrated Langfuse for hallucination eval + LiteLLM for failover.

RAG · LLM · Finance GitHub
Zindi Africa Competition

Predicting Financial Health for 9,600 African SMEs

Problem

SME lenders in Southern Africa needed to segment businesses by financial health to improve credit risk decisions.

Approach

Cleaned 9,618 records × 39 features across 4 countries. Engineered 47 features. Benchmarked 4 algorithms with stratified CV.

Impact

Gradient Boosting: F1 = 0.82 weighted, 78.7% accuracy. FastAPI scoring API deployed for real-time batch prediction.

GBM · FastAPI · Africa GitHub
Zindi Africa Competition · National Impact

Togo National Internet Connectivity — Demand Prediction

Problem

The Togolese government needed to predict household internet uptake across the country to guide FTTH infrastructure investment decisions.

Approach

End-to-end ML pipeline on a national survey dataset (~14,400 rows, ~4,000 features). Strict data quality checks, PCA (350 components) for dimensionality reduction, combined with curated categorical indicators.

Impact

XGBoost selected: ROC-AUC = 0.86 on validation. Competition score: 0.7144. National-scale model guiding connectivity infrastructure priorities.

XGBoost · PCA · National Policy GitHub
Technical Projects

Engineering depth
across the full stack

Selected projects demonstrating technical breadth — from AI agents to data platforms.

Data Engineering · Analytics
Job Market Intelligence Platform — Skills Really In Demand

Automated scraping of Morocco data job listings (Indeed, ReKrute) with HDBSCAN clustering to segment archetypes and quantify skill demand. Power BI dashboards delivering prioritized skill gap recommendations.

HDBSCAN Power BI Scraping
GitHub
Data Engineering · Azure
Real-Time Food Delivery Analytics Lakehouse

Medallion lakehouse on Azure Databricks with real-time CDC via Debezium + Kafka and SCD Type 2 historical tracking. Optimized Delta tables via partitioning + Z-ORDER for scalable reporting workflows.

Databricks Kafka Delta Lake
GitHub
AI Agents · MCP
DevAssist — AI Agent Platform

Production-ready AI assistant combining LangGraph agents with real-time tool usage via Model Context Protocol. Integrated GitHub, Supabase, Exa MCP servers. Evaluated with DeepEval. Deployed on GKE with Terraform + Prometheus.

LangGraph MCP Kubernetes GCP
GitHub
Data Engineering · Healthcare
Healthcare Data Platform & Analytics Stack

Scalable healthcare data platform with ETL/ELT pipelines (Airflow + Airbyte), multi-layer dbt models (bronze/silver/gold), data quality checks with Great Expectations, and interactive dashboards in Superset.

Airflow dbt Airbyte Superset
GitHub
Computer Vision · Deep Learning
Story Vision — AI Image Captioning

Full-stack AI app generating descriptive captions from images using InceptionV3 encoder + LSTM decoder with attention mechanism. FastAPI backend with async file handling, React frontend with drag-and-drop and dark mode.

PyTorch InceptionV3 LSTM FastAPI React
GitHub
Education

Engineering track with mathematics
and data foundations

Ecole Nationale des Sciences Appliquées d'Al Hoceima

Engineer's Degree — Data Engineering

Oct 2023 – 2026 Current · Final Year

Ecole Nationale des Sciences Appliquées de Fès

Integrated Preparatory Classes — Applied Mathematics

Oct 2021 – Jul 2023 Completed
Skills

Full-stack data capability

From raw ingestion to deployed models and board-ready recommendations.

Languages & Libraries
PythonNumPyPandasScikit-learnTensorFlowKerasPyTorchNLTKGensimSpaCySQLJava
📊
Analytics & Machine Learning
RegressionSVMDecision TreesXGBoostRandom ForestGradient BoostingClusteringCNNRNN / LSTMBiLSTMGRUNLPLLMsRAGStatistical ModelingA/B TestingPower BITableauSeaborn
🔧
Data Engineering
Apache KafkaApache SparkdbtAirflowDatabricksDelta LakeSnowflakeBigQueryAirbytePostgreSQL
🤖
AI & LLMs
LangChainLangGraphRAGChromaDBCrewAIHuggingFaceLangfuseMCPLiteLLM
Cloud & DevOps
GCPAzureAWSDockerKubernetesTerraformPrometheusGrafanaCI/CD
Certifications

Verified credentials

BCG X Data Science Simulation

Forage  ·  Mar 2026

BCG GenAI Consulting Simulation

Forage  ·  Feb 2026

Oracle Cloud GenAI Professional

Oracle  ·  Sep 2025

Spark, Hadoop & Snowflake — Big Data Specialization

Duke University  ·  Oct 2025

AWS Cloud Quest: Cloud Practitioner

Amazon Web Services  ·  Apr 2025

Neo4j Certified Professional

Neo4j  ·  Feb 2025

AI Agents Fundamentals

HuggingFace  ·  Apr 2025
Contact

Let's talk data
& strategy

Open to consulting internships and full-time roles in data analytics & AI.
Based in Morocco — open to relocate.

josuekpatcha1@gmail.com
+212 710 84 81 11