Available from April 2026

El Mehdi Bechnikha

Data Scientist & Data Engineer

Building end-to-end intelligent systems — from LLM-powered document pipelines at AXA France to large-scale Databricks engineering. I bridge rigorous data science with production-grade engineering.

2+
Years Exp.
10×
Speedup
30
M Rows
0
AI Use Cases
🧠 NLP / GenAI
👁️ Computer Vision
☁️ Azure Databricks
🤖 LLM · RAG · OCR
🗼 Paris, France
01 — Profile

About Me

I'm a Data Scientist & Data Engineer based in Paris with 2+ years of experience delivering AI-driven systems in demanding financial environments.

At AXA France, I led end-to-end AI pipelines: OCR-based document intelligence, multi-label classification, LLM-powered extraction, and large-scale PySpark engineering on Azure Databricks.

My dual background — Ingénieur Statistiques & Recherche Opérationnelle (INSEA) and M2 Machine Learning (Paris Cité) — lets me bridge rigorous mathematical foundations with real production software.

🧑‍💻 Builder
📊 Data-driven
🔬 Researcher
🚀 Production-focused
🌍 Trilingual
📞 Phone
07 58 45 99 48
💼 LinkedIn
📍 Location
Paris, Île-de-France
🌐 Languages
FR · EN · AR
✅ Status
Open · April 2026
2+
Years professional AI/ML experience
🚀
10×
Clause extraction speedup at CACIB
📊
30
Million data rows processed at CACIB
🎯
0
AI use cases shipped to production
🐍 Python
PySpark
🧠 TensorFlow
🤖 Mistral LLM
👁️ YOLOv8/v9
☁️ Azure Databricks
🔗 LangChain
🔍 Elasticsearch
🌲 LightGBM
📄 Surya OCR
🔁 Azure DevOps
📊 Power BI
🔧 Dataiku
📐 Scikit-learn
🗄️ SQL / NoSQL
🖥️ Streamlit
02 — Career

Work Experience

📅 Oct 2025 — Feb 2026
Fixed-term contract (CDD)
AXA France
🏢 IARD & Partenariats · Nanterre
Data Scientist / Data Engineer
  • Designed and shipped an automated accounting audit pipeline 🏗️ on Azure Databricks.
  • Led migration to Databricks — significantly reduced monthly compute time ⚡.
  • Processed large-scale financial data with PySpark: business rule enforcement and data quality gates 📊.
  • Industrialised with Azure DevOps pipelines and Git 🔁.
  • Built Power BI dashboards enabling autonomous audit workflows 📈.
☁️ Azure Databricks · ⚡ PySpark · 🔁 Azure DevOps · 📊 Power BI · SAS · 🐍 Python
📅 May 2024 — May 2025
Fixed-term contract (CDD)
AXA France
🏥 Santé & Collectives · Nanterre
Data Scientist
  • Built a full GenAI / NLP pipeline 🤖 centralising Savings-Retirement contract data.
  • Document preprocessing and text extraction with Surya OCR 📄.
  • Multi-label classification with LightGBM 🏷️.
  • Signature/stamp detection via fine-tuned YOLOv9 👁️.
  • Contract data extraction using GPT‑4o with few-shot prompting ✨.
🤖 LLM · 👁️ YOLOv9 · LightGBM · Surya OCR · 📝 NLP · ☁️ Databricks
📅 Mar 2023 — Sep 2023
Internship
Crédit Agricole CIB
🏦 AI Factory · Paris
Data Scientist
  • Built an NLP clause-extraction tool that cut extraction time from 1–2 weeks to 10 minutes 🚀.
  • Trained YOLOv8 for zone detection; OCR + Elasticsearch pipeline 🔍.
  • Analytics POC: loan opportunity detection — 30M rows, ML price modelling in Dataiku 📊.
👁️ YOLOv8 · 🔤 Tesseract OCR · 🔍 Elasticsearch · Dataiku · 🐍 Python
📅 Mar 2022 — Jul 2022
Internship
ONCF
🚄 Revenue Management · Rabat
Data Scientist / Operations Research
  • Deployed a Revenue Management strategy 💰 to optimise HSL train occupancy.
  • Passenger traffic forecasting using LSTM deep learning 🧠.
  • Revenue allocation via linear programming; Streamlit dashboard 📱.
🧠 LSTM · Deep Learning · Linear Programming · 🖥️ Streamlit · MySQL
03 — Expertise

Technical Skills

🧠 AI / Machine Learning
⚙️ Machine Learning
95%
📝 Deep Learning / NLP
90%
🤖 GenAI / LLM / RAG
88%
👁️ Computer Vision
86%
📐 Operations Research
82%
2+
Years professional AI/ML experience
🐍
96%
Python mastery — primary stack
☁️ Engineering & Cloud
🐍 Python
96%
☁️ Azure Databricks
85%
⚡ PySpark / Big Data
83%
🗄️ SQL / NoSQL
80%
📊 Power BI / Dataiku
80%
🛠️ Full tech stack
🐍 Python · ⚡ PySpark · 🧠 TensorFlow · ⚙️ Scikit-learn · 🔗 LangChain · 🐼 Pandas · 👁️ YOLOv8/v9 · 🌲 LightGBM · 🤖 Mistral LLM · 📄 Surya OCR · 🔤 Tesseract · ☁️ Azure Databricks · 🔁 Azure DevOps · 🔍 Elasticsearch · 🗄️ MySQL · 📊 Power BI · 🔧 Dataiku · 🖥️ Streamlit · 📁 Git · 📈 R · 📑 LaTeX · 📉 SAS
04 — Formation

Education

📅 2022 — 2023
🎓 Master 2 — Machine Learning for Data Science
🏫 Université Paris Cité
📍 Paris, France 🇫🇷
Machine Learning · Deep Learning · NLP · Time Series · Text Mining · Data Engineering
📅 2019 — 2022
🏗️ Diplôme d'Ingénieur — Statistiques & Recherche Opérationnelle
🏫 INSEA — Institut National de Statistique et d'Économie Appliquée
📍 Rabat, Morocco 🇲🇦
Advanced Statistics · Operations Research · Optimisation · Metaheuristics · Programming
📅 2017 — 2019
📐 CPGE — Concours National Commun — Maths-Physique
🏫 Lycée Moulay Youssef
📍 Rabat, Morocco 🇲🇦
🏅 Certifications & Awards
🏅 Dataiku Core Designer
📚 Udemy — Machine Learning A–Z
🏆 Moroccan Basketball Champion 2014
🥈 8th place Euro Pacé Basketball 2013
05 — Let's connect

Get In Touch

Open to new opportunities 🚀

Looking for Data Scientist, Data Engineer, or ML Engineer roles in Paris or remote. If you're building something ambitious with AI, I'd love to hear about it! 🤝

📧 Send an email →