Data Engineering
Designing and automating the flow from raw, multi-source data to clean, queryable tables that are ready for analysis.
Data Engineer / Pipelines, Big Data & Cloud
I build data pipelines that turn complex, multi-source data into clean, reliable datasets.
Mathematics graduate (Data Science, USJ) who designs ETL pipelines, relational data models, and analysis-ready datasets in Python and SQL to support analytics and machine learning.
I am a Mathematics graduate (Data Science option, USJ) specialising in data engineering. My work centres on designing data ingestion and ETL pipelines, building relational data models, and turning large, multi-source datasets into clean, well-structured tables that are ready for analysis.
I work mainly in Python and SQL, with hands-on experience integrating satellite, weather, and transactional data (including a dataset of more than 500,000 rows) and automating multi-stage processing workflows. I hold the IBM Data Science Professional Certificate and am currently preparing for the Google Cloud Cloud Data Engineer Professional Certificate.
I am looking for an entry-level Data Engineer role where well-built pipelines and well-structured data give analytics and machine learning teams a dependable foundation to work from.
Grouped across the data lifecycle, with data engineering as the core of my work.
Designing and automating the flow from raw, multi-source data to clean, queryable tables that are ready for analysis.
A selection of projects covering data ingestion, modelling, and transformation, with analytics, dashboards, and applications built on top.
Co-developed with Ibrahim Rajab
Built an automated geospatial data pipeline on Google Earth Engine, ingesting and integrating four satellite, radar, and climate sources (NASADEM, ESA WorldCover, ERA5-Land, Sentinel-1) into a 521,664-row labelled dataset across 572 grid cells and five winters. A 22-feature transformation layer fed 11+ classifiers, delivering a tuned XGBoost + MLP soft-voting ensemble. Shipped a Flask web app (live risk map, automated report generator, four-language AI copilot) and a 34-page Streamlit dashboard.
572 grid cells, five winters (2020 to 2024)
Built a 6-stage data pipeline on 19,960 orders (4,338 customers, GBP 10.6M revenue): ingestion, cleaning, SQL transformation in SQLite, RFM segmentation, forecasting, and reporting. Modelled 11 RFM segments and surfaced Champions driving GBP 5.7M of GBP 8.9M known-customer revenue, plus an 8-week forecast and a 15-visual Power BI layer.
Built a reproducible NLP data pipeline: curated and balanced a 2,400-essay corpus (human PERSUADE 2.0 vs. GPT, Copilot, DeepSeek, Gemini), then ran preprocessing, TF-IDF feature extraction, POS tagging, and clustering. Benchmarked 5 classifiers including a GRU; Linear SVM reached 0.80 accuracy, 0.83 F1, and 0.99 ROC-AUC, deployed via a Flask app with auth, quotas, and PDF/DOCX/TXT input.
Cleaned and feature-engineered historical flight records (delay-ratio, seasonal and temporal features), then applied K-Means (K=3, elbow + silhouette validated) to surface operational delay profiles, visualised with PCA. Benchmarked regression and classification models for delay prediction and delivered an interactive Streamlit dashboard.
Designed a relational SQLite schema and seeded 330 products behind a Flask app with CSRF protection, rate limiting, soft-delete audit logs, and an admin dashboard (CSV exports, coupons, application tracking). Authored 15 SQL analytics queries covering revenue, top sellers, coupon effectiveness, and customer lifetime value.
Google Cloud / Coursera
IBM / Coursera
Université Saint-Joseph (USJ), North Lebanon
Relevant coursework
UN Global Compact, Ambassador of Change
2026 - PresentAdvocating the UN Global Compact's Ten Principles and Sustainable Development Goals, and supporting local engagement initiatives.
SDG Brain Lab 5.0, Delegate & Spokesperson
UN Global CompactLed the Fair Trade Lebanon challenge and co-developed LOCAL+, a certification, QR-transparency, and retail-activation system helping Lebanese products compete with imported brands.
RoadRescue AI x RoadChip, Entrepreneurship Competition
USJ BIAT, May 2026Developed and pitched a startup concept for the USJ BIAT entrepreneurship competition: a two-product idea pairing an AI roadside-assistance assistant with a Bluetooth vehicle-monitoring device. Prepared the market analysis, business model, and financial projections, and presented the pitch to the judging panel.
Global Mentorship Initiative (GMI), Mentee
Completed Dec 2025Structured mentorship with an international industry professional on personal branding, interviewing, and global business communication.
Google Developer Student Club (GDSC), Social Media Designer
Nov 2023 - May 2024Led visual-content strategy and event branding for a campus tech community in USJ North Lebanon.
I am open to entry-level Data Engineer roles and data pipeline work. The fastest way to reach me is by email.