Khadija CHAHIDI

The Data Behind Me

I've always been fascinated by the stories data can tell and the power it holds to transform businesses. For me, data isn't just numbers—it's the foundation for innovation, growth, and smarter decision-making.


Certified as an AWS Data Engineer and HCIA-Big Data Associate, I specialize in building end-to-end data pipelines, implementing data warehousing solutions, optimizing systems to unlock the full potential of big data. I'm also passionate about the intersection of data & AI, constantly exploring how intelligent systems can create smarter, more effective solutions.


Committed to continuous learning, I love pushing the boundaries of what's possible with data.

Skills & Services

Data Engineering

Design and implement data pipelines, ETL processes, and data warehouses

Machine Learning & AI

Implementing AI solutions, predictive models, and intelligent data processing

Programming

Python, SQL, R, and frameworks for development and automation

Data Analytics

Data analysis, visualization, and reporting solutions

Cloud Platforms

AWS, Azure, GCP infrastructure and services

Experience

Data Engineer

Omdena | Oct 2024 - Dec 2024

  • Developed an orchestrated ETL pipeline for extracting, transforming, and loading data from various sources.
  • Utilized an LLM to analyze the processed data, classifying news articles based on their relevance and credibility.

Using: Python, BeatifulSoup, Apache Airflow, MongoDB, BERT (LLM), Streamlit

Data Engineer & Scientist

Lear Corporation Engineering | Feb 2024 - Aug 2024

  • Automated data extraction and structuring from clients' specifications for ingestion into management systems, utilizing AI approaches for tables detection in documents.
  • Implemented a Requirement Prediction System based on machine learning algorithms.
  • Significantly reduced the time needed for requirements extraction by 20%, aiding in faster and more accurate analysis.

Using: Python, NLP, OCR, Deep Learning, Machine Learning, MySQL

Data Engineer

Brocoli Data | Aug 2023 - Oct 2023

  • Demonstrated expertise in ETL processes for extracting, transforming, and loading Moroccan data.
  • Designed interactive sales dashboards showcasing market trends and customer insights.

Using: Python, SQL, Docker, DBT, DuckDB, Google Cloud Storage, Looker Studio

Data Scientist

3D SMART FACTORY | Jul 2023 - Sep 2023

  • Craft real-time Financial Prediction Solution.
  • Extracted and transformed real-time financial data.
  • Implemented Apache Kafka to optimize and streamline data ingestion processes.
  • Developed an ARIMA model for accurate financial predictions.
  • Automated the entire workflow, accelerating the delivery of actionable insights.

Using: Python, Apache Kafka, Confluent Cloud, Jenkins, ML(ARIMA)

Open-Source Projects

E-commerceAnalytics Pipeline on GCP

Designed an ETL pipeline to process e-commerce data, automating workflows and enabling insightful dashboards for better decision-making.

PythonSQLAirflowDBTBigQueryGoogle Cloud StorageLooker Studio
GitHub repo →

SDGS Insights

Designed an ETL pipeline to process e-commerce data, automating workflows and enabling insightful dashboards for better decision-making.

PythonPower BIStreamlitPostgreSQL
GitHub repo →

Fifa 2023 Analytics on Azure

end-to-end data engineering project uses Azure services to extract, clean and analyze data to provide insights into FIFA 2023 players performance.

PythonSQLDatabricksAzure SynapseAzure Data FactoryAzure Data Lake StoragePower BI
GitHub repo →

Sentiment Analysis

Developed a sentiment analysis model to classify IMDb movie reviews as positive or negative, achieving 90.6% accuracy.

PythonPysparkNLPMachine LearningFLASK
GitHub repo →

Education & Certifications

Education

National School of Applied Sciences of Al-Hoceima

Data Engineering

2021 - 2024

National School of Applied Sciences of Al-Hoceima

Preparatory Cycle

2019 - 2021

Certifications

Let's Connect

Interested in working together? Get in touch! ✨

Contact Me