Summary

I’m a seasoned Senior Data Engineer with 7.5 years of experience, passionate about building scalable data solutions for e-commerce, healthcare, and energy sectors. With expertise in Apache Spark, AWS, and Python, I design robust data pipelines that drive business insights. I enjoy mentoring junior engineers, tackling complex data challenges, and collaborating with cross-functional teams to deliver reliable, high-performance systems.

Technical proficiencies

Programming Languages: Python, SQL, Scala

Skills:

  • Data Engineering: Apache Spark, Apache Airflow, ETL/ELT Development
  • Databases: PostgreSQL, MySQL, Redshift
  • Data Processing: PySpark, Pandas, Delta Lake
  • Cloud Platforms: AWS (S3, Redshift, Glue, EMR), Azure
  • Others: Data Pipeline Optimization, Data Lake Architecture, Team Leadership

Tools: Docker, Kubernetes, Git, Jenkins, Databricks.

Professional experience

E-commerce Data Platform - Netherlands

Senior Data Engineer, Jun 2023 - Present

Project description 

  • Led the development of a data platform for an e-commerce company to handle real-time customer and sales data across Europe.

Responsibilities 

  • Designed and deployed ETL pipelines using Apache Airflow and AWS Glue to process data from PostgreSQL into Redshift.
  • Optimized Spark jobs on AWS EMR to handle 1TB+ daily data, cutting processing time by 25%.
  • Collaborated with data scientists to build a Delta Lake for machine learning model training.
  • Mentored two junior engineers on PySpark and cloud best practices.
  • Monitored pipeline health with CloudWatch, ensuring 99.95% uptime.
  • Worked with product teams to integrate real-time analytics into the platform.

Technologies 

  • Apache Spark, Apache Airflow, Python, SQL, PostgreSQL, AWS Redshift, AWS Glue, PySpark, Delta Lake

Healthcare Data Integration - Sweden

Senior Data Engineer, Mar 2020 – May 2023

Project description 

  • Built data integration systems to consolidate patient data for a Swedish healthcare network.  

Responsibilities 

  • Developed Spark-based ETL processes to transform data from MySQL to AWS S3, supporting 50+ clinics.
  • Implemented data validation scripts in Python to ensure HIPAA compliance.
  • Worked with architects to set up a data lake on AWS, improving data accessibility by 30%.
  • Automated pipeline scheduling with Airflow, reducing manual effort by 40%.
  • Troubleshot performance issues, enhancing query response times by 15%.
  • Supported analytics teams with ad-hoc data extracts and schema adjustments.

Technologies 

  • Apache Spark, Python, SQL, MySQL, AWS S3, Apache Airflow, PySpark

Energy Data Pipeline - Brazil

Data Engineer, Sep 2018 – Feb 2020

Project description 

  • Supported an energy company with data pipelines for consumption and grid performance analysis.  

Responsibilities

  • Built Python scripts to process grid data and load it into PostgreSQL.
  • Assisted in setting up initial ETL workflows for daily reporting.
  • Collaborated with engineers to optimize data storage.
  • Documented pipelines for team handover.

Technologies

  • Python, SQL, PostgreSQL
Certifications

AWS Certified Big Data - Specialty, 2022

English: Advanced  

  • IELTS 7.0

Book an Appointment

Navigating OurCooperation Models

We assess your needs first. Then, we will send you the top software engineer CV options so that you can select your favorite. The chosen engineer becomes part of your in-house team.

Ideal for businesses that:
Need specialized expertise but don't want to hire full-time staff
Want to scale resources up and down quickly
Require extra support for upcoming or ongoing projects
You can choose from our numerous software developer CV options. The selected developers form a dedicated team that works exclusively on your project. They also collaborate closely with your in-house team to achieve your goals.

Ideal for businesses that:
Require cost-effective and scalable solutions for large and long-term projects
Want to form a consistent team with excellent skills
Need a development team committed to their business goal
We define a clear path for your project. Since the project has clear timelines and scopes, you can control your budget better. You can also choose to work with a remote team or manage specialized technical roles.

Ideal for businesses that:
Have a set budget and clearly outline the project scope
Struggle with strict deadlines
Handle projects with clear goals, a detailed outline, and achievable milestones
Let's Discuss Your Needs
How to Hire Top Developers from Saigon Technology?