Summary

I am a dedicated Middle Data Engineer with over 5 years of experience, focused on designing and implementing data pipelines to support business intelligence across e-commerce, healthcare, logistics, and financial services sectors. Skilled in Apache Airflow, Spark, and AWS, I build scalable data solutions that enhance data accessibility and reliability. I thrive in collaborative environments, working with cross-functional teams to optimize data workflows and deliver actionable data infrastructure.

Technical proficiencies

Programming Languages: Python, SQL

Skills:

  • Data Engineering: Apache Airflow, Apache Spark, ETL Development
  • Databases: MySQL, PostgreSQL, MongoDB
  • Data Processing: Pandas, PySpark
  • Cloud Platforms: AWS (Redshift, S3, Glue), Google Cloud Platform
  • Others: Data Pipeline Optimization, Cloud Integration, Schema Design

Tools: Docker, Kubernetes, Git, Jenkins, Visual Studio Code.

Professional experience

E-commerce Data Pipeline Project - Japan  

Data Engineer, 07/2023 - Present  

Project description 

  • Developed and maintained data pipelines for an e-commerce platform to process real-time sales and inventory data.

Responsibilities 

  • Designed and implemented ETL pipelines using Apache Airflow to ingest data from MySQL into AWS Redshift.
  • Optimized data processing workflows with Apache Spark, reducing latency by 30% for daily reports.
  • Collaborated with data analysts to ensure data quality and schema consistency across pipelines.
  • Integrated AWS Glue for automated data cataloging and transformation.
  • Monitored pipeline performance using CloudWatch, resolving issues to maintain 99.9% uptime.
  • Documented pipeline architecture to support team onboarding and future scaling.

Technologies 

  • Apache Airflow, Apache Spark, Python, SQL, MySQL, AWS Redshift, AWS Glue, PySpark

Healthcare Data Integration Project - United States

Data Engineer, 05/2022 - 06/2023

Project description 

  • Built data integration solutions to consolidate patient records and operational data for a U.S. healthcare provider.  

Responsibilities 

  • Constructed ETL processes using Python and Apache Spark to transform data from PostgreSQL to AWS S3.
  • Worked with healthcare teams to design data models for patient outcome analysis.
  • Implemented data validation checks to ensure 100% data integrity during migration.
  • Deployed pipelines on AWS Glue, improving data availability for reporting by 25%.
  • Assisted in troubleshooting pipeline failures, reducing downtime by 15%.

Technologies 

  • Apache Spark, Python, SQL, PostgreSQL, AWS S3, AWS Glue, PySpark

Financial Services Data Workflow Project - Singapore

Data Engineer, 04/2021 - 04/2022

Project description 

  • Supported a financial services firm with data workflows for transaction processing and fraud detection.  

Responsibilities

  • Built ETL pipelines using Python to process transaction data from MongoDB.
  • Optimized data ingestion for real-time fraud detection systems.
  • Collaborated with analysts to ensure data accuracy for reporting.
  • Documented workflows for future reference.

Technologies

  • Python, SQL, MongoDB, AWS S3

Logistics Data Processing Project - Australia

Data Engineer 05/2020 - 03/2021

Project description 

  • Supported a logistics firm with data processing for shipment tracking and route optimization.  

Responsibilities

  • Built batch processing scripts using Python to handle shipment data.
  • Integrated data into MySQL databases for real-time tracking.
  • Collaborated with developers to optimize query performance.

Technologies

  • Python, SQL, MySQL
Certifications

AWS Certified Data Engineer - Associate, 2023

English: Advanced  

  • IELTS 6.5

Navigating OurCooperation Models

We assess your needs first. Then, we will send you the top software engineer CV options so that you can select your favorite. The chosen engineer becomes part of your in-house team.

Ideal for businesses that:
Need specialized expertise but don't want to hire full-time staff
Want to scale resources up and down quickly
Require extra support for upcoming or ongoing projects
You can choose from our numerous software developer CV options. The selected developers form a dedicated team that works exclusively on your project. They also collaborate closely with your in-house team to achieve your goals.

Ideal for businesses that:
Require cost-effective and scalable solutions for large and long-term projects
Want to form a consistent team with excellent skills
Need a development team committed to their business goal
We define a clear path for your project. Since the project has clear timelines and scopes, you can control your budget better. You can also choose to work with a remote team or manage specialized technical roles.

Ideal for businesses that:
Have a set budget and clearly outline the project scope
Struggle with strict deadlines
Handle projects with clear goals, a detailed outline, and achievable milestones
How to Hire Top Developers from Saigon Technology?