Summary

I’m an accomplished Senior Data Engineer with 10.5 years of experience, dedicated to crafting data infrastructures for fintech, healthcare, and manufacturing industries. Proficient in Apache Kafka, Databricks, and Azure, I lead end-to-end data pipeline projects with a focus on scalability and reliability. I thrive on solving complex data problems, guiding teams, and ensuring seamless integration with business needs across global markets.

Technical proficiencies

Programming Languages: Python, SQL, Java

Skills:

  • Data Engineering: Apache Kafka, Databricks, ETL/ELT Development
  • Databases: Oracle, SQL Server, Cassandra
  • Data Processing: PySpark, Dask, Apache NiFi
  • Cloud Platforms: Azure (Data Factory, Databricks), AWS (S3, EMR)
  • Others: Stream Processing, Data Governance, System Architecture

Tools: Docker, Kubernetes, Git, Terraform, Airflow.

Professional experience

Fintech Data Streaming - United Kingdom

Senior Data Engineer, 04/2023 - Present

Project description 

  • Led a data streaming platform for a fintech firm to process real-time transaction data across 15+ countries.

Responsibilities 

  • Designed Kafka-based streaming pipelines to handle 500K+ transactions daily, integrated with Azure Databricks.
  • Built ETL workflows using Azure Data Factory to transform data from Oracle into a data lake.
  • Collaborated with security teams to implement data governance policies on sensitive financial data.
  • Mentored three engineers on stream processing and cloud optimization techniques.
  • Monitored system performance with Azure Monitor, achieving 99.9% data delivery.
  • Worked with product managers to align data outputs with regulatory reporting needs.

Technologies 

  • Apache Kafka, Azure Databricks, Python, SQL, Oracle, Azure Data Factory, PySpark

Healthcare Data Warehouse - France

Senior Data Engineer, 07/2019 - 03/2023

Project description 

  • Oversaw the construction of a data warehouse to unify patient and operational data for a French healthcare provider.  

Responsibilities 

  • Developed PySpark jobs on Databricks to process and load data from SQL Server into Azure Data Lake.
  • Designed data models to support clinical analytics, improving query efficiency by 20%.
  • Automated data ingestion with Apache NiFi, reducing manual tasks by 35%.
  • Led a team of two engineers in troubleshooting and scaling the warehouse.
  • Integrated real-time data feeds using Kafka, enhancing reporting capabilities.
  • Documented architecture for compliance audits and team training.

Technologies 

  • Databricks, PySpark, Apache NiFi, Apache Kafka, SQL, SQL Server, Azure Data Lake

Manufacturing Data Pipeline - Mexico

Data Engineer, 03/2017 - 06/2019

Project description 

  • Supported a manufacturing firm with data pipelines for production and supply chain analytics.  

Responsibilities

  • Built Python scripts to process production data and load it into Cassandra.
  • Assisted in setting up initial ETL processes for daily reports.
  • Collaborated with analysts to refine data schemas.
  • Maintained pipeline documentation for team use.

Technologies

  • Python, SQL, Cassandra

Retail Data Optimization - Spain

Data Engineer, 09/2015 - 02/2017

Project description 

  • Optimized data flows for a retail chain to support inventory and sales forecasting.  

Responsibilities

  • Developed Java-based ETL scripts to process sales data.
  • Integrated data into SQL Server for analysis.
  • Worked with IT to ensure data consistency.

Technologies

  • Java, SQL, SQL Server
Certifications

Databricks Certified Data Engineer Associate, 2021

English: Advanced  

  • IELTS 8.0
Book an Appointment

Navigating OurCooperation Models

We assess your needs first. Then, we will send you the top software engineer CV options so that you can select your favorite. The chosen engineer becomes part of your in-house team.

Ideal for businesses that:
Need specialized expertise but don't want to hire full-time staff
Want to scale resources up and down quickly
Require extra support for upcoming or ongoing projects
You can choose from our numerous software developer CV options. The selected developers form a dedicated team that works exclusively on your project. They also collaborate closely with your in-house team to achieve your goals.

Ideal for businesses that:
Require cost-effective and scalable solutions for large and long-term projects
Want to form a consistent team with excellent skills
Need a development team committed to their business goal
We define a clear path for your project. Since the project has clear timelines and scopes, you can control your budget better. You can also choose to work with a remote team or manage specialized technical roles.

Ideal for businesses that:
Have a set budget and clearly outline the project scope
Struggle with strict deadlines
Handle projects with clear goals, a detailed outline, and achievable milestones
Let's Discuss Your Needs
How to Hire Top Developers from Saigon Technology?