Summary
Overview
Work History
Education
Skills
Timeline
Generic

Nur-Akhmet Baimakhan

Almaty

Summary

Data Engineer with experience in developing and optimizing ETL processes, working with big data, and databases such as Oracle and PostgreSQL. Proficient in PL/SQL, Python. Experienced in working with AWS services such as S3, Lambda, Glue and Redshift. Skilled in Informatica PowerCenter, Apache Kafka, and Apache Airflow for data integration, streaming, and workflow orchestration. Interested in learning new technologies and tools for professional growth and solving complex problems.

Overview

4
4
years of professional experience

Work History

Data Analyst

Payda
Astana
02.2021 - 10.2021
  • Increased monthly sales by 10% by collaborating with the marketing department.
  • Increased active customers by 15% by implementing customer retention strategies.
  • Developed monthly reports and dashboards using Power BI for data-driven decision-making.

Data Engineer

Bank CenterCredit
Almaty
10.2021 - Current
  • Development, maintenance, and optimization of ETL processes in Informatica PowerCenter and Apache Airflow.
  • Experience with Oracle PL/SQL: optimizing complex queries, developing stored procedures, and automating data processing workflows.
  • Supporting real-time microservices with Apache Kafka to ensure seamless data streaming and processing.

Projects:

1. Customer Selection Process

Migration from Oracle to Informatica

  • Led the full migration of the customer segmentation process for the #kartakarta product, transitioning from PL/SQL-based Oracle procedures to Informatica PowerCenter.
  • Optimized 10+ complex ETL procedures, improving data processing speed and reducing execution time.

2. Migration of 50+ ETL Pipelines from Informatica to Apache Airflow

  • Successfully redesigned and migrated over 50 ETL workflows from Informatica PowerCenter to Apache Airflow, improving orchestration and monitoring capabilities.
  • Enhanced scalability and automation by integrating Airflow DAGs with cloud-based storage and processing systems

3. Fraud Detection Microservice using Apache Kafka & AWS

  • Designed and deployed a real-time fraud detection system for financial transactions using Apache Kafka and AWS services (S3, Lambda, and DynamoDB).
  • Implemented streaming data processing pipelines that analyze incoming transaction data and detect anomalies and reduced fraudulent transaction processing time by 30%

4. Automated JSON File Parsing System

  • Built a scalable ETL pipeline to ingest and process large volumes of JSON files from various sources, improving data integration efficiency.
  • Developed custom Python scripts for parsing and transforming JSON data into structured formats for database storage.

Education

Bachelor of Science - Mathematics

Nazarbayev University
Astana
05-2020

Skills

  • SQL
  • Python
  • Apache Airflow
  • Apache Kafka
  • AWS
  • Informatica Powercenter

Timeline

Data Engineer

Bank CenterCredit
10.2021 - Current

Data Analyst

Payda
02.2021 - 10.2021

Bachelor of Science - Mathematics

Nazarbayev University
Nur-Akhmet Baimakhan