Work experience:
Axenix (ex-Accenture) 1y 10m
Data Engineer
Top-3 local bank, reporting migration project (data marts DWH) 07/2022 – Present
• Helped establish a technical partnership between the company and a top-3 local cloud provider. Migrated MySQL data to ClickHouse using Terraform (IaC).
• Introduced a "test early" approach to the team using component testing and quality matrices, reducing the release cycle by 15%.
• Migrated 13 data marts from Teradata to a custom Python ELT framework with Airflow and Greenplum.
• Documented the end-to-end migration process in the wiki for subsequent iterations, which speeds up onboarding of new team members.
Stack: Airflow, Python, OLAP Greenplum 6, SQL, Liquibase, Git, Terraform, Data vault.
Top-10 local bank, reporting migration project (core DWH) 10/2021 – 06/2022
• Reduced disk consumption 10x by refactoring ETL from snapshot to incremental loading.
• Optimized MPP settings in DataStage steps, which fixed joins and sped up pipelines by 15%.
• Documented and structured the team’s release policy, reducing release time by 25%.
• Helped manage a mixed analyst/dev team (5 people).
Stack: IBM DataStage 11, OLAP IBM DB2 11, SQL, HDFS, Data vault.
Automation-service 3y 3m
Data Engineer
Industrial software development; industrial monitoring product (OLTP) 06/2018 – 09/2021
• Successfully advocated for adding a web app prototyping stage before implementation on the primary stack, which cut release cycle time by 40% amid rapidly changing requirements.
• Structured data loading from OLTP into the warehouse using reusable MS SSIS templates.
• Designed and implemented normalized relational data structures and aligned naming conventions.
• Designed REST API interfaces for the OLTP database serving as the backend of an industrial monitoring system.
• Proposed using Postman for an ever-growing API library, which made my team lead happier.
• Mentored 4 interns into SQL developers.
Stack: MS SQL Server 2016, MS SSIS, REST API, T-SQL, Postman, Kimball.
Education:
Industrial cyber-physical systems, IoT
Engineering Master's, ITMO University
2020-2022
Professional and other skills:
• SQL (window, analytical functions; optimization)
• Python (sqlalchemy, pandas, numpy, requests, bs4)
• RDBMS: Greenplum, PostgreSQL, ClickHouse, MS SQL Server, IBM DB2, BigQuery
• Data modeling: Data vault, Kimball
• NoSQL: Neo4j (graph db)
• Clouds: MS Azure, Yandex Cloud
• ETL: Spark, IBM DataStage, MS SSIS
• ELT: dbt (data build tool)
• Data lake: S3, HDFS (Hadoop)
• Orchestration: Airflow, Prefect
• DevOps: Terraform, GitLab CI, Docker
• VCS: Git, Liquibase
• English B2
Additional information:
Experienced data engineer with 5+ years of delivering successful projects. Helped my company establish a partnership with a cloud provider. Skilled in solving complex data challenges and turning raw data into actions. Strong understanding of the modern data stack, including Python, SQL, dbt, Airflow, Spark, HDFS, S3, MS Azure, and Terraform. Currently aiming to become a Data Solution Architect.
Ready to relocate with sponsorship; no specific work permits. Looking for a B2B contract as a Georgian individual entrepreneur. Available to start fully remote from tomorrow, 8 hours a day within 06:00 to 18:00 UTC.