Burak Colak

Data engineer with 3+ years of experience specializing in audit-related automation, database management, and machine learning. Proficient in Python, SQL, and AWS, with a proven track record of optimizing data-driven processes. Passionate about leveraging data to drive decision-making and innovation.

Say hi!

About Me

I am a Data Engineer at MSCI Inc., where I assist in cloud migration efforts, particularly with Oracle SQL databases and Google Cloud Platform services. I also contribute to the development of CI/CD pipelines using Azure Pipelines, enhancing code deployment. Additionally, I work with Docker and Kubernetes to improve application deployment workflows, and I help transition and optimize data processes from Autosys to Apache Airflow.


I am pursuing a BSc in Electrical Engineering at Budapest University of Technology and Economics, where I am part of the Lacrosse team, the Solar Boat team, and VIK-HK. I am deeply interested in data engineering, machine learning, and deep learning, and I love Python for API development and data-related tasks.

I have also worked as a Junior Solution Architect at DevKit, designing and developing backends for ticketing systems using FastAPI and SQL, and as a Data Engineer at Starschema, supporting the development of audit-related automation in Python and Snowflake for BlackRock. I am a proud alumnus of the Kennedy-Lugar Youth Exchange and Study Program, AFS-USA, US Department of State, and Erasmus Youth-pass, and I speak Turkish, English, and German. I am always eager to learn more and to participate in hackathons. You can download my CV from here.


Here are some technologies I have been working with:

  • Python
  • SQL
  • C/C++
  • AWS
  • Linux
  • API
  • Docker
  • CI/CD
  • ETL
  • Apache Airflow
  • Kubernetes
  • Golang
  • Elasticsearch

Experiences

Data Engineer @ MSCI

Oct 2023 - Present

  • Assist in cloud migration efforts, particularly with Oracle SQL databases and Google Cloud Platform services.
  • Contribute to the development of CI/CD pipelines using Azure Pipelines, enhancing code deployment.
  • Containerize and orchestrate applications with Docker and Kubernetes to improve deployment workflows, using Elasticsearch and Fluentd for logging and Grafana for monitoring.
  • Aid in transitioning and optimizing data processes from Autosys to Apache Airflow.

Jr. Solution Architect @ Devkit

Jan 2021 - Present

  • Construct web applications using FastAPI and implement data mining solutions with Docker, Nginx, and Celery.
  • Employ data scraping techniques (Beautiful Soup, Scrapy) and manage backend infrastructure (Nginx, SQL).
  • Enhance system efficiency and reliability through orchestration (Apache Airflow), visualization (Grafana, Elasticsearch, Kibana), and time-series data management (InfluxDB).
  • Implement messaging queues (RabbitMQ) and scalable API logic (FastAPI) to handle high-volume traffic.

Medior Data Engineer @ Starschema (Contracted to BlackRock)

Sept 2022 - Sept 2023

  • Spearheaded the development of multiple audit-related automation processes using Python.
  • Designed intricate Snowflake procedures and facilitated API integrations.
  • Developed and implemented a petabyte-scale, data-driven audit system for financial records in Snowflake, improving accuracy by 95% and reducing audit time by 80% annually.
  • Designed a REST API to automate data modification and creation.
  • Crafted advanced SQL queries for the Snowflake platform.
  • Crafted a machine learning model to find the best nutrition plan for professional athletes.
  • Currently creating a booking platform on AWS for internal use (Lambda, DynamoDB, API Gateway, S3).

Cyber Security Automation Engineering Trainee @ Nokia

Jan 2022 - July 2022

  • Designed a testing framework for security vulnerabilities in the Modular Private WiFi product, reducing security incidents by 30%.
  • Worked with Cloud Native Communication software, Kubernetes, Docker, and Modular Private Wireless.
  • Conducted malware scans for containers and penetration security testing.

Projects

United | Onboarding tool for teams

Python, ReactJS, FastAPI, ML, MongoDB, AWS Lambda, DynamoDB, API Gateway

Won 1st place in a hackathon. Introduced AI-driven feedback analysis for real-time insights.

View Project

Saving Nest

Meta AI, ML (ARIMA) model, Python, NumPy / Pandas

Our algorithm identifies which currencies have been the most stable over time and suggests them for your savings. Now you can save your money without worrying about inflation.

View Project