Summary
Overview
Work History
Education
Skills
Timeline
Generic

Neeraj Avutu

Data Engineer
Hyderabad

Summary

Results-driven Data Engineer with experience at Publicis Sapient, specializing in ETL development and data pipeline design. Successfully migrated legacy systems to big-data technologies, enhancing performance and scalability. Proficient in Python and AWS, while mentoring junior team members to foster collaboration and innovation within the team.

Overview

7
7
years of professional experience
6
6
years of post-secondary education

Work History

Data Engineer

Publicis Sapient
Hyderabad
09.2021 - Current
  • Optimized data processing by implementing efficient ETL pipelines and streamlining database design.
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • Enhanced data quality by performing thorough cleaning, validation, and transformation tasks.
  • Automated routine tasks using Python scripts, increasing team productivity and reducing manual errors.
  • Migrated legacy systems to modern big-data technologies, improving performance and scalability while minimizing business disruption.
  • .Fine-tuned query performance and optimized database structures for faster, more accurate data retrieval and reporting.
  • Provided technical guidance and mentorship to junior team members, fostering a collaborative learning environment within the organization.
  • Led end-to-end implementation of multiple high-impact projects from requirements gathering through deployment and post-launch support stages.
  • Adding new data sources to the existing lambda pipelines.
  • Migrating the existing data pipelines to DBT-airflow pipelines
  • Using Athena for utilizing the parallel processing capability of Snowflake
  • Optimizing the Power BI reports and also implementing incremental refresh for faster update of the underlying data
  • Worked on migrating cloud resources from Cloud formation to Terraform based deployments

Data Engineer

Hudson Data
Gurgaon
03.2018 - 08.2021
  • Automated routine tasks using Python scripts, increasing team productivity and reducing manual errors.
  • Migrated legacy systems to modern big-data technologies, improving performance and scalability while minimizing business disruption.
  • Designed data models for complex analysis needs.
  • Developed and delivered business information solutions.
  • Implementing and Using Gradient Boosting Machines (GBM), Logistic Regression, Linear Regression, XgBoost, Random Forest algorithms for building ML Models.
  • For hyperparameter tuning, implementing and using Cartesian, Random Discrete, Bayesian Approach for selecting optimal hyperparameters Implementing and using Forward Selection, Backward elimination and Stepwise selection for feature selection.
  • For speeding up the time taken for implementing the model in production, implemented the first version of an Java based approach for packaging the model which has helped them integrate the code as an API in the client systems

Education

Master of Science - Computer Science

Blekinge Institute of Technology
Sweden
01.2016 - 06.2018

Bachelor of Technology - Computer Science And Engineering

JNTU College of Engineering Hyderabad
Hyderabd
09.2012 - 12.2015

Skills

ETL development

Data pipeline design

Data modeling

Data warehousing

AWS

GCP

Airflow

Python

SQL

Java

Lambda

Data Structures and Algorithms

Timeline

Data Engineer

Publicis Sapient
09.2021 - Current

Data Engineer

Hudson Data
03.2018 - 08.2021

Master of Science - Computer Science

Blekinge Institute of Technology
01.2016 - 06.2018

Bachelor of Technology - Computer Science And Engineering

JNTU College of Engineering Hyderabad
09.2012 - 12.2015
Neeraj AvutuData Engineer