A self-motivated person who believes in growth and is very open to new ideas. I am looking for the place where my skills and expertise is being fully utilized and prove to be an asset to the company.
8+ Years of overall IT experience in Data warehousing solutions into SPARK, Google Cloud Platform (GCP), PYSPARK, Big Data Hadoop ETL, PYTHON, Informatica, Oracle, RDBMS 5+ Years of exclusive experience as Data Engineer in PYSPARK, SPARK, Hadoop and BigData eco-system components like HDFS, Hive, Sqoop, NiFi, NoSQL 2+ Years into Google Cloud (GCP) infrastructure Extensively drive the data processing through SPARK-SQL and SPARK-CORE Experience in BigQuery, Data Flow, DataProc, GCS, Cloud SQL, Cloud Function, Cloud Scheduler and Cloud Storage Expertise in SQL, ETL, Informatica, Oracle, Snowflake and NO-SQL. Knowledge on SPARK-STREAMING Experienced in creating Data Pipeline and loading large sets of structured, semi-structured and unstructured data in TB, GB into file system like GCS, HDFS using Hadoop/SQOOP, into RDBMS using Informatica and into Database using Oracle External Loader Extensively worked on data extraction, transformation and loading data from Application source system, Log server, RDBMS, Flat files, Excel. Strong HIVE-SQL, SQL, SPARK-SQL, SPARK-CORE Understanding in end-to-end Data Warehousing concept and ETL processing from Source to Data Mart/Data Lake Hand on experience in performance optimization in tuning, identifying and resolving performance bottlenecks in various levels of data pipelines and transformation. Involved in total SYSTEM DEVELOPMENT LYFE CYCLE (SDLC) Experience in UNIT testing in (Hadoop, ETL and DB Testing) and Data validation. Knowledge on scheduling tools like Active Batch and AirFlow and Informatica Scheduler. Excellent communication, documentation and presentation skills.
Oracle 11g(xe)
Google Cloud Data Engineer
Google Cloud Data Engineer