
Led ETL test strategy discussions and reviewed test artifacts to ensure comprehensive coverage and complianceProject Overview:
Responsible for migrating large-scale claims, members, and provider data from an on-premises Hadoop ecosystem to Azure Cloud. Rebuilt ETL pipelines using Azure Data Factory and Databricks, converted HiveQL scripts to Spark SQL/PySpark, and migrated historical and incremental datasets to Azure Data Lake. Performed end-to-end ETL testing, data reconciliation, transformation verification, incremental load validation, and ensured compliance with HIPAA standards. Collaborated with data engineers, architects, and business analysts to ensure zero data loss, and optimized pipelines for accuracy and scalability.
Key Responsibilities:
Key Achievements:
Project Overview:
Worked on migrating large-scale claims, member, and provider data from legacy Oracle systems to a Hadoop ecosystem. Rebuilt ETL pipelines, converted PL/SQL scripts to HiveQL and PySpark, and migrated historical and incremental datasets. Conducted end-to-end ETL testing, data reconciliation, transformation verification, incremental load validation, and compliance checks. Collaborated with ETL developers, data engineers, and business analysts to ensure data integrity, zero data loss, and optimized Hadoop workflows.
Key Responsibilities:
Project Overview:
Performed end-to-end testing of a large-scale data warehouse, integrating multiple business data sources. Validated ETL workflows to ensure accurate extraction, transformation, and loading of data into the target Oracle database, supporting business intelligence, and reporting requirements.
Key Responsibilities:
ETL Testing
Hadoop, HDFS, HiveQL, Spark, PySpark
Oracle, SQL Server, Teradata, Hive, SQL
Azure Data Factory (ADF), Azure Data Lake, Apache Airflow
JIRA, Git, Agile Methodologies, Test Case & Defect Management
Python, Shell Scripting