Experienced Data Engineer specializing in designing and implementing scalable data pipelines and solutions using AWS, PySpark, and Python. Proven track record in building robust ETL frameworks, real-time and batch data ingestion pipelines, and managing large-scale data processing through distributed systems. Proficient in containerizing applications with Docker and deploying data workflows on cloud-native infrastructure. Skilled in handling diverse data types, including semi-structured and unstructured text data, with practical experience in developing predictive analytics models and conducting text mining for NLP-based applications. Collaborative team player ensuring data availability, quality, and actionable insights for business stakeholders.