p
prateek_715

Prateek T

@prateek_715

Data Engineer

India
Engels, Hindi
Sommige informatie wordt in het Engels weergegeven.
Over mij
I am a Data Engineer with hands-on experience in PySpark, Kafka, Python, SQL, and the Hadoop ecosystem. Currently, I build large-scale data pipelines and ETL workflows at Infosys, focusing on medallion architecture and Spark optimization. I have a strong foundation in ML-powered data products and experience taking projects from EDA to deployed APIs.... Lees meer

Skills

p
prateek_715
Prateek T
offline • 
Gemiddelde reactietijd: 1 uur

Bekijk mijn diensten

Formules en macro's
I will solve your excel problems

Werkervaring

Infosys

Data Engineer

Infosys • Fulltime

Sep 2025 - Present9 mos

Deployed on Databricks platform; helped build production pipelines processing daily 2–9 GB datasets (7-12 million rows): designed schema transformations for medallion architecture, engineered PySpark optimizations (partition pruning, shuffle hash, broadcast joins), implemented data serialization tuning; optimizations reduced job execution time by upto 20% in some pipelines. Led data quality validation, schema design improvements, and schema evolution to accommodate upstream data changes; worked cross-functionally with team lead and senior engineers on parallelism optimization strategies