Glue, S3, Redshift, EMR, Lambda, Step Functions, Kinesis,
Athena, IAM.
Strong programming skills in Python, PySpark, and Apache
Spark for data processing and transformation.
In-depth understanding of data modeling (conceptual, logical,
physical) and design principles.
Experience with ETL/ELT frameworks, data governance, and
data quality/security practices.
Familiarity with data warehouse systems (on-premise and
cloud) and migration strategies.
Experience with Snowflake, Dataiku, or Alteryx.
Exposure to Veeva API integration (a plus, not mandatory).
Understanding of DevOps for Data Engineering, including CI/CD
pipelines and Infrastructure as Code (Terraform or
CloudFormation).
Soft Skills
Strong analytical thinking, problem-solving, and
communication skills.
Passionate about clean, efficient, and scalable data systems.
Self-motivated, with a continuous-learning mindset and a
proactive sense of ownership.