Strong programming skills in Python and SQL.
Deep knowledge of data science and machine learning
libraries such as Pandas, Scikit-learn,
TensorFlow, and PyTorch.
Proven experience in building, training, and deploying
machine learning models into production
systems.
Hands-on experience with big data technologies (Hadoop,
Spark).
Strong working knowledge of AWS cloud services (S3, EC2,
EMR, Lambda, SageMaker, etc.).
Experience working with Databricks and Airflow for data
workflow management.
Excellent problem-solving skills and the ability to work in
collaborative, fast-paced environments.
Preferred Qualifications:
Experience with MLOps practices and CI/CD for model
deployment.
Familiarity with data versioning tools (e.g., DVC, MLflow).
Strong understanding of data engineering concepts and
distributed computing.