Architect end-to-end solutions for GenAI-based products
Design scalable, secure, and performant systems for
real-time inference and data processing
Collaborate with product and engineering teams to align
architecture with business goals
Evaluate and integrate new GenAI models (LLMs,
diffusion models, etc.) into production systems
Ensure reliability, observability, and compliance in system
design
Skills Proven experience as a System Architect in AI/ML
product environments
Deep understanding of GenAI/LLM model integration,
APIs, and deployment
strategies
Expertise in cloud architecture (AWS / Azure / GCP)
Strong skills in microservices, event-driven systems,
containerization, and Kubernetes
Familiarity with MLOps practices, inference optimization,
and data pipelines
Excellent problem-solving, communication, and
leadership skills
Nice to Have:
Experience with open-source LLM frameworks
(LangChain, Hugging Face
Transformers, etc.)
Prior contributions to architectural decisions in a SaaS or
platform product
Knowledge of data privacy, security, and
compliance in AI systems