Cloud Architect with expertise in Generative AI
GyanSys Inc. - Santa Clara, CA
Apply NowJob Description
We are seeking a highly skilled Cloud Architect with expertise in Generative AI, Copilot Studio, and multi-cloud platforms spanning Azure (including Azure AI Foundry), AWS, and Google Cloud.Core Responsibilities:Architect end-to-end Generative AI solutions, including model serving (vLLM, TGI), API integration, and user interaction layers.Design and implement RAG architecture using vector stores, embeddings, hybrid search, and re-ranking to embed enterprise knowledge into LLMs.Create agentic systems, enabling multi-agent collaboration for complex, stateful workflows and reasoning-driven automation.Develop and govern Copilots in Copilot Studio, including connectors, actions, plugins, DLP rules, environment strategy, and integration with Microsoft 365 and enterprise systems.Leverage Azure AI Foundry (prompt flow, evaluators, safety, model orchestration) to operationalize LLM applications at scale.Evaluate and optimize AI system performance, balancing quality, latency, throughput, cost efficiency, and safety compliance.Implement Responsible AI, security, and HITL (HumanintheLoop) controls, ensuring compliance in regulated environments.-in-the-Loop) controls, ensuring compliance in regulated environments.Produce clear, maintainable documentation for architecture, patterns, and operational processes.Required Qualifications:10+ years of experience in cloud architecture or enterprise software engineering.3+ years of hands-on experience designing or delivering Generative AI or LLM applications.Proven experience with Azure AI Foundry, Azure OpenAI, and Copilot Studio (actions, connectors, governance, M365 integration).Experience deploying AI solutions on AWS (Bedrock, SageMaker) and/or GCP (Vertex AI).Hands-on experience with RAG, vector databases (Azure AI Search, Pinecone, OpenSearch, Vertex Matching Engine), embeddings, and hybrid search.Deep understanding of cloud security (IAM/RBAC, Key Vault/KMS, VPC/PrivateLink, token safety).Experience with Kubernetes (AKS/EKS/GKE), containerization, API frameworks (FastAPI, Node.js, .NET), Python, TypeScript, or C#/.NET.Working knowledge of transformer architectures and model adaptation techniques (fine-tuning, LoRA, prompt engineering).Familiarity with AI Ops / MLOps tools such as Prompt Flow, MLflow, SageMaker Pipelines, or Vertex Pipelines.Bachelor's/ Masters in Computer Science, Engineering, Information Systems, Data Science, or related field (required).About GyanSysGyanSys is a leading global system integrator company supporting enterprise customers worldwide. We specialize in solutions implementations, managed services, and data analytics spanning SAP, Salesforce, Microsoft, and other prime enterprise platforms. Using a mature blended delivery model with over 3,000 consultants, we support over 350 enterprise customers across the Americas, Europe, and APAC.
Created: 2026-05-09