Senior Solution Architect - HPC, Cloud-Native Systems
Tech Mahindra - Alpharetta, GA
Apply NowJob Description
Job Summary Senior Solution Architect - HPC, Cloud-Native Systems (ITAR-Restricted Role) Location- Alpharetta. GA Type- FTE (Need independent Visa candidates only as this is ITAR Project) Position Overview: We are seeking a high-performance Senior Solution Architect to lead the convergence of traditional High-Performance Computing (HPC) environments with modern cloud-native architectures. This position is designated as ITAR-restricted, requiring candidates who are legally authorized to access and handle U.S. export-controlled technical data. The architect will design, integrate, and optimize large-scale, containerized, hybrid HPC environments using technologies such as Docker, Mirantis, ELK Stack, and advanced batch schedulers. This role requires deep technical leadership, architectural vision, and hands-on experience supporting mission-critical computational workloads in secure, compliant environments. Core Responsibilities: 1. Architecture & Design Architect end-to-end hybrid cloud solutions integrating Mirantis Container Cloud with dedicated HPC clusters. Balance performance, elasticity, and compliance requirements across on-prem and cloud environments. Produce architecture documentation in adherence with ITAR export-controlled standards and review practices. 2. HPC Orchestration Design and implement HPC job scheduling strategies using Slurm, Volcano, LAVA, or similar technologies. Support deterministic resource allocation for AI/ML analytics, physics simulations, and scientific workloads. Ensure schedulers meet ITAR-restricted workload isolation and audit requirements. 3. Optimization & Performance Tuning Apply best practices for high-performance containerization: multi-stage builds, minimal base images, and resource tuning (CPU, GPU, Memory). Implement strategies to minimize overhead, ensure stability, and eliminate noisy-neighbor issues. 4. Centralized Observability Architect and operate an enterprise-grade ELK Stack (Elasticsearch, Logstash, Kibana) tuned for HPC-scale environments. Manage Index Lifecycle Management (ILM) for massive log throughput while preserving traceability for compliance audits. 5. Full-Stack Automation Build IaC-driven automation pipelines using Terraform, Ansible, and GitOps workflows. Automate deployment of Mirantis Kubernetes Engine (MKE) and integrated HPC schedulers within an ITAR-secured environment. 6. CI/CD Automation Implement robust CI/CD workflows using Jenkins, GitLab CI, Argo Workflows, or similar tools. Ensure pipelines comply with ITAR policies, including artifact access control, secure registries, and encrypted transport. 7. Hybrid Integration Architect integration between Kubernetes and traditional HPC schedulers. Enable advanced workloads requiring high-speed interconnects such as InfiniBand, RDMA, or GPU-accelerated clusters. Required Technical Skills: Containers & Mirantis Expertise in Docker Runtime, Mirantis Kubernetes Engine (MKE), and Lens Desktop management. Deep experience designing containerized workloads for HPC environments. HPC Schedulers Hands-on experience with Slurm, PBS, or Kubernetes-native batch schedulers such as Volcano. Knowledge of hierarchical priority queues, gang scheduling, and resource fairness algorithms. ELK Stack Mastery Strong understanding of Logstash pipeline performance optimization, Elasticsearch shard strategies, and Kibana visualization design. Performance Tools Experience with NVIDIA Enroot/Pyxis or equivalent technologies supporting near-bare-metal container performance. Security & Compliance Implement secure registry solutions, TLS encryption, RBAC, and identity-driven access controls. Demonstrated experience supporting compliance frameworks including ITAR, NIST 800-53, or similar. Experience & Qualifications: Professional Background 10+ years in systems architecture or engineering roles. 5+ years in HPC, Cloud Infrastructure, or enterprise-scale DevOps environments. HPC Knowledge Understanding of MPI (Message Passing Interface), GPU compute workloads, low-latency networks, and distributed parallel frameworks. Certifications Preferred certifications: Certified Kubernetes Administrator (CKA) Mirantis Kubernetes certifications Relevant security/compliance certifications are a plus. Cloud Platforms Experience with AWS HPC environments (EKS, AWS Batch, FSx for Lustre, EC2 GPU/accelerated instances). ITAR Requirements (Mandatory): This position requires access to export-controlled technical data under the International Traffic in Arms Regulations (ITAR). Therefore, candidates must meet ALL of the following: Be a U.S. Person as defined by ITAR (U.S. Citizen, U.S. National, lawful permanent resident, or protected individual). Be legally permitted to access ITAR-controlled systems and documentation. Pass enhanced background checks aligned with ITAR and organizational security policies. The pay range for this role is $130,000 - $135,000per annum including any bonuses or variable pay. Tech Mahindra also offers benefits like medical, vision, dental, life, disability insurance and paid time off (including holidays, parental leave, and sick leave, as required by law). Ask our recruiters for more details on our Benefits package. The exact offer terms will depend on the skill level, educational qualifications, experience and location of the candidate. "Tech Mahindra is an Equal Employment Opportunity employer. We promote and support a diverse workforce at all levels of the company. All qualified applicants will receive consideration for employment without regard to race, religion, color, sex, age, national origin or disability. All applicants will be evaluated solely on the basis of their ability, competence, and performance of the essential functions of their positions with or without reasonable accommodations. Reasonable accommodations also are available in the hiring process for applicants with disabilities. Candidates can request a reasonable accommodation by contacting the company ADA Coordinator at ."
Created: 2026-03-10