Vertex AI Platform Engineer
Diverse Lynx - Alpharetta, GA
Apply NowJob Description
Vertex AI Platform Engineer Maintain and optimize the operational stability and performance of the Vertex AI environment Monitor health and performance of Vertex AI services including notebooks, pipelines, endpoints, and managed instances Troubleshoot and resolve issues related to scheduling, testing, and configuration Collaborate with DevOps teams to implement automated deployment and testing processes Ensure new Vertex AI features (e.g., GenAI) are properly configured and integrated Triage and resolve support tickets related to Vertex AI platform issues Perform root cause analysis to identify and prevent future problems Develop and maintain documentation on incident resolution procedures Investigate and address performance bottlenecks in the Vertex AI environment Implement monitoring and alerting systems to proactively identify potential issues Collaborate with AI/ML teams to optimize resource utilization and cost efficiency Stay up-to-date on new Vertex AI features and releases Plan and execute platform upgrades and enhancements Work with AI/ML teams to assess the impact of new features on existing workflows Strong knowledge of Vertex AI and its components Experience with containerization and orchestration technologies such as Docker and Kubernetes Familiarity with LLMs and anomaly detection techniques Proficiency in Python and other scripting languages Experience with cloud monitoring and logging tools like Stackdriver Familiarity with DevOps practices and tools including CI/CD pipelines Ability to quickly diagnose and resolve complex technical issues Strong analytical and troubleshooting skills Proactive approach to identifying and preventing potential problems Ability to effectively communicate technical concepts to both technical and non-technical stakeholders Excellent written and verbal communication skills Ability to collaborate effectively with cross-functional teams Experience with machine learning frameworks such as TensorFlow or PyTorch Knowledge of data engineering and pipeline management Understanding of security best practices for AI/ML platforms Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.
Created: 2026-03-10