Data Engineer
Purple Drive - Bentonville, AR
Job Description
About the Role:
We're seeking a skilled Data Engineer to join our Bentonville team for a 6-month project focused on building and maintaining large-scale data processing systems. You'll be working with modern big data technologies, cloud platforms, and stream-processing systems to support our data infrastructure and analytics initiatives.

What You'll Do:
- Design and develop scalable data pipelines using Java, Python, and Scala
- Build and maintain APIs for data access and integration
- Work with big data technologies including Hadoop, Hive, and Spark (Scala-based)
- Implement and manage data processing workflows using Apache Airflow
- Deploy and manage containerized applications using Kubernetes
- Develop stream-processing solutions using Apache Storm and Spark Streaming
- Work with cloud platforms and data lake architectures
- Optimize data processing performance and troubleshoot pipeline issues
- Collaborate with data science and analytics teams on data requirements

What We're Looking For:
- Strong hands-on experience with Java, Python, and Scala programming languages
- Proficiency in API development and RESTful services
- Experience with SQL and database query optimization (GQL knowledge preferred)
- Solid background with big data technologies: Hadoop, Hive, Apache Spark
- Knowledge of workflow orchestration tools like Apache Airflow (Luigi experience a plus)
- Experience with containerization and Kubernetes deployment
- Understanding of cloud platforms and data lake concepts
- Hands-on experience with stream-processing systems (Apache Storm, Spark Streaming)
- Knowledge of distributed computing and data processing best practices
- Strong problem-solving and analytical skills

Preferred Qualifications:
- Experience with Vertex AI and machine learning pipelines
- Knowledge of Presto/Trino for distributed SQL queries
- Familiarity with Node.js for backend development
- Understanding of data governance and security practices
- Experience with monitoring and logging in distributed systems
Created: 2026-03-04