AI Data Scientistu2013 RAG, SLM & Distributed Data (...
Insight Global - Hartford, CT
Apply NowJob Description
Job Description We are looking for a mid-level AI Engineer with hands-on experience in Retrieval-Augmented Generation (RAG)systems, Small Language Models (SLMs), and distributed databases such as Google Cloud Spanner. You will work closely with senior engineers and product teams to build scalable AI systems that integrate retrieval pipelines, language models, and distributed transactional infrastructure. This role is ideal for someone who has already built AI features in production and wants to deepen their expertise in applied GenAI systems. Contract: Through the end of the year What Youu2019ll Be Working On u2022u2003Production RAG features. u2022u2003Distributed knowledge storage backed by Spanner. u2022u2003AI-powered APIs and services. u2022u2003Retrieval optimization and evaluation. u2022u2003Model cost/latency optimization. ________________________________________ Technical Skills Snapshot Categoryu2003Skills AIu2003RAG pipelines, Embeddings, Prompt engineering Modelsu2003SLM/LLM integration Databaseu2003Spanner schema design, SQL optimization Backendu2003Python, APIs Cloudu2003GCP We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: Skills and Requirements u2022u20033u20135 years of software engineering experience. u2022u20031u20132 years working with LLM or RAG-based systems. u2022u2003Strong proficiency in Python. u2022u2003Experience with: ou2003Embedding models and vector search ou2003LangChain, LlamaIndex, or similar frameworks ou2003API development (FastAPI/Flask) u2022u2003Experience working with Google Cloud Spanner or similar distributed SQL databases. u2022u2003Solid understanding of distributed systems fundamentals. u2022u2003Comfortable working in cloud environments (GCP preferred). u2022u2003Experience fine-tuning or quantizing small language models. u2022u2003Familiarity with evaluation metrics for retrieval systems (Recall@K, etc.). u2022u2003Knowledge of: ou2003Vertex AI ou2003Pub/Sub ou2003Dataflow u2022u2003Experience optimizing AI inference for cost and latency. u2022u2003Exposure to CI/CD pipelines.
Created: 2026-03-02