Senior Machine Learning Engineer, Generative AI Applications
Compensation: $100/hour (Contract)
Location: Remote (US-based preferred)
A mid-size healthcare technology company driving AI innovation in health operations.
Join a mission-driven team reimagining how healthcare organizations leverage generative AI to navigate complex information ecosystems. From streamlining access to internal knowledge to parsing intricate documents, this team is building high-impact tools for real users at scale.
We’re looking for a hands-on engineer to lead applied R&D efforts in Generative AI and LLM-based applications. You'll play a key role in scoping, prototyping, and evaluating real-world use cases—including intelligent document understanding and internal knowledge retrieval tools.
What You’ll Do
- Conduct lightweight user interviews to validate needs and define user personas.
- Design and manage phased rollouts with integrated usage tracking.
- Build backend services to process documents and interface with OpenAI APIs.
- Prototype knowledge access bots using embeddings and RAG techniques.
- Develop secure front-end interfaces integrated with Microsoft 365 (MSAL/Azure AD).
- Experiment with hybrid search strategies and retrieval pipelines.
- Fine-tune prompts or models tailored to healthcare and contracting use cases.
- Build and run evaluation pipelines to track hallucination, recall, and accuracy.
- Optionally shape architectural direction for scaling successful prototypes.
What You’ll Bring
- Strong experience deploying LLM-powered applications (OpenAI, Claude, open-source).
- Advanced Python skills with LangChain, LlamaIndex, or HuggingFace.
- Experience with RAG pipelines and vector databases (e.g., Pinecone, FAISS).
- Familiarity with OCR tools and layout-aware document parsers.
- Experience integrating Microsoft 365 APIs and authentication flows (MSAL, Azure AD).
- Self-sufficiency in managing projects and adapting to feedback in agile environments.
Tech Stack
- Python (LangChain, LlamaIndex, Transformers)
- LLMs: OpenAI, Claude, open-source (LLaMA, Mistral, Phi)
- Vector DBs: Pinecone, Weaviate, FAISS
- Frontend: [TECH TBD] with Microsoft Security integration
- APIs: Microsoft Graph, MSAL, Azure AD
- Tools: RAG, embeddings, hybrid search, evaluation harnesses
Why Join?
- Contract flexibility: ~20–30 hours/week over 3–4 months
- Own end-to-end development of novel Gen AI applications
- Work on real-world problems with clear user impact
- Influence the design and scalability of future production systems
- Opportunity for follow-on work based on outcomes
About People In AI
People In AI is a specialist recruitment firm connecting top-tier talent with high-impact roles in AI, ML, and data. We work with pioneering teams solving meaningful problems—discretely, and at speed.