We are looking for an experienced LLMOps Engineer to design, implement, and maintain production-grade large-language-model (LLM) pipelines, deployment architectures, and monitoring systems across enterprise environments. The Senior LLMOps Engineer will play a critical role in operationalizing generative AI capabilities, ensuring that LLM-based applications are scalable, secure, reliable, and compliant with emerging AI risk and governance frameworks. This role spans the spectrum of model deployment, orchestration, evaluation, and optimization. Contributions Architect and maintain scalable LLM and RAG pipelines, including model hosting, inference optimization, retrieval layers, and context management frameworks. Lead the design and implementation of secure GenAI infrastructure across cloud environments, ensuring reliability, performance, and cost efficiency. Build and manage automated evaluation systems that assess LLM output quality, safety, latency, and adherence to AI governance requirements. Develop CI/CD workflows tailored for LLM- and GenAI-based applications, including dataset versioning, model lineage, and automated testing of prompt and model behaviors. Collaborate with AI Product Engineers and Data Scientists to productionize LLM-based prototypes into enterprise-grade, maintainable systems. Integrate vector databases, model gateways, content filters, and guardrail frameworks into end-to-end LLM solutions. Implement observability and monitoring solutions that track performance metrics, hallucination rates, cost profiles, and user interaction patterns. Lead troubleshooting and root-cause analysis for issues related to LLM deployment, inference performance, or pipeline reliability. Stay current with emerging LLM architectures, inference optimizations, fine-tuning techniques, and relevant MLSecOps patterns. Ensure compliance with data privacy, ethical AI, and AI-governance frameworks throughout pipeline design and operations. Mentor junior engineers and contribute to Steampunk’s AI engineering best practices, tooling, and reusable infrastructure patterns. You will contribute to the growth of our AI & Data Exploitation Practice!
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level