AI Operations Platform Consultant
Job Description:
- Brings extensive experience operating large-scale GPU-accelerated AI platforms, deploying and managing LLM inference systems on Kubernetes with strong expertise in Triton Inference Server and TensorRT-LLM.
- They have repeatedly built and optimized production-grade LLM pipelines with GPU-aware scheduling, load balancing, and real-time performance tuning across multi-node clusters. Their background includes designing containerized microservices, implementing robust deployment workflows, and maintaining operational reliability in mission-critical environments.
- They have led end-to-end LLMOps processes involving model versioning, engine builds, automated rollouts, and secure runtime controls.
- The candidate has also developed comprehensive observability for inference systems, using telemetry and custom dashboards to track GPU health, latency, throughput, and service availability.
- Their work consistently incorporates advanced optimization methods such as mixed precision, quantization, sharding, and batching to improve efficiency. Overall, they bring a strong blend of platform engineering, AI infrastructure, and hands-on operational experience running high-performance LLM systems in production
Basic Info:
- AI Operations Platform Consultant
- Experience deploying, managing, operating, and troubleshooting containerized services at scale on Kubernetes for mission-critical applications (OpenShift)
- Experience with deploying, configuring, and tuning LLMs using TensorRT-LLM and Triton Inference server.
- Managing MLOps/LLMOps pipelines, using TensorRT-LLM and Triton Inference server to deploy inference services in production
- Setup and operation of AI inference service monitoring for performance and availability.
- Experience deploying and troubleshooting LLM models on a containerized platform, monitoring, load balancing, etc.
- Operation and support of MLOps/LLMOps pipelines, using TensorRT-LLM and Triton Inference server to deploy inference services in production
- Experience deploying and troubleshooting LLM models on a containerized platform, monitoring, load balancing, etc.
- Experience with standard processes for operation of a mission critical system – incident management, change management, event management, etc.
- Managing scalable infrastructure for deploying and managing LLMs
- Deploying models in production environments, including containerization, microservices, and API design
- Triton Inference Server, including its architecture, configuration, and deployment.
- Model Optimization techniques using Triton with TRTLLM
- Model optimization techniques, including pruning, quantization, and knowledge distillation
Recommended Jobs
Pest Control SENIOR Technician
Job Description Job Description PEST CONTROL TECHNICIAN MONDAY-FRIDAY 8AM-5PM NO WEEKENDS SERVING BERGEN COUNTY RESIDENTIAL HOMES ONLY. Company Description We are the fastest growi…
Risk & Compliance Senior Director Consulting Practice
Req ID: 356626 NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organiza…
Dishwasher
Job Description Job Description Benefits: ~ Opportunity for advancement ~401(k) ~ Paid time off Benefits/Perks Flexible Scheduling Competitive Compensation Career Advancement O…
Neurophysiology Tech Spec (NR)
Work Shift: Capital Health is the region's leader in providing progressive, quality patient care with significant investments in our exceptional physicians, nurses and staff, as well as advance te…
Graduate Nurse & Registered Nurse Internship Opportunities - Warren Campus, NJ (Full Time)
St. Luke's is proud of the skills, experience and compassion of its employees. The employees of St. Luke's are our most valuable asset! Individually and together, our employees are dedicated to satis…
Director of AI
Founded in 2009, EDETEK is a global leader in clinical technology and services, delivering high-quality, AI-powered platforms and solutions to pharmaceutical, biotechnology, and medical device compan…
Server
Job Description Job Description Border Cafe in Woodbridge, NJ is looking for a qualified Server to join our growing team to meet the demands of our fast past, fully operational, full service r…
Nurse Practitioner/ Physicians Assistant
Job Description Job Description Kids Care Pediatrics is a thriving, community focused pediatric practice seeking a Nurse Practitioner/Physicians Assistant. Under the supervision of a physician,…
RN
Job Title: RN Location: Barnabas Health Medical Group Department Name: Comprehensive Breast Health Ct Req #: 0000229723 Status: Hourly Shift: Day Pay Range: $39.11 - $50.00 per hour …
Registered Dietitian Health Care Facility Surveyor
Registered Dietitian Health Care Facility Surveyor - New Jersey (#1317) Medical, Dental, and Vision insurance Flexible Spending Account Paid Time Off, Retirement Savings Commuter Benefits …