AI Optimization Engineer - ONSITE
Job Description Summary – AI Optimization Engineer (Onsite, Jersey City, NJ)
We are seeking an experienced AI Optimization Engineer to support large-scale AI/ML and Generative AI workloads for an enterprise environment. This role focuses on optimizing, deploying, and managing machine learning and large language models (LLMs) on GPU-accelerated HPC infrastructure. The ideal candidate will have strong experience in Python-based machine learning, deep learning frameworks, model optimization techniques, and scalable AI infrastructure.
The engineer will work closely with AI, infrastructure, and DevOps teams to design efficient model training and inference pipelines, implement SLURM-based workload orchestration, and deploy containerized ML solutions in production environments. Responsibilities include optimizing model performance using techniques such as pruning, quantization, and knowledge distillation, managing inference workflows using Triton Inference Server, and monitoring system performance using Prometheus and Grafana.
This role requires hands-on experience with HPC environments, GPU clusters, containerization technologies, and Linux system administration, along with strong knowledge of machine learning algorithms, deep learning architectures, and modern AI development tools. Experience with cloud platforms, vector embeddings, and enterprise-scale AI deployments is highly preferred.
Core Responsibilities
Design and optimize AI/ML workloads on GPU-based HPC clusters.
Deploy and manage large language models (LLMs) in scalable production environments.
Implement model optimization techniques including pruning, quantization, and knowledge distillation.
Develop and manage automated job scheduling using SLURM with REST and Flask APIs.
Deploy ML models using containerized microservices architectures.
Monitor system performance using Prometheus and Grafana.
Optimize inference pipelines using Triton Inference Server and TensorRT-LLM (TRT-LLM).
Conduct exploratory data analysis and model performance evaluation.
Collaborate with infrastructure and ML teams to improve scalability and efficiency.
Skills Required
The AI Optimization Engineer must have strong experience in Python-based machine learning and deep learning, including NumPy, scikit-learn, TensorFlow, PyTorch, and Keras, with hands-on knowledge of supervised and unsupervised learning, neural networks, transformer-based models, NLP, CNNs, and Generative AI concepts. The role requires expertise in AI infrastructure and optimization, including HPC environments, GPU clusters, SLURM workload management, Triton Inference Server, TensorRT-LLM (TRT-LLM), and model optimization techniques such as pruning, quantization, and distillation for scalable LLM deployment.
Candidates should also have experience with DevOps and deployment tools such as Docker, Kubernetes, MLflow, Terraform, Jenkins, GitHub, and Hugging Face, along with strong skills in performance monitoring using Prometheus and Grafana. Additional requirements include Flask API development, Linux administration (RHEL/CentOS), container tooling such as Enroot, Pyxis, and Podman, and experience with data analysis and visualization tools such as Plotly, Seaborn, and Matplotlib.