AI Optimization Engineer - ONSITE

Simple Solutions

Jersey City, NJ

Job Description Summary – AI Optimization Engineer (Onsite, Jersey City, NJ)

We are seeking an experienced AI Optimization Engineer to support large-scale AI/ML and Generative AI workloads for an enterprise environment. This role focuses on optimizing, deploying, and managing machine learning and large language models (LLMs) on GPU-accelerated HPC infrastructure. The ideal candidate will have strong experience in Python-based machine learning, deep learning frameworks, model optimization techniques, and scalable AI infrastructure.

The engineer will work closely with AI, infrastructure, and DevOps teams to design efficient model training and inference pipelines, implement SLURM-based workload orchestration, and deploy containerized ML solutions in production environments. Responsibilities include optimizing model performance using techniques such as pruning, quantization, and knowledge distillation, managing inference workflows using Triton Inference Server, and monitoring system performance using Prometheus and Grafana.

This role requires hands-on experience with HPC environments, GPU clusters, containerization technologies, and Linux system administration, along with strong knowledge of machine learning algorithms, deep learning architectures, and modern AI development tools. Experience with cloud platforms, vector embeddings, and enterprise-scale AI deployments is highly preferred.

Core Responsibilities

Design and optimize AI/ML workloads on GPU-based HPC clusters.
Deploy and manage large language models (LLMs) in scalable production environments.
Implement model optimization techniques including pruning, quantization, and knowledge distillation.
Develop and manage automated job scheduling using SLURM with REST and Flask APIs.
Deploy ML models using containerized microservices architectures.
Monitor system performance using Prometheus and Grafana.
Optimize inference pipelines using Triton Inference Server and TRTLLM.
Conduct exploratory data analysis and model performance evaluation.
Collaborate with infrastructure and ML teams to improve scalability and efficiency.

Skills Required

The AI Optimization Engineer must have strong experience in Python-based machine learning and deep learning , including NumPy, scikit-learn, TensorFlow, PyTorch, and Keras, with hands-on knowledge of supervised and unsupervised learning, neural networks, transformer-based models, NLP, CNNs, and Generative AI concepts. The role requires expertise in AI infrastructure and optimization , including HPC environments, GPU clusters, SLURM workload management, Triton Inference Server, TRTLLM, and model optimization techniques such as pruning, quantization, and distillation for scalable LLM deployment.

Candidates should also have experience with DevOps and deployment tools such as Docker, Kubernetes, MLFlow, Terraform, Jenkins, GitHub, and HuggingFace, along with strong skills in performance monitoring using Prometheus and Grafana. Additional requirements include Flask API development, Linux administration (RHEL/CentOS), container runtimes like Enroot, Pyxis, and Podman, and experience with data analysis and visualization tools such as Plotly, Seaborn, and Matplotlib.

Posted 2026-02-14

Recommended Jobs

INTERN- Food & Beverage

Ocean Casino Resort

Atlantic City, NJ

The F&B Intern is responsible for overseeing various outlets and processes daily and making sure all areas are operating efficiently. Position Responsibilities Provide exceptional, …

View Details

Posted 2025-03-07

Registered Dietician

Autumn Lake Healthcare

Union, NJ

Join our wonderful team as a Registered Dietitian today! Cornell Care & Rehabilitation Center managed by Autumn Lake Healthcare at Union is an exceptional team-oriented company hiring for …

View Details

Posted 2026-02-13

LHI - Optometrist Newark, DE (Tom)

Newark, NJ

TGB3 is seeking to contract an Optometrist or Ophthalmologist to perform Compensation & Pension Exams (C&P) for our military Veterans at various sites throughout the USA. Length: 6-12 months (opti…

View Details

Posted 2025-12-08

Bartender

Echo Lake Country Club

Westfield, NJ

Echo Lake Country Club is seeking polished, professional bartenders to provide exceptional beverage service in a refined private club environment. Bartenders must balance speed, accuracy, and hospita…

View Details

Posted 2026-01-21

Remote Recruiter (Freelance)

RecXchange

Jersey City, NJ

Remote Recruiter - No Experience Needed Location: Remote (Worldwide) Type: Freelance / Work Your Own Way Overview: RecXchange is a global platform where anyone can earn money by helpin…

View Details

Posted 2025-12-29

Assembler

EVS Broadcast Equipment

Allendale, NJ

Globally recognized as the leader in live video technology for broadcast and new media productions, our passion, and purpose are to help our customers craft compelling stories that trigger the highest…

View Details

Posted 2026-01-23

Dishwasher/Utility

Wayne, NJ

$16.90 per hour - $21.00 per hour Our Dishwasher/Utility Team Members are the soul of our kitchens. They keep us running like a well-oiled machine, ensuring that we always have clean, sani…

View Details

Posted 2026-01-06

Mental Health Worker

355 Grand Street

Jersey City, NJ

Job Title: Mental Health Worker Location: Jersey City Medical Center Department Name: Mental Health Unit-Open Unit Req #: 0000229123 Status: Hourly Shift: Evening Pay Range: $22.34 -…

View Details

Posted 2025-12-19

Azure PaaS Solution Architect

Akkodis

Montvale, NJ

Akkodis is seeking an Azure PaaS Solution Architect for a Contract with a client in Montvale, NJ. This role focuses on architecting secure, scalable Azure PaaS and IaaS solutions across modern …

View Details

Posted 2026-02-18

APPLICATIONS ANALYST II - EPIC GRAND CENTRAL

Cooper University Health Care

Camden, NJ

About Us At Cooper University Health Care , our commitment to providing extraordinary health care begins with our team. Our extraordinary professionals are continuously discovering clinical inno…

View Details

Posted 2026-01-25