Senior AI Engineer (Generative AI / Google Cloud / Vertex AI)
Job Description
Job Description
Senior Gen AI EngineerJob at a Glance
- Title: Senior Gen AI Engineer
- Location: Paramus,NJ / Hybrid
- Contract: W2 only, 6-12 month contract with potential for extension or conversion to full time with either the client or CEI
- Pay: $90-95/hour + optional medical, dental, vision, 401(k) match
We are seeking a highly experienced Senior AI Engineer with deep expertise in Google AI technologies, Generative AI. The ideal candidate brings 10–15 years of broad software engineering experience, with the last 4+ years focused exclusively on Artificial Generative Intelligence, including designing, building, deploying, and monitoring production-grade AI systems. This role demands mastery of the Google ecosystem — including Google Workspace, Google Agent Development Kit (ADK), and Vertex AI — alongside a strong command of modern LLM/SLM frameworks, cloud-native infrastructure, and MLOps best practices.
Key Responsibilities- Large & Small Language Model Engineering Design, develop, and deploy Agents leveraging commercial LLMs such as Gemini (Google), GPT (OpenAI), and Claude Sonnet (Anthropic) for high-performance, large-context, and multimodal tasks. Work with open-source/self-hosted LLMs including Mixtral (Mistral AI). Architect and implement SLM-based solutions using lightweight models such as Phi-3 (Microsoft), Gemma (Google), and Mistral for resource-constrained environments. Lead fine-tuning and customization of models using Vertex AI Tuning, Hugging Face Transformers, and parameter-efficient fine-tuning (PEFT) methods including LoRA and QLoRA. Apply training frameworks such as PyTorch, TensorFlow, or JAX for model experimentation and development. Generate synthetic data and evaluate models using HELM, lm-evaluation-harness, and custom benchmarks.
- Google AI & Workspace Integration Lead the design and implementation of AI-powered solutions deeply integrated with Google Workspace (Docs, Sheets, Drive, Gmail, Meet), Big Query and Lakehouse. Architect and build intelligent agents and workflows using Google Agent Development Kit (ADK). Leverage Google AI Studio as the primary IDE, VSCode for AI application development and prototyping. Utilize Google Cloud Platform (GCP) services including: Vertex AI for ML model training, tuning, and deployment GKE (Google Kubernetes Engine) for container orchestration Cloud Run for serverless deployment Cloud Functions for event-driven AI tasks Vertex AI Vector DBs for semantic search and retrieval.
- Design & Planning Lead requirements gathering using Confluence for documentation and team collaboration. Create detailed system architecture diagrams and AI workflows using Lucidchart. Design UI/UX prototypes in Figma for AI-powered application interfaces. Manage project delivery and sprint planning using Jira. Oversee data preparation and management: cleaning, transforming, and organizing data for AI/ML workflows. Conduct data analysis using Jupyter Notebooks and pandas for exploration and preprocessing. Leverage Hugging Face Model Hub for model comparison, selection, and download.
- Development Frameworks & Tools Orchestrate LLM/SLM applications using LangChain, LlamaIndex, and LangGraph. Build multi-agent systems with Semantic Kernel, and LangGraph. Manage and optimize prompts using LangSmith and PromptLayer. Deploy models locally with Ollama or at scale with vLLM for efficient inference. Track experiments, metrics, and results with MLflow or Weights & Biases. Manage code and data versioning with Git.
- Vector Databases & Semantic Search Implement semantic search and Retrieval-Augmented Generation (RAG) pipelines using Vertex AI Vector DBs and ChromaDB. Design and optimize end-to-end RAG architectures for enterprise-grade knowledge retrieval.
- Backend Development Develop robust RESTful APIs using FastAPI (Python) or Express.js (Node.js). Manage and secure APIs using Mulesoft, Apigee.
- Frontend Development Build modern user interfaces using React or Angular. Utilize Material-UI for consistent, accessible, and modern UI components. Prototype and plan UI/UX workflows using Figma.
- Development Tools & Code Quality Write and debug code in VS Code with Python and GitHub Copilot extensions. Leverage GitHub Copilot for AI-assisted code suggestions and productivity. Manage source code with GitHub or GitLab. Enforce code quality and standards using SonarQube, ESLint, and Pylint.
- Testing & Quality Assurance Conduct LLM-specific testing using RAGAS and DeepEval for LLM/RAG pipeline evaluation. Use LangSmith Evaluators for prompt testing and hallucination detection. Write and execute unit tests using pytest. Ensure output quality and reliability using LangChain Evaluators and custom metrics.
- Deployment & Infrastructure Orchestrate containers at scale with Kubernetes (K8s), and Google GKE. Automate CI/CD pipelines using GitHub Actions or GitLab CI. Support on-premise, cloud (GCP/Vertex AI), and hybrid infrastructure deployments including edge devices for local inference.
- LLM Monitoring & Observability Monitor LLM performance and usage with LangSmith and Weights & Biases. Track and optimize AI infrastructure costs using OpenMeter and custom dashboards. Set up continuous evaluation pipelines to ensure ongoing model quality and reliability. Monitor application and model performance end-to-end with LangSmith observability tools.
- 10–15 years of overall software engineering experience.
- 5+ years of hands-on experience in Artificial Generative Intelligence, including LLMs, SLMs, RAG, and multi-agent systems.
- Deep expertise in Google AI ecosystem: Gemini, Vertex AI, Google ADK, Google AI Studio, and Google Workspace integrations.
- Proficiency in Python (primary) and familiarity with Node.js.
- Strong background in cloud-native development on GCP.
- Demonstrated experience with model fine-tuning (LoRA, QLoRA, PEFT) and model evaluation frameworks.
- Solid understanding of MLOps, CI/CD for AI systems, and production deployment best practices.
- Experience with multi-agent AI architectures using Semantic Kernel, or LangGraph.
- Google Cloud Professional certifications (Professional ML Engineer, Professional Cloud Architect).
- Contributions to open-source AI/ML projects.
- Experience with edge AI deployments and hybrid cloud-edge inference.
- Familiarity with synthetic data generation pipelines.
- Prior experience mentoring junior engineers or interns in AI/ML domains.
About CEI:
As a trusted technology partner, CEI delivers solutions that help our customers transform their business and achieve meaningful results. From strategy and custom application development through application management - our technology and digital experience services are tailored to meet each unique need of our customers. Our staffing solutions bring specialized skills to complement our customers' workforce and project requirements.
#ZR
#INDGEN
Recommended Jobs
Summer Camp Counselor 2026 - Jersey City
TURN YOUR SUMMER INTO SOMETHING EXTRAORDINARY – BECOME A SEASONAL CAMP COUNSELOR WITH KIDS IN THE GAME! Empower Staff, Inspire Campers, and Build a Summer That Matters. Kids in the Game prov…
Associate Communications Manager
This role supports consistent and engaging employee experience by partnering in the development and delivery of aligned, transparent communication across the organization. The Associate Communication…
Part-Time Bakery Assistant
Job Description Job Description Location: 631 Grove Street, Jersey City, NJ, 07310 Company: Carlo’s Bakery Job Type: Part-Time Shift: Wednesday - Friday, 3:00pm - 1:00am Pay: 16.00/hr…
Preschool Lead Teacher (Toddler & Threes)
Nature-Based Early Childhood Program | Full-Time | $20–$23/hour Are you an early childhood educator who believes children and teachers learn best through curiosity, exploration, relationships, and…
Talent Acquisition Manager
Job Description Job Description About Hand Over Hand LLC At Hand Over Hand, we believe every child deserves the opportunity to thrive. As a leading provider of Applied Behavior Analysis (ABA…
Health Care Specialist/Combat Medic
The Health Care Specialist /Combat Medic (MOS 68W) is the primary Health Care representative for the United States Army both in a clinical and a tactical environment. The Healthcare Specialist in a c…
Prompt Engineering Architect
Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cuttin…
Pharmacy Resident
Job Title: Pharmacy Resident Location: Cooperman Barnabas Medical Ctr Department Name: Pharmacy Residents Req #: 48655 Status: Salaried Shift: Day Pay Range: $61,689.06 - $61,689.06 …
Injection Molding Quality Control
Job Description Job Description Quality Control Inspector Performs inspections on all molded and incoming material. Checks bill of materials on all outgoing orders. High School Educatio…
Product Manager
Job Description Job Description Spark Car Wash is a high growth, institutionally backed express exterior car wash company serving the Northeast. Spark is growing fast and plans to open a signific…