Senior Data Engineer - Python & PySpark

Purple Drive
Jersey City, NJ

Job Summary

We are seeking an experienced Senior Data Engineer with strong expertise in Python, PySpark, SQL, and Big Data technologies.

The ideal candidate will be responsible for designing, developing, and optimizing scalable data pipelines and ETL/ELT workflows for processing large volumes of structured and unstructured data. The role requires hands-on experience with distributed data processing, cloud platforms, orchestration tools, and performance optimization of big data applications.

Key Responsibilities

Data Pipeline Development

  • Design, develop, and maintain scalable data pipelines using:
    • Python
    • Apache Spark / PySpark
  • Build reusable and efficient data processing frameworks.

ETL / ELT Development

  • Develop and optimize ETL/ELT workflows for:
    • Data ingestion
    • Data transformation
    • Data processing
  • Process large volumes of structured and unstructured data.

Big Data Processing

  • Work with big data technologies such as:
    • Hadoop ecosystem
    • Hive
    • Spark
  • Implement distributed computing solutions for high-performance processing.

Data Modeling & Warehousing

  • Support:
    • Data modeling
    • Data architecture
    • Data warehousing solutions
  • Ensure scalability and maintainability of data systems.

SQL & Database Management

  • Write and optimize:
    • Complex SQL queries
    • Data transformation logic
  • Work with:
    • Relational databases
    • Non-relational databases

Cloud & Orchestration

  • Deploy and manage data solutions on cloud platforms such as:
    • AWS
    • Azure
    • GCP
  • Work with orchestration tools like:
    • Apache Airflow

Data Quality & Governance

  • Perform:
    • Data validation
    • Data cleansing
    • Data transformation
  • Ensure compliance with:
    • Data governance
    • Security standards

Performance Optimization

  • Optimize:
    • Spark jobs
    • SQL queries
    • Data pipelines
  • Improve:
    • Scalability
    • Reliability
    • Processing performance

Collaboration & Agile Delivery

  • Collaborate with:
    • Data Analysts
    • Data Scientists
    • DevOps teams
    • Business stakeholders
  • Participate in:
    • Agile ceremonies
    • Sprint planning
    • Continuous improvement initiatives

✅ Required Skills

Programming & Data Engineering

  • Python
  • PySpark
  • Apache Spark
  • SQL

Big Data Technologies

  • Hadoop ecosystem
  • Hive
  • Distributed computing platforms

ETL / ELT & Orchestration

  • ETL / ELT pipelines
  • Apache Airflow or similar orchestration tools

Cloud Platforms

  • AWS / Azure / GCP
  • Cloud-based data services

Databases & Data Warehousing

  • Relational databases
  • NoSQL databases
  • Data warehousing concepts
  • Data modeling

File Formats

  • Parquet
  • Avro
  • JSON
  • CSV

Soft Skills

  • Strong analytical and troubleshooting skills
  • Excellent communication and collaboration abilities
  • Ability to work with cross-functional teams

Experience Required

  • 6-10+ years of experience in:
    • Data Engineering
    • Big Data technologies
    • Distributed data processing

Preferred Skills

  • Performance tuning and optimization expertise
  • Experience with scalable cloud-native data architectures
  • Exposure to DevOps and CI/CD for data platforms

Posted 2026-05-15