Senior Data Engineer - Python & PySpark
Senior Data Engineer - Python & PySpark
Job Summary
We are seeking an experienced Senior Data Engineer with strong expertise in Python, PySpark, SQL, and Big Data technologies .
The ideal candidate will be responsible for designing, developing, and optimizing scalable data pipelines and ETL/ELT workflows for processing large volumes of structured and unstructured data. The role requires hands-on experience with distributed data processing, cloud platforms, orchestration tools, and performance optimization of big data applications.
️ Key Responsibilities
Data Pipeline Development
- Design, develop, and maintain scalable data pipelines using:
- Python
- Apache Spark / PySpark
- Build reusable and efficient data processing frameworks.
ETL / ELT Development
- Develop and optimize ETL/ELT workflows for:
- Data ingestion
- Data transformation
- Data processing
- Process large volumes of structured and unstructured data.
Big Data Processing
- Work with big data technologies such as:
- Hadoop ecosystem
- Hive
- Spark
- Implement distributed computing solutions for high-performance processing.
Data Modeling & Warehousing
- Support:
- Data modeling
- Data architecture
- Data warehousing solutions
- Ensure scalability and maintainability of data systems.
SQL & Database Management
- Write and optimize:
- Complex SQL queries
- Data transformation logic
- Work with:
- Relational databases
- Non-relational databases
Cloud & Orchestration
- Deploy and manage data solutions on cloud platforms such as:
- AWS
- Azure
- GCP
- Work with orchestration tools like:
- Apache Airflow
Data Quality & Governance
- Perform:
- Data validation
- Data cleansing
- Data transformation
- Ensure compliance with:
- Data governance
- Security standards
Performance Optimization
- Optimize:
- Spark jobs
- SQL queries
- Data pipelines
- Improve:
- Scalability
- Reliability
- Processing performance
Collaboration & Agile Delivery
- Collaborate with:
- Data Analysts
- Data Scientists
- DevOps teams
- Business stakeholders
- Participate in:
- Agile ceremonies
- Sprint planning
- Continuous improvement initiatives
✅ Required Skills
Programming & Data Engineering
- Python
- PySpark
- Apache Spark
- SQL
Big Data Technologies
- Hadoop ecosystem
- Hive
- Distributed computing platforms
ETL / ELT & Orchestration
- ETL / ELT pipelines
- Apache Airflow or similar orchestration tools
Cloud Platforms
- AWS / Azure / GCP
- Cloud-based data services
Databases & Data Warehousing
- Relational databases
- NoSQL databases
- Data warehousing concepts
- Data modeling
File Formats
- Parquet
- Avro
- JSON
- CSV
Soft Skills
- Strong analytical and troubleshooting skills
- Excellent communication and collaboration abilities
- Ability to work with cross-functional teams
Experience Required
- 6-10+ years of experience in:
- Data Engineering
- Big Data technologies
- Distributed data processing
Preferred Skills
- Performance tuning and optimization expertise
- Experience with scalable cloud-native data architectures
- Exposure to DevOps and CI/CD for data platforms
Recommended Jobs
Locum Trauma Physician Assistant/ Nurse Practitioner
We are hiring for a Locum Trauma Critical Care PA to join our Team at our New Level 1 Trauma Center in NJ! This is a full time need to extend through the end of 2024 with a great possibility to exten…
MO-10-29-Project Manager- Change Management 779890
~ 3 days a week. One Penn Plaza East Newark, NJ 07105 Our direct client has an opening for Project Manager- Change Management 779890 This position is for 6 months, with the option of extension, and…
LCSW/LPC
Want to join an awesome team of clinicians? Do you prefer to focus on the clinical work as a therapist and not be bothered with billing, marketing, and other admin responsibilities? Are you looking f…
Bakery Assistant
Job Description Edit ( Seeking a full time Bakery Assistant who is willing to learn and adapt to our amazing culture. Our Bakery assistant will integrate into our team to help facilitate daily pre…
Senior Workers Compensation Claims Adjuster - DE, PA, NJ
At Gallagher Bassett, we're there when it matters most because helping people through challenging moments is more than just our job, it’s our purpose. Every day, we help clients navigate complex…
Assistant Project Manager
Haddad Plumbing and Heating Inc. is seeking an Assistant Project Manager to join our project operations team based in Newark, New Jersey. The Assistant Project Manager will support the planning, coord…
Survey Crew Chief
Job Details: Survey Crew Chief – Central New Jersey A well-established civil engineering and land development firm in Central NJ is seeking an experienced Survey Crew Chief to lead field oper…
Sales Representative - Uniform
Requisition Number: 220228 Job Description Are you ready to launch your career in sales and be another reason Cintas, a growing Fortune 500 company, has been named to both FORTUNE’s World’s Mo…
Division Director of General Surgery — Vision 2030
Southeastern New Jersey Join AtlantiCare as the Division Director of General Surgery — Vision 2030 AtlantiCare is dedicated to revolutionizing healthcare by 2030 through innovation, excellence, …