Senior Data Engineer Real-Time & Distributed Systems (GCP)
Job description
Who we are:
Innodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are the AI technology solutions provider-of-choice to 4 out of 5 of the world’s biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine.
By combining advanced machine learning and artificial intelligence (ML/AI) technologies, a global workforce of subject matter experts, and a high-security infrastructure, we’re helping usher in the promise of clean and optimized digital data to all industries. Innodata offers a powerful combination of both digital data solutions and easy-to-use, high-quality platforms.
Our global workforce includes over 3,000 employees in the United States, Canada, United Kingdom, the Philippines, India, Sri Lanka, Israel and Germany. We’re poised for a period of explosive growth over the next few years.
Key Responsibilities
Design, build, and optimize scalable data pipelines for batch and real-time processing
Develop and maintain event-driven architectures for high-throughput systems
Ensure data reliability, performance, and low-latency processing across distributed environments
Collaborate with data scientists and application teams to enable analytics and AI use cases
Implement best practices in performance tuning, monitoring, and cost optimization
Job requirements
Advanced proficiency in Python for backend and large-scale data processing
Strong experience building and managing big data pipelines in production environments
Hands-on expertise with workflow orchestration tools such as Airflow or Google Cloud Composer
Proven experience in batch and streaming data processing using:
Apache Spark
Apache Beam (Dataflow)
Experience designing and operating event-driven systems using Pub/Sub
Strong understanding of distributed systems architecture and scalability patterns
Experience managing globally distributed, low-latency datasets
Hands-on experience with NoSQL databases and/or Google Cloud Spanner
Strong knowledge of system reliability, fault tolerance, and performance optimization
Preferred Skills
Proficiency in Go, Java, or Scala
Experience with Kafka or Flume for streaming ingestion
Deep familiarity with the Google Cloud Platform ecosystem
Experience with production monitoring, logging, and observability frameworks
Exposure to high-availability, multi-region deployments
Please be aware of recruitment scams involving individuals or organizations falsely claiming to represent employers. Innodata will never ask for payment, banking details, or sensitive personal information during the application process. To learn more on how to recognize job scams, please visit the Federal Trade Commission’s guide at
If you believe you’ve been targeted by a recruitment scam, please report it to Innodata at [email protected] and consider reporting it to the FTC at ReportFraud.ftc.gov .
#LI-NS1
All done!
Your application has been successfully submitted!
Recommended Jobs
Retail Support Specialist
Join Our Team! At DSI, we have over 40 years of sales enablement and customized business solution experience, providing enhanced value that delivers results for our clients and partners. We're on t…
Nurse Practitioner
Job Title: Nurse Practitioner Location: Barnabas Health Medical Group Department Name: Primary Care at Bell Works Req #: 0000227302 Status: Salaried Shift: Day Pay Range: $89,000.00 …
Call Center Support
Job Details: Call Center Support Are you passionate about delivering exceptional customer service? Join our dynamic team and play a vital role in solving challenges and ensuring seamless experi…
Class A Loader Operator
Mazza Recycling is a family-owned leader in sustainable waste solutions, operating one of New Jersey’s most advanced recycling facilities. As a Class A Loader Operator, you will play a vital role in …
Registered Nurse (RN) Mental Health Unit-Open Unit Full Time Night
Job Title: RN Location: Jersey City Medical Center Department Name: Mental Health Unit-Open Unit Req #: 0000231703 Status: Hourly Shift: Night Pay Range: $46.47 - $64.93 per hour …
Sales Account Manager, Catalog Products
About GenScript GenScript Biotech Corporation (Stock Code: 1548.HK) is a global biotechnology group. Founded in 2002, GenScript has an established global presence across North America, Europe, the Gre…
Land Surveyor - Project Manager
Job Details: Land Surveyor – Project Manager Professional Land Surveying- Civil Engineering- Land Development Our client in the local area is actively seeking to add an experienced Land Surv…
Project Manager - Automated Logic (30189969)
At Carrier we make modern life possible by delivering groundbreaking systems and services that help buildings, homes and the cold chain become more healthy, safe, sustainable, and intelligent. Our gl…
NJ Family Law Attorney (Contract)
About Us First is creating the modern legal ally for consumers. As a team, we exist to create digital products that help people craft sound, efficient, and cost-effective agreements for every stag…
Supply Chain Coordinator
Position Details Title: Supply Chain Coordinator Location: Mt. Laurel, NJ Employment Type: Full Time in Person, 8am-5pm We are currently seeking a Supply Chain Coordinator to support and help mana…