עדיין מחפשים עבודה במנועי חיפוש? הגיע הזמן להשתדרג!
במקום לעבור לבד על אלפי מודעות, Jobify מנתחת את קורות החיים שלך ומציגה לך רק משרות שבאמת מתאימות לך.
מעל 80,000 משרות • 4,000 חדשות ביום
חינם. בלי פרסומות. בלי אותיות קטנות.
RESPONSIBILITIES:
Data Pipeline Architecture and Development: Design, build, and optimize robust and scalable data pipelines to process, transform, and integrate large volumes of data from various sources into our analytics platform.
Data Quality Assurance: Implement data validation, cleansing, and enrichment techniques to ensure high-quality and consistent data across the platform.
Performance Optimization: Identify performance bottlenecks and optimize data processing and storage mechanisms to enhance overall system performance and reduce latency.
Cloud Infrastructure: Work extensively with cloud-based technologies (GCP and AWS), to design and manage scalable data infrastructure.
Collaboration: Collaborate with cross-functional teams including Data Analysts, Data Scientists, Product Managers, and Software Engineers to understand requirements and deliver solutions that meet business needs.
Data Governance: Implement and enforce data governance practices, ensuring compliance with relevant regulations and best practices related to data privacy and security.
Monitoring and Maintenance: Monitor the health and performance of data pipelines, troubleshoot issues, and ensure high availability of data infrastructure.
Mentorship: Provide technical guidance and mentorship to junior data engineers, fostering a culture of learning and growth within the team.
REQUIREMENTS:
Strong hands-on Apache Spark experience - building and operating pipelines in production, not just familiarity.
Proficiency in PySpark or Scala for Spark development.
Proven track record delivering ETL pipelines and data integration at scale.
Solid SQL skills and command of data modeling concepts.
Cloud platform experience (AWS, GCP, or Azure) in a production data context.
Comfortable working with distributed systems and big data formats (Parquet, Delta Lake).
Nice to have:
Experience with pipeline orchestration tools, particularly Apache Airflow.
Exposure to the geospatial or location analytics domain.
Familiarity with Hadoop ecosystem components.
Background in both Python and Scala (beyond Spark context).
במקום לעבור לבד על אלפי מודעות, Jobify מנתחת את קורות החיים שלך ומציגה לך רק משרות שבאמת מתאימות לך.
מעל 80,000 משרות • 4,000 חדשות ביום
חינם. בלי פרסומות. בלי אותיות קטנות.
משרות נוספות מומלצות עבורך
-
Data Engineer
-
תל אביב - יפו
Tenna Systems
-
-
Data Engineer
-
ירושלים
א.מ.ן מחשבים בע"מ
-
-
Data Engineer
-
ירושלים
קבוצת Aman
-
-
Sr. Data Engineer - Cloud Security (Hybrid, ISR)
-
תל אביב - יפו
CrowdStrike
-
-
Sr. Data Engineer, Cloud Security (Hybrid, ISR)
-
תל אביב - יפו
CrowdStrike
-
-
מהנדס נתונים Data Engineer
-
מיקום לא צוין
Consist
-
הרצליה
ערב