Job description
Key duties and responsibilities
Develop, maintain, and enhance AWS Glue-based ETL pipelines for GWAS dataset ingestion and processing using PySpark and Python.
Refactor existing data pipelines to improve efficiency, reliability, maintainability, and scalability, including implementing unit and integration tests.
Troubleshoot, debug, and resolve pipeline-related issues and bugs.
Collaborate closely with the Human Genetics team to integrate biological context and scientific rigor into data engine…
Offer details
Publication date
30.06.2025