ABOUT THIS JOB In your role you will be part of a development team responsible for developing, improving and maintaining our identity graph, which deterministically and probabilistically groups media consumption devices into persons, and persons into households. This data structure powers our marketing activation business, and helps our clients convey their advertising messages in the device- person- or household- level. The construction of the identity graph involves Apache Airflow-based data pipelines, triggering AWS EMR-based Scala-developed Apache Spark jobs handling tens of TBs of data.
Technical skills:
Software Engineering At least 5 years of hands-on experience in server-side development working on several complex data projects
Hands-on development experience with Apache Spark in Scala
Hands-on development experience with Apache Airflow pipelines in Python