Should have extensive hands-on experience with StreamSets Data Collector and Transformer. The candidate must be able to create and configure pipelines and jobs that read from sources such as Kafka, XML, and Oracle DB, and must know how to orchestrate event-driven pipelines. The pipelines should load data into a range of relational and non-relational databases.
- Extensive knowledge of both streaming and batch processing in StreamSets.
- Should be familiar with using PySpark and Scala code in StreamSets, with deeper knowledge of either PySpark or Scala.
- Clear understanding of ETL concepts; hands-on ETL experience is mandatory.
- Very good communication skills.