Data Solutions Engineer will be responsible for implementing systems and procedures for effective data semantics management, ensuring data is accurately categorized and easily discoverable.
">
* Data Governance: Ensure that data is accurately categorized and easily discoverable through effective data governance practices.
* Automated Pipelines: Develop and maintain automated data pipelines that ensure efficient data flow and processing from multiple sources to our lakehouse architecture.
* Materialized Views: Create strategies and systems for optimal generation of materialized views and data subsumption to ensure that our data architecture remains cutting-edge, minimizes redundancy, and achieves required level of performance.
Requirements
* Extensive experience with Apache Kafka, Apache Flink, and other relevant streaming technologies.
* Deep understanding of lakehouse architecture and its implementation in large-scale environments.
* Strong knowledge of data semantics, discovery processes, and data governance.