Job Description
Mô tả công việc: JOB DESCRIPTION
- Build and manage the data asset using some of the most scalable and resilient open source big data technologies like Airflow, Spark, Kafka, etc
- Build and manage a highly scalable, efficient Data and ML Infrastructure by adopting microservices driven design and architecture with proper DevOps principles and practices
- Design and deliver the next-gen data lifecycle management suite of tools/frameworks, including ingestion and consumption on the top of the data lake to support real-time as well as batch use cases
- Help the team in integrating various data sources across GFGs group vertical
- Build and expose metadata catalog for the Data Lake for easy exploration, profiling as well as lineage requirements
- At least 3+ years of relevant experience in developing scalable, secured, fault tolerant, resilient mission-critical Big Data platform.
- Ab...