Job Description
Purpose of the role
Build and maintain all data pipelines feeding the Databricks lakehouse, ensuring clean, timely, and governed data flows from every source across the Investment Division’s five portfolio clusters and, in later phases, the Group’s operating divisions. Own the bronze-to-silver-to-gold transformation logic, data quality monitoring, security master management, and the Document Intelligence Engine’s ingestion and classification pipelines.
Key responsibilities
- Build and maintain automated ingestion pipelines: Addepar API (daily positions, transactions, cash), Capital IQ API (prices, fundamentals, consensus), Canoe (alternative fund documents), eVestment/Mercer (manager database), and email (Outlook via the Microsoft Graph API).
- Implement the medallion architecture: bronze layer (raw immutable data), silver layer (cleaned, conformed, security master mapped, FX normalised, GICS classified, e...