Job Description
Why this role exists
Our data pipelines today run offline - someone kicks off a script, pulls a CSV, cleans it, and the insight is already stale by the time it reaches the team. We're moving that whole workflow to the cloud so our operational data (CRM activity, lesson bookings, student engagement) stays live and queryable without human intervention. You'll help build that foundation from the ground up.
What you’ll actually do- Build automated ingestion from our source systems - starting with Trengo (our CRM), then expanding to Google Sheets, Telegram bot events, and internal Postgres databases.
- Design and implement ETL pipelines on AWS (Lambda, Glue, Step Functions, or similar — we're open on tooling) that run on a schedule and land clean data in a data lake (S3 + Athena, or equivalent).
- Model the data so it's analyst-friendly - think staging → transformed → mart layers, not raw dumps.
- Figure out what to put on dashboards, then bu...