Job Description
We are seeking a reasoned engineering leader in the area of Artificial Intelligence/Machine Learning Platforms to lead the development and management of our comprehensive suite of tools and services to support the entire lifecycle of AI/ML projects. This role is critical for enabling our internal researchers to bring to bear very large-scale systems for training foundational models with flexibility and efficiency.
What you will be doing:
+ Lead the strategic direction, development, and continuous improvement of the AI/ML platform, ensuring it meets the needs of internal researchers for large-scale model training and deployment.
+ Optimize efficiency and resilience of different stages of ML workflow, including data ingestion, preprocessing, check-pointing, model training, deployment, and monitoring.
+ Lead and mentor a team of highly skilled engineers, fostering a collaborative and high-performance culture.
+ Work closely with various internal teams, including...
What you will be doing:
+ Lead the strategic direction, development, and continuous improvement of the AI/ML platform, ensuring it meets the needs of internal researchers for large-scale model training and deployment.
+ Optimize efficiency and resilience of different stages of ML workflow, including data ingestion, preprocessing, check-pointing, model training, deployment, and monitoring.
+ Lead and mentor a team of highly skilled engineers, fostering a collaborative and high-performance culture.
+ Work closely with various internal teams, including...