Lead Data Scientist
Grab
- Beijing
- Permanent
- Full-time
- Conduct thorough analyses of time durations among major events of orders by examining historical data and design effective feature engineering methods accordingly.
- Develop and improve data processing ETL pipelines to efficiently support data analysis, model training and model serving works.
- Develop accurate, robust, and cost-efficient machine learning models for the prediction of time durations among major events of orders.
- Create technical documents outlining the methodologies and findings, and communicate solutions to business stakeholders in a clear, non-technical manner. This ensures that stakeholders have a comprehensive understanding of the models and services being developed.
- Take charge of project management, collaborate closely with the technical teams to deploy the machine learning models in a production environment, ensuring their smooth integration into the system.
- Communicate and collaborate with country teams to implement and roll out the model services effectively. This involves coordinating efforts, addressing any concerns or challenges, and ensuring consistent execution across different markets.
- At least Master Degree in computer science, statistics, mathematics, operation research, economy, physics, software engineering or related fields.
- 3+ years of experience in one or more of the following areas: statistical modeling, generic machine learning, deep learning, and causal inference.
- Experience translating complex business problems into ML/AI formulations.
- Proficiency in Python, Tensorflow/PyTorch, SQL, Spark. Experience in writing efficient SQL query and readable, maintainable and testable codes.
- Experience developing production quality Pipelines to automate the model tuning and deployment.
- Excellent communication skills to manage the stakeholders.
- Proficiency in English writing and speaking skills.
- PhD degree in computer science, statistics, mathematics or related field.
- Candidates with expertise in streaming data using frameworks like Apache Flink will have an added advantage.
- Experience with distributed systems and cloud services such as AWS, GCP, and Azure is highly valued.
- The ability to autonomously explore new ideas and learn new skills to accomplish tasks is a crucial attribute we seek in potential candidates.