数据科学家(算法/工程) - Applied AI | Data Scientist(Data/Engineering)- Applied AI
字节跳动
- Beijing
- Permanent
- Full-time
1、利用自然语言处理、机器学习或计算机视觉等内容理解能力设计和构建产品核心能力,提取数据洞察并优化变现策略;
2、基于最新的深度学习、机器学习、统计和优化技术的算法开发创新性解决方案并构建业务问题原型;
3、从0到1管理数据项目,并与产品经理协作定义用户故事和成功指标来指导开发过程;
4、使用不限于AB测试等方法验证项目的商业价值和预计收益;
5、与工程团队合作部署数据模型并将解决方案规模化。We are looking for generalists and specialists in AI/ML techniques, including computer vision (CV), natural language processing (NLP), and audio signal processing. You will be responsible for partnering with a variety of stakeholders (product, operations, policy, and engineering) and developing state-of-the-art models.
1. Design and build core capabilities by leveraging content understanding; capabilities, such as natural language processing, machine learning, or computer vision, to extract insights and improve monetization strategies;
2. Develop creative solutions and build prototypes for business problems using algorithms based on the latest deep learning, machine learning, statistics, and optimization techniques;
3. Independently manage data projects from 0 to 1, and collaborate with product managers to define user stories, and success metrics to guide the development process;
4. Verify the business value and estimated revenue of the project using methods such as AB testing;
5. Collaborate with engineering teams to deploy and scale data science solutions.职位要求:1、了解统计学,机器学习和分析的基本数学基础知识;
2、至少 3 年数据分析经验,具有 ML/DL 和 CV/NLP/Speech 之一的行业经验;
3、具有探索性数据分析、统计分析和假设检验以及模型开发的经验;
4、精通 SQL、Hive、Presto 或 Spark,并具有处理大型数据集的经验;
5、熟练掌握Python和SQL,以及tensorflow、pytorch等ML/DL框架;
6、清楚地了解数据pipeline、模型开发、模型测试和部署;
7、有 CI/CD(如 git)和云服务(如 AWS/GCP/Azure)的经验者优先;
8、具备英文沟通能力强;能够以易于理解的方式向技术/非技术同事传达分析和技术内容;
9、具有求知欲以及出色的解决问题和量化能力,包括拆解问题、找出根本原因并提出解决方案。1. Knowledge of underlying mathematical fundamentals in statistics, machine learning and analytics;
2. At least 3 years experience in data modeling/analysis, with industry experience in ML/DL and one of CV/NLP/Speech;
3. Experience with exploratory data analysis, statistical analysis and hypothesis testing, and model development;
4. Fluency in SQL, Hive, Presto, or Spark and having experience working with large datasets
5. High proficiency in Python and SQL, and ML/DL frameworks such as tensorflow, pytorch;
6. Clear understanding of data pipeline, model development, model testing and deployment;
7. Experience in CI/CD such as git and cloud services such as AWS/GCP/Azure will be highly desirable;
8. Advanced English with good communications skills; able to communicate analytical and technical content in an easy to understand way to both technical and non-technical audiences;
9. Intellectual curiosity, along with excellent problem-solving and quantitative skills, including the ability to desegregate issues, identify root causes and recommend solutions.