Overview
Data Scientist (Reinforcement Learning / LLM Agent / Vision Language Model, any one of these) at Binance. Binance is a leading global blockchain ecosystem focused on security, transparency, and scalable AI-enabled products.
Responsibilities
- Research and develop state-of-the-art RL algorithms, focusing on large-model optimization and alignment techniques.
- Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
- Apply RL methods to enhance LLM / VLM / Agentic AI capabilities in reasoning, planning, and autonomous decision-making.
- Collaborate with engineers and researchers to integrate RL solutions into enterprise AI platforms.
- Monitor model performance in production and continuously improve through iterative training and fine-tuning.
Requirements
- Master’s Degree in Computer Science, Applied Mathematics, Machine Learning, or related fields.
- 5+ years of hands-on experience in RL or LLM / VLM / Agentic AI optimization.
- Strong coding skills in Python, with experience in ML frameworks and RL libraries.
- Experience with large-scale distributed training and optimization.
- Self-driven, ownership mindset, and strong problem-solving skills. Excellent communication skills for cross-functional collaboration.
Why Binance
- Shape the future with the world’s leading blockchain ecosystem
- Collaborate with world-class talent in a user-centric global organization with a flat structure
- Tackle unique, fast-paced projects with autonomy in an innovative environment
- Thrive in a results-driven workplace with opportunities for career growth and continuous learning
- Competitive salary and company benefits
- Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)
Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.
By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.