About the company
The Binance Exchange is a leading cryptocurrency exchange founded in 2017 in Hong Kong. It features a strong focus on altcoin trading. Binance offers crypto-to-crypto trading in more than 600 cryptocurrencies and virtual tokens, including Bitcoin (BTC), Ether (ETH), Litecoin (LTC), Dogecoin (DOGE), and its own token Binance Coin (BNB).
Job Summary
Responsibilities:
- Research and develop state-of-the-art RL algorithms, focusing on large model optimization and alignment techniques.
- Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
- Apply RL methods to enhance LLM/VLM/Agentic AI capabilities in reasoning, planning, and autonomous decision-making.
- Collaborate with engineers and researchers to integrate RL solutions into enterprise AI platforms.
- Monitor model performance in production and continuously improve it through iterative training and fine-tuning.
Requirements:
- Master's degree in Computer Science, Applied Mathematics, Machine Learning, or a related field.
- 3+ years of hands-on experience in RL or LLM/VLM/Agentic AI optimization.
- Strong coding skills in Python, with experience in ML frameworks and RL libraries.
- Experience with large-scale distributed training and optimization.
- Self-driven, with an ownership mindset and strong problem-solving skills.
- Excellent communication skills for cross-functional collaboration.
If this role isn't the perfect fit, there are plenty of exciting opportunities in blockchain technology, cryptocurrency startups, and remote crypto jobs to explore. Check them out on our Jobs Board.



