WhatsApp
已收藏的职位
3453 份职位空缺
排序:
相关性|
日期
LLM Engineer (Data and Optimization)
TCL Corporate Research (Hong Kong) Co., Limited (沙田)
2天前
I. T. Manager
Tai Cheung Properties Limited (香港岛)
2天前
Developer/Programmer
Skyline Capital Asia Limited (赤腊角)
2天前
IT Infrastructure Manager (Ref.: CV_ITD_ITM_SC_202505)
Shanghai Pudong Development Bank Co., Ltd., Hong Kong Branch (湾仔)
2天前
Data Analyst (Fresh Graduate Welcome)
SF Supply Chain (Hong Kong) Limited (青衣)
2天前
IT Manager / Asst Manager
Quon Hing Concrete Company Limited (长沙湾)
2天前
Information Technology Manager (Ref: BOD/ITM)
Pok Oi Hospital (不指定)
2天前
IT Support Officer (Project-Based)
Plan International Hong Kong (牛头角)
2天前
Analyst Programmer (MOOV)
PCCW Media Group (九龙湾)
2天前
IT Support Officer / Assistant
PBP Limited (尖沙咀)
2天前
Junior AI Developer (STEM Internship)
Pantheon Lab Limited (不指定)
2天前
Full Stack Developer
NextGen Solutions Limited (太古)
$35,000-$50,000/月2天前
AI & Data Engineer
Nexi AI Limited (湾仔)
$30,000-$45,000/月2天前
Senior Programmer (Server)
Mad Head App Limited (沙田)
2天前
Regional Application Developer (1-year contract)
Lockton Companies (Hong Kong) Ltd (鲗鱼涌)
2天前
Business Analyst (IT / Project)
Lion Rock Group Limited (观塘)
2天前
Program Manager
Landis & Gyr Pty Ltd (长沙湾)
2天前
Summer Intern
Laboratory for AI-Powered Financial Technologies Limited (沙田)
2天前
已建立笋工提示!有新职缺时我们会即时通知你
立即应征

LLM Engineer (Data and Optimization)


沙田
0年工作经验

职位描述

Job Description
As a Large Model Algorithm Engineer, you will focus on the technical research and development of Large Language Models (LLMs) and multimodal large models, driving their application in industrial vertical domains. You will participate in core processes such as model training, optimization, and inference deployment while collaborating with top university research teams to explore cutting-edge technologies and improve model performance and efficiency.

Key Responsibilities

  • Responsible for training, fine-tuning (SFT), and system deployment of vertical domain large models, promoting efficient application of large models in industrial environments.
  • Research and implement compression and optimization techniques for large models, including pruning, quantization, and knowledge distillation, to improve inference efficiency and deployment performance.
  • Participate in the algorithm design and development of RAG (Retrieval-Augmented Generation) and Agent modules to enhance reasoning capabilities in dynamic and complex environments.
  • Research and apply multimodal understanding technologies to optimize the application of Large Vision Models (LVM) in industrial vision and other fields.
  • Translate business rules into efficient workflow code and participate in the design and implementation of Agentic Workflow to enhance workflow intelligence.
  • Build industry datasets to support large model training and applications, including data preprocessing, pretraining data construction, and training/application/evaluation dataset setup.
  • Research and implement large model merging techniques, exploring collaborative optimization solutions for multiple models.
  • Develop and maintain validation, evaluation, and performance monitoring processes for large models to ensure system stability and usability.
  • Participate in the development and optimization of large model application platforms (microservices) to enhance system modularity and usability.

Qualifications

  • Master’s degree or higher in Mathematics, Electrical Engineering, Computer Science, Data Science, or a related field is preferred.
  • Proficient in machine learning, deep learning, and Transformer architecture, with hands-on experience in end-to-end training and development of large models.
  • Familiar with large model compression and optimization methods such as pruning, quantization, and knowledge distillation.
  • Strong capabilities in large-scale data processing and familiarity with big data tools (e.g., Hadoop, Spark), with experience in data preprocessing, cleaning, and building training datasets.
  • Skilled in Python, C/C++, and Linux programming, with a solid foundation in algorithms and data structures.
  • Familiar with mainstream large model training and inference frameworks such as PyTorch, Hugging Face (HF), DeepSpeed, PEFT, vLLM, TRL, etc.
  • Knowledge of Triton or other high-performance inference tools, with experience in applying model optimization to real-world deployments.
  • Proficient in Docker and Linux shell scripting; experience with FastAPI development is a plus.
  • Experience in enterprise-level large model development, optimization, deployment, and tool development is preferred.
  • Strong teamwork and communication skills, capable of collaborating with cross-domain teams.
  • Passionate about cutting-edge large model technologies and their applications in industrial vertical domains.

Bonus Points

  • Experience in RAG systems and Agent module development and optimization.
  • Familiarity with CUDA programming, distributed computing, or related high-performance computing technologies.
  • Publications in top-tier conferences (e.g., NeurIPS, ICLR, CVPR).
  • Knowledge of hardware acceleration technologies (e.g., GPU, TPU) and their applications in model optimization.


工作种类
工作地区 沙田

有关招聘公司
TCL Corporate Research (Hong Kong) Co., Limited