AI & Data Engineer
职位亮点
职位描述
Team: Small cross-functional team of AI and medical experts
Focus: Organizing production data, fine-tuning AI chat models, supporting product R&D acceleration
Role Summary
We are seeking a motivated and curious AI & Data Engineer to join our innovative team. You will play a key role in managing and organizing production data pipelines and assisting in the fine-tuning of AI chat models to accelerate product research and development in the medical AI domain. This is an excellent opportunity for early-career professionals eager to grow their skills in AI, data engineering, and healthcare technology.Key
Responsibilities
- Develop and maintain data pipelines to organize and preprocess production data for AI model training and evaluation.
- Collaborate with AI researchers and medical experts to fine-tune AI chat models, ensuring data quality and relevance.
- Support data integration from multiple sources, ensuring clean, accurate, and structured datasets for AI workflows.
- Assist in monitoring and optimizing AI model performance and data pipeline efficiency.
- Assist in prompt engineering and testing to improve AI chatbot performance in clinical scenarios
- Participate in data labeling and annotation tasks for supervised learning and model evaluation.
Contribute to internal tooling and dashboards to monitor data quality and model behavior.
Document data processes, pipelines, and model tuning experiments for team collaboration.- Stay up-to-date with AI and data engineering best practices and tools relevant to model fine-tuning and production data management.
Requirements
- Bachelor’s degree in Computer Science, Data Science, Engineering, or related field (fresh graduates or up to 2 years experience preferred).
- Strong curiosity and eagerness to learn about AI, machine learning, and data engineering in a medical context.
- Basic programming skills in Python, Langchain; familiarity with data manipulation libraries (e.g., Pandas, NumPy).
- Understanding of relational databases and SQL for querying structured data, data pipeline concepts, ETL processes, and data quality principles.
- Exposure or interest in AI/ML frameworks such as TensorFlow, PyTorch, or Hugging Face is a plus.
- Ability to work collaboratively in a small, interdisciplinary team environment.
- Good communication skills in English; Cantonese proficiency is a plus.
- Detail-oriented with strong logical thinking and problem-solving skills.
- Legally permitted to work in Hong Kong.
Preferred Skills (Nice to Have)
- Experience or coursework related to natural language processing (NLP) or chatbots.
- Exposure to vector databases (e.g., FAISS, Pinecone) and semantic search concepts.
- Experience with automated testing or data validation tools to ensure model and pipeline reliability.
- Familiarity with cloud platforms (preferably GCP) and data pipeline tools
- Knowledge of data privacy and compliance considerations in healthcare data.
工作种类 | |
工作地区 | 湾仔 |