We are hiring a Summer Intern to join our AI Lab team. This role will work on the most cutting-edge Large Language Model (LLM) and Multi-modality data.
Responsibilities:
- LLM end-to-end data pipeline, which includes SFT data, RLHF data design, creation, cleaning and refinement.
- LLM model training, including pretraining, SFT, RLHF and rewarding phases.
- Large scale distributed training
- New algorithms and training techniques R&D in LLM field
- LLM quantization techniques
Requirements:
- Pursuing a degree in Computer Science, Data Science, Statistics, or related field
- Familiar with Python, PyTorch, Deepspeed, FSDP
- Experience with Llama.cpp, C++ is a plus
- Previous internship experience or project work in AI/LLM/ML/NLP/DL
- Ability to work onsite in Morrisville, NC