Papers

General

PipeDream: Fast and efficient pipeline parallel DNN training. arXiv:1806.03377, 2018.
Accurate, large minibatch SGD: Training ImageNet in 1 hour. CoRR, abs/1706.02677, 2017.
GPipe: Efficient training of giant neural networks using pipeline parallelism. CoRR, abs/1811.06965, 2018.
Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.

Agents

Large Model Fine-Tuning

Distributed Models

NLP LLM

MoE LLM

Vision LLM

Multimodal LLM

LLM Reinforcement Learning

LLM Safety

Datasets & Data Distillation

Framework

ML

RAG

Tools

Mobile Phone Business

AGI

Others