
GitHub - NVIDIA/Megatron-LM: Ongoing research training …
Megatron Core expands upon Megatron-LM's GPU-optimized techniques with cutting-edge system-level optimizations, exposed through composable and modular APIs.
Releases · NVIDIA/Megatron-LM - GitHub
Megatron-LM/megatron/core/QuickStart.md at main - GitHub
Megatron-LM/megatron/core/README.md at main - GitHub
ROCm/Stanford-Megatron-LM - GitHub
GitHub - epfLLM/Megatron-LLM: distributed trainer for LLMs
Our repository is a modification of the original Megatron-LM codebase by Nvidia. Key added features include supported architectures: Llama, Llama 2, Code Llama, Falcon, and Mistral …
GitHub - shizhengLi/megatron-learning: Distributed training framework Megatron …
This project focuses on learning and researching the Megatron-LM framework, covering a complete body of knowledge from basic concepts through advanced implementation techniques. Through systematic study and practice, it helps developers gain a deep understanding of the core prin… of large-scale language model training.
Megatron-DeepSpeed - GitHub
DeepSpeed version of NVIDIA's Megatron-LM that adds support for several features such as MoE model training, Curriculum Learning, 3D Parallelism, and others.
GitHub - Ascend/Megatron-LM
Megatron-LM overview · Preparing the training environment · Starting training · Training results · Release notes
alibaba/Pai-Megatron-Patch - GitHub
The design philosophy of Pai-Megatron-Patch is to avoid invasive modifications to the source code of Megatron-LM. In other words, it does not add new modules directly to Megatron-LM.
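The "patch, don't modify" philosophy described above can be sketched generically: instead of editing upstream source, a patch layer rebinds upstream symbols at import time. This is a minimal illustrative sketch, not Pai-Megatron-Patch's actual code; `upstream`, `build_layer`, and the `rotary` flag are all hypothetical stand-ins.

```python
import types

# Stand-in for an upstream module whose source we must not edit in place.
# (Hypothetical; real Megatron-LM modules are far more complex.)
upstream = types.SimpleNamespace()

def original_build_layer(hidden_size):
    """Pretend upstream builder returning a plain config dict."""
    return {"hidden_size": hidden_size, "rotary": False}

upstream.build_layer = original_build_layer

def patched_build_layer(hidden_size):
    """Wrapper that extends the upstream builder without touching its source."""
    layer = original_build_layer(hidden_size)
    layer["rotary"] = True  # feature added entirely from the outside
    return layer

# The patch rebinds the symbol, leaving the upstream code untouched.
upstream.build_layer = patched_build_layer

print(upstream.build_layer(4096))  # → {'hidden_size': 4096, 'rotary': True}
```

Because the upstream function object is kept and wrapped rather than edited, the patch layer can track new upstream releases with minimal merge conflicts, which is the benefit the description above claims for avoiding invasive modifications.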