1:26Efficient Training for GPU Memory using TransformersRajistics - data science, AI, and machine learning511 viewsView & Download
11:21Run Very Large Models With Consumer Hardware Using 🤗 Transformers and 🤗 Accelerate (PT. Conf 2022)PyTorch1.6K viewsView & Download
8:02How FlashAttention Fixes the Biggest Bottleneck in TransformersAI Researcher5 viewsView & Download
9:15Accelerate Transformer inference on GPU with Optimum and Better TransformerJulien Simon4.8K viewsView & Download
27:01ZeRO-Infinity: Breaking the GPU Memory Wall for Extreme Scale Deep Learningjie mao3.0K viewsView & Download
15:41USENIX ATC '21 - Zico: Efficient GPU Memory Sharing for Concurrent DNN TrainingUSENIX990 viewsView & Download
17:56Reversible Transformer: ReFORMER for GPU Memory Optimization! Reversible Residual Layers?Discover AI1.2K viewsView & Download
5:21Kaffae Day 391 - DeepSpeed with Transformers and GPU A100Masatoshi Nishimura19 viewsView & Download
5:41Optimize NLP Model Performance with Hugging Face Transformers: A Comprehensive TutorialNeuralTalk363 viewsView & Download
24:04Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM | Jared Casper@Scale8.3K viewsView & Download