Benchmarking on The Login Node

https://theloginnode.com/tags/benchmarking/
© 2026 Will Paik
Last updated: Sun, 14 Jun 2026

PyTorch DDP Scaling Benchmark
Sun, 14 Jun 2026
https://theloginnode.com/portfolio/pytorch-ddp-bench/

PyTorch DDP Scaling: V100 vs A100 on 8 GPUs with ResNet-152 and ViT-B/16
Sun, 14 Jun 2026
https://theloginnode.com/posts/hpc-special-topics-02/
V100 and A100 both scale past 95% efficiency across 8 GPUs, but A100 delivers 2.4x to 2.7x more throughput per GPU. This post covers measured PyTorch DDP scaling results on 8xV100 SXM2 and 8xA100 SXM4, using ResNet-152 and ViT-B/16 with fp16 and bf16, and explains what the numbers actually mean for system selection.

[HPC Special Topics] Rclone for HPC: Benchmarking and Tuning Cloud Storage Transfers
Mon, 16 Feb 2026
https://theloginnode.com/posts/hpc-special-topics-01/
Running rclone copy with default settings works, but it is usually much slower than it needs to be. This post covers how to configure Rclone for HPC workflows, benchmark real transfer speeds across cloud storage providers, and tune the parameters that actually affect throughput on a cluster network.