<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Benchmarking on The Login Node</title>
    <link>https://theloginnode.com/tags/benchmarking/</link>
    <description>Recent content in Benchmarking on The Login Node</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>en</language>
    <copyright>© 2026 Will Paik</copyright>
    <lastBuildDate>Sun, 14 Jun 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://theloginnode.com/tags/benchmarking/index.xml" rel="self" type="application/rss+xml" />
    
    <item>
      <title>PyTorch DDP Scaling Benchmark</title>
      <link>https://theloginnode.com/portfolio/pytorch-ddp-bench/</link>
      <pubDate>Sun, 14 Jun 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/portfolio/pytorch-ddp-bench/</guid>
      <description></description>
      
    </item>
    
    <item>
      <title>PyTorch DDP Scaling: V100 vs A100 on 8 GPUs with ResNet-152 and ViT-B/16</title>
      <link>https://theloginnode.com/posts/hpc-special-topics-02/</link>
      <pubDate>Sun, 14 Jun 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc-special-topics-02/</guid>
      <description>V100 and A100 both scale past 95% efficiency across 8 GPUs, but A100 delivers 2.4 to 2.7x more throughput per GPU. This post covers measured PyTorch DDP scaling results on 8xV100 SXM2 and 8xA100 SXM4, using ResNet-152 and ViT-B/16 with fp16 and bf16, and explains what the numbers actually mean for system selection.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc-special-topics-02/featured.svg" />
    </item>
    
    <item>
      <title>[HPC Special Topics] Rclone for HPC: Benchmarking and Tuning Cloud Storage Transfers</title>
      <link>https://theloginnode.com/posts/hpc-special-topics-01/</link>
      <pubDate>Mon, 16 Feb 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc-special-topics-01/</guid>
      <description>Running rclone copy with default settings works, but it is usually much slower than it needs to be. This post covers how to configure Rclone for HPC workflows, benchmark real transfer speeds across cloud storage providers, and tune the parameters that actually affect throughput on a cluster network.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc-special-topics-01/features.svg" />
    </item>
    
  </channel>
</rss>
