<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Posts on The Login Node</title>
    <link>https://theloginnode.com/posts/</link>
    <description>Recent content in Posts on The Login Node</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>en</language>
    <copyright>© 2026 Will Paik</copyright>
    <lastBuildDate>Sun, 14 Jun 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://theloginnode.com/posts/index.xml" rel="self" type="application/rss+xml" />
    
    <item>
      <title>PyTorch DDP Scaling: V100 vs A100 on 8 GPUs with ResNet-152 and ViT-B/16</title>
      <link>https://theloginnode.com/posts/hpc-special-topics-02/</link>
      <pubDate>Sun, 14 Jun 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc-special-topics-02/</guid>
      <description>V100 and A100 both scale past 95% efficiency across 8 GPUs, but A100 delivers 2.4 to 2.7x more throughput per GPU. This post covers measured PyTorch DDP scaling results on 8xV100 SXM2 and 8xA100 SXM4, using ResNet-152 and ViT-B/16 with fp16 and bf16, and explains what the numbers actually mean for system selection.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc-special-topics-02/featured.svg" />
    </item>
    
    <item>
      <title>[HPC From Scratch] Episode 6: Slurm Accounting, QOS, and Fair Share</title>
      <link>https://theloginnode.com/posts/hpc-from-scratch-06/</link>
      <pubDate>Sun, 07 Jun 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc-from-scratch-06/</guid>
      <description>Without accounting, Slurm treats every user the same regardless of how much they have already consumed. Episode 6 of HPC From Scratch builds the full accounting layer: account hierarchy with sacctmgr, QOS policies with wall-time limits, and fair share scheduling with exponential decay. The result is a scheduler that adapts to actual usage and prevents any single user from dominating the queue.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc-from-scratch-06/featured.svg" />
    </item>
    
    <item>
      <title>[HPC From Scratch] Episode 5: How to Install Slurm from Source on Rocky Linux</title>
      <link>https://theloginnode.com/posts/hpc-from-scratch-05/</link>
      <pubDate>Wed, 20 May 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc-from-scratch-05/</guid>
      <description>Getting Slurm running from source is more involved than installing from a package manager, but it gives you full control over version and build options. Episode 5 covers the complete setup: MUNGE for authentication, slurmctld on the controller, slurmd on compute nodes, and verifying everything works with a real batch job.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc-from-scratch-05/featured.svg" />
    </item>
    
    <item>
      <title>[HPC From Scratch] Episode 4: NFS Storage &amp; FreeIPA: One Drive, One Login</title>
      <link>https://theloginnode.com/posts/hpc-from-scratch-04/</link>
      <pubDate>Tue, 05 May 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc-from-scratch-04/</guid>
      <description>A cluster where each node has its own home directory and user list is not really a cluster. Episode 4 fixes that by installing NFS for shared storage and FreeIPA for centralized authentication, so every node sees the same files and every user logs in with the same credentials from anywhere.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc-from-scratch-04/featured.svg" />
    </item>
    
    <item>
      <title>[HPC From Scratch] Episode 3: Rocky Linux Setup, DHCP, NAT, and SSH Across All Nodes</title>
      <link>https://theloginnode.com/posts/hpc-from-scratch-03/</link>
      <pubDate>Tue, 21 Apr 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc-from-scratch-03/</guid>
      <description>Building a home HPC cluster does not require a perfect network on day one. Episode 3 covers setting up the login node on WiFi, planning a clean IP scheme for the cluster subnet, and getting SSH working reliably across all nodes while keeping the setup simple enough to upgrade later.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc-from-scratch-03/featured.svg" />
    </item>
    
    <item>
      <title>[HPC From Scratch] Episode 2: RAM, NVMe, and the iGPU Memory Trap</title>
      <link>https://theloginnode.com/posts/hpc-from-scratch-02/</link>
      <pubDate>Wed, 25 Mar 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc-from-scratch-02/</guid>
      <description>Buying more RAM does not always mean your jobs get more of it. Episode 2 of HPC From Scratch covers the hardware upgrades made to the cluster nodes and explains how iGPU shared memory quietly reduces available system memory on compute nodes, with no error message to warn you.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc-from-scratch-02/featured.svg" />
    </item>
    
    <item>
      <title>[HPC From Scratch] Episode 1: Building a 6-Node HPC Cluster for $1,264</title>
      <link>https://theloginnode.com/posts/hpc-from-scratch-01/</link>
      <pubDate>Fri, 13 Mar 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc-from-scratch-01/</guid>
      <description>University clusters have waitlists and cloud HPC gets expensive fast. This series documents building a real 6-node HPC cluster from consumer hardware for $1,264. Episode 1 covers hardware selection, how to assign node roles, and the decisions that actually matter before you buy anything.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc-from-scratch-01/featured.svg" />
    </item>
    
    <item>
      <title>[HPC Special Topics] Rclone for HPC: Benchmarking and Tuning Cloud Storage Transfers</title>
      <link>https://theloginnode.com/posts/hpc-special-topics-01/</link>
      <pubDate>Mon, 16 Feb 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc-special-topics-01/</guid>
      <description>Running rclone copy with default settings works, but it is usually much slower than it needs to be. This post covers how to configure Rclone for HPC workflows, benchmark real transfer speeds across cloud storage providers, and tune the parameters that actually affect throughput on a cluster network.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc-special-topics-01/features.svg" />
    </item>
    
    <item>
      <title>[HPC 101] Job Debugging: Why Did My Job Fail?</title>
      <link>https://theloginnode.com/posts/hpc101-04/</link>
      <pubDate>Wed, 28 Jan 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc101-04/</guid>
      <description>Job failures on HPC clusters are frustrating, especially without clear feedback. This post breaks down how to use sacct, seff, and Slurm log files to figure out exactly what went wrong and how to prevent it next time.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc101-04/featured.svg" />
    </item>
    
    <item>
      <title>[HPC 101] Python on HPC: Conda and venv Without Root Access</title>
      <link>https://theloginnode.com/posts/hpc101-03/</link>
      <pubDate>Sun, 18 Jan 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc101-03/</guid>
      <description>Shared HPC clusters do not give you root access, and the system Python is not yours to modify. This post shows how to set up isolated Python environments using Conda and venv so you can install whatever you need without permission errors or breaking someone else&amp;rsquo;s workflow.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc101-03/featured.svg" />
    </item>
    
    <item>
      <title>[Linux 101] Linux Terminal for Beginners: Commands You Will Actually Use</title>
      <link>https://theloginnode.com/posts/linux101-01/</link>
      <pubDate>Fri, 09 Jan 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/linux101-01/</guid>
      <description>Most people&amp;rsquo;s first experience with the Linux terminal is copying a command from somewhere and hoping it works. This post takes a different approach: explain what is actually happening, cover the commands you will use every single day, and make the terminal feel like a tool instead of a trap.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/linux101-01/featured.svg" />
    </item>
    
    <item>
      <title>[HPC 101] File Transfer on HPC: SCP, Rsync, and Git Explained</title>
      <link>https://theloginnode.com/posts/hpc101-02/</link>
      <pubDate>Fri, 02 Jan 2026 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc101-02/</guid>
      <description>Copying files to a remote cluster is easy to get wrong. A basic scp works fine until your dataset is large and the connection drops halfway through. This post covers scp for quick transfers, rsync for large or resumable jobs, and git for code that belongs in version control anyway.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc101-02/featured.svg" />
    </item>
    
    <item>
      <title>[HPC 101] Getting Started on HPC: SSH Login, Module System, and First Slurm Job</title>
      <link>https://theloginnode.com/posts/hpc101-01/</link>
      <pubDate>Sat, 27 Dec 2025 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hpc101-01/</guid>
      <description>Every HPC user starts somewhere, and it usually involves staring at a login prompt with no idea what to do next. This post covers the three things you need on day one: SSH access, loading software with the module system, and submitting your first Slurm batch job.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hpc101-01/featured.png" />
    </item>
    
    <item>
      <title>Hello World: What This Blog Is and Why It Exists</title>
      <link>https://theloginnode.com/posts/hello-world/</link>
      <pubDate>Tue, 16 Dec 2025 00:00:00 +0000</pubDate>
      
      <guid>https://theloginnode.com/posts/hello-world/</guid>
      <description>The first post on The Login Node. Will Paik explains why he started the blog, what direction he is planning to take it, and how he is approaching it.</description>
      <media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://theloginnode.com/posts/hello-world/featured.jpg" />
    </item>
    
  </channel>
</rss>
