The torch.profiler module allows developers to track CPU and GPU execution times during model training. This guide explains how to identify bottlenecks and optimize memory allocation. It is an incremental tutorial for beginners. Practitioners can use these insights to reduce latency and improve hardware utilization in PyTorch workflows.