The torch.profiler tool allows developers to analyze GPU and CPU execution times during model training. This guide explains how to identify bottlenecks and optimize memory usage within PyTorch workflows. It provides a practical starting point for engineers. Mastering these profiling techniques reduces latency and lowers compute costs for large-scale deployments.