The Nemotron ecosystem is best understood as a layered LLM training stack. Different repositories handle different pieces of the problem: data preparation, pretraining, post-training, evaluation, orchestration, and low-level distributed systems behavior.
The useful mental model is to separate blueprint from execution. One repo shows how the system is structured, another makes a compact run practical, and another explains the machinery underneath.