A small class-conditional Diffusion Transformer trained on MNIST. Pick a digit and sample new handwritten-digit images.