The more components you use, the slower the simulation became. You need to find the perfect steps per clock tick for your project. More component= More steps per clock tick.
You can try to simplify by using less component. Create your chip via nand. Use ROM for decoder, 7 seg disp, etc...
That's an 8-bits reg faster than the old ways.