Is VRAM the only bottleneck, or is processing power also insufficient to run top models on a single GPU?