To answer OP:
they will fail at some point. Moore's law is just about computational power in total, not individual CPUs.
<2nm will, as far as my high school understanding goes, bring some physical challenges we can't solve yet because of Quantum Mechanics.
CPU frequency scaling hasn't done much for performance since early 2000s, and more transistors or cores also not really since the 70s.
The problem's always been memory accesses from the CPU to a much much slower either RAM or even slower disk. Multi-core processors, scaling frequencies and a lot of innovation in CPU/DRAM architecture have mitigated it some, but these also seem to have hard limitations.
A lot of the applications you're using probably aren't even utilizing multiple cores because it wouldn't improve memory accessing speed.
There are developments in the traditional sense that will improve things for years to come.
But long-term there should be huge changes coming still. I think they will have to do with quantum computing, distributed computing, close-to-memory computation, D2A mechanisms to compute in memory, other decentralized strategies to address the problems of shared memory.
Some neat graphs that explain some of the inherent limitations in Von Neumann architecture:
View attachment 170393
View attachment 170396