While the press and analyst community is having a field day saying that the UltraSparc-IV processor is dead, what is probably closer to the truth is that Sun has killed those projects so it can get the Niagara and Rock multithreaded processors and their servers to market as quickly as possible.
It would not be entirely surprising to see the UltraSparc-V moniker slapped on either Niagara or Rock designs (or both), which is why anyone saying that the UltraSparc-IV is dead is probably not going to turn out to be true in the long run. For the past three years within Sun, the company’s engineers have been pursuing two different future server architectures, says David Yen, executive vice president in charge of Sparc chip development. Sun had two competing Sparc designs, and one of them had to go.
On the one side of Sun are those people working on Millennium processors and Eagle servers, and on the other are those who, in the wake of the acquisition of Afara Systems, which was building an eight-core processor based on a stripped down UltraSparc-II processor, decided that cramming as many cores and threads onto a processor would be, in the long run, a better strategy for Sun to take. In a world where Sun has to cut 3,300 employees and trim its R&D costs, something has to give. And despite the fact that both Millennium and Gemini have taped out and Gemini is already in silicon, Sun stared down the amount of further investment it would have to make to productize these chips, and decided that it had better concentrate on the throughput computing designs.
The Millennium chip had a very aggressive pipeline compared to the UltraSparc-III core that is at the heart of the current Cheetah UltraSparc-III and Jaguar UltraSparc-IV processors, Yen explained. Sun’s goal when it started the Millennium project nearly five years ago was to have a chip that was better than Intel’s Itaniums and IBM Corp’s Power processors. Specifically, Millennium, which would have been productized as UltraSparc-V, would be able to switch between two modes: one optimized for multithreaded applications (as the multicore Niagara and Rock processors are, by their very design); and one optimized for the kind of single-threaded applications that dominate database and ERP applications today.
As Yen explained almost two years ago, Itanium chips use sophisticated compilers to look for parallelism in applications, and compile them in such a way to make them run efficiently. Even the Power5 processors only have four threads per chip (two cores with two threads each). Sun wanted to have it both ways with the Eagle platforms. The Millennium processors were to be built using a 90 nanometer copper/low-k process from Texas Instruments, Sun’s fab partner, and run at clock speeds in the range of 1.8GHz to 3GHz or higher. Yen said that, at the time, the ideas embodied in the Millennium chips and the Eagle platforms were the right ideas, but that they had been superceded by the ideas embodied in the radically multithreaded Niagara and Rock designs that Sun has been touting so hard in the past year.
We want to invest more aggressively in these throughput computing approaches, Yen said in an interview. It was time for us to make a decision. If we do not do a new breed of processors, we believe that our competitors will. He would not elaborate on how much money Sun spent on the Millennium chips and Eagle servers, but he said that with a radically different design, as Millennium encompassed, it was going to take a lot of money to get customers and software partners to use its new features. He also said that lots of the intellectual property developed for Millennium and Eagle will make it into the multithreaded designs, including memory subsystems and error correction electronics across the server.
As for Gemini, which was a dual core implementation of the Blackbird UltraSparc-II processor, this chip was in essence a first pass idea to test the throughput computing approach. However, at a 1GHz to 1.2GHz clock speed, Yen explained, it did not have a significant performance advantage–although a slight one on multithreaded applications–compared to the current Jalapeno UltraSparc-IIIi processor, which has a large L2 on the chip to boost performance and which runs at the same clock speed. With Opteron machines, Xeon machines, and two Sparc machines all competing for the same edge of the network space, even though Gemini was essentially finished, Sun cannot afford to productize it and doesn’t believe it is wise to considering the overlap.
While Sun gave the Rock moniker to its future radical multithreaded chip in February, it has not said much about what it looks like and when it might be delivered. The Rock processors will follow the Niagara processors, which will be aimed at entry and some midrange servers. In February, Yen said that in the late 2005 to early 2006 timeframe, TI would have a 65 nanometer process to create the Niagara processors, which will pack eight of the four-threaded simplified UltraSparc-II cores onto a single piece of silicon. In Sun’s roadmap, relative performance is being reckoned against the oomph supplied by the Hummingbird UltraSparc-IIi processor running at 650MHz. Sun has said that it expects that the Niagara chips will provide about 15 times the performance of this baseline processor. Sun has said that the third generation throughput computing chips–now called Rock–will deliver 30 times the performance of this baseline. But raw performance is not the only thing of value with these future CMT designs, according to Sun.
In the meantime, Yen said that Sun can crank out some more clock speed on the new Jaguar UltraSparc-IVs using the 130 nanometer process and that Sun will be able to push performance further with one or two iterations of a 90 nanometer copper/low-k/strained silicon process, creating the Panther UltraSparc-IV+ processor. The new UltraSparc-IV is a dual core processor, just like IBM’s Power4 and Power5 and Hewlett-Packard Corp’s PA-8800 processors.
With the Panther chips, Sun is adding microarchitecture improvements to the cores to tweak performance, but the shrinking in size of the chip core by half (in terms of area) should allow the chip to hit 2GHz to 2.5GHz clock speeds, and eventually go higher. The Panther chips, Yen said, would also bring the outboard L2 cache in the UltraSparc-III and UltraSparc-IV chips onto the chip as well as an L3 cache controller. Sun will be adding an L3 cache to the UltraSparc-IV+ packaging, much as it now does with L2 caches on current UltraSparc processors.
This article is based on material originally published by ComputerWire