Adding a dense inference cluster to an air-cooled hall.
The situation
A Tier-III hall built for ~7 kW racks needs to host a new GPU inference cluster at 30–40 kW per rack. The room can’t go offline, and full immersion isn’t on the table.
How we’d approach it
Direct-to-chip cold plates on the GPU servers, fed by in-row coolant distribution units (CDUs), with rear-door heat exchangers on adjacent racks to take the residual air load. A dedicated technology cooling loop ties into the existing chilled-water plant and is filled and commissioned without dropping the surrounding rows.
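To make the split between liquid and air load concrete, here is a rough sizing sketch. It is not a design calculation. The 75% cold-plate capture fraction and the 10 K coolant temperature rise are illustrative assumptions, not figures from this brief, and the coolant is treated as plain water. The function names are hypothetical.

```python
# Rough sizing sketch: split rack heat between cold plates and residual air,
# then estimate the coolant flow needed to carry the liquid share.
# ASSUMPTIONS (not from the brief): 75% liquid capture, 10 K coolant rise,
# coolant treated as plain water.

WATER_CP_KJ_PER_KG_K = 4.186    # approximate specific heat of water
WATER_DENSITY_KG_PER_L = 1.0    # approximate density of water


def split_heat(rack_kw, liquid_capture_fraction):
    """Return (liquid_kw, air_kw) for a rack at the given capture fraction."""
    if not 0.0 <= liquid_capture_fraction <= 1.0:
        raise ValueError("capture fraction must be between 0 and 1")
    liquid_kw = rack_kw * liquid_capture_fraction
    air_kw = rack_kw - liquid_kw
    return liquid_kw, air_kw


def coolant_flow_lpm(liquid_kw, delta_t_k):
    """Estimate coolant flow in litres per minute from Q = m_dot * cp * dT."""
    if delta_t_k <= 0:
        raise ValueError("delta_t_k must be positive")
    mass_flow_kg_s = liquid_kw / (WATER_CP_KJ_PER_KG_K * delta_t_k)
    return mass_flow_kg_s / WATER_DENSITY_KG_PER_L * 60.0


if __name__ == "__main__":
    ASSUMED_CAPTURE = 0.75   # assumption, varies by server and cold-plate design
    ASSUMED_DELTA_T_K = 10.0  # assumption, set by the loop design
    for rack_kw in (30.0, 40.0):  # rack densities from the brief
        liquid_kw, air_kw = split_heat(rack_kw, ASSUMED_CAPTURE)
        flow = coolant_flow_lpm(liquid_kw, ASSUMED_DELTA_T_K)
        print(f"{rack_kw:.0f} kW rack: {liquid_kw:.1f} kW to liquid, "
              f"{air_kw:.1f} kW to air, ~{flow:.0f} L/min coolant")
```

Under these assumptions the residual air load comes out at 7.5 to 10 kW per rack, which is at or above the hall's original ~7 kW design point. That is the gap the rear-door heat exchangers are there to close.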
What it targets
A PUE in the 1.15–1.20 range for the new cluster, no white-space downtime during the cut-in, and lower fan noise as air load comes off the room.
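The PUE target can be read as a budget for non-IT power. PUE is total facility power divided by IT power, so the overhead allowed is IT load times (PUE - 1). For a single cluster inside a shared hall this is strictly a partial PUE. The sketch below works that through for a hypothetical cluster of 10 racks at 35 kW each. The rack count and the per-rack figure are assumptions chosen for illustration. The brief gives only the 30–40 kW range.

```python
# Overhead budget implied by a PUE target.
# PUE = total facility power / IT power  =>  overhead = IT * (PUE - 1)
# ASSUMPTION (not from the brief): 10 racks at 35 kW each.


def overhead_budget_kw(it_load_kw, pue_target):
    """Return the non-IT power (cooling, distribution losses) a PUE target allows."""
    if pue_target < 1.0:
        raise ValueError("PUE cannot be below 1.0")
    return it_load_kw * (pue_target - 1.0)


if __name__ == "__main__":
    ASSUMED_RACKS = 10        # hypothetical cluster size
    ASSUMED_RACK_KW = 35.0    # midpoint of the 30-40 kW range
    it_load_kw = ASSUMED_RACKS * ASSUMED_RACK_KW
    for pue in (1.15, 1.20):  # target range from the brief
        budget = overhead_budget_kw(it_load_kw, pue)
        print(f"PUE {pue:.2f}: about {budget:.1f} kW of overhead "
              f"for {it_load_kw:.0f} kW of IT load")
```

For this assumed 350 kW cluster, the target range leaves about 52.5 to 70 kW for pumps, CDUs, the chilled-water share, and electrical losses combined.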