News

Generative AI servers: Spec ’em out

LLM AI training and inference servers, customized.

Jon Peddie

Puget Systems has introduced a new custom generative AI and machine learning server with powerful Nvidia GPUs and AMD Epyc processors.

(Source: Puget Systems)

Puget Systems, which specializes in high-performance custom-built computers, earlier this month released its newest custom generative AI and machine learning server.

The LLM AI Training and Inference server, aimed at enterprises and data centers, is a rack-mount workstation designed for GPU-intensive generative AI workflows. It supports up to eight Nvidia RTX Ada-generation GPUs for a total of 752GB of VRAM. Optionally, it can be configured with Nvidia L40S GPUs or Nvidia H100 NVL Tensor Core GPUs; the latter targets LLM inference, Puget notes, owing to its high compute density, memory bandwidth, and energy efficiency, as well as its NVLink architecture. The H100 NVL also delivers extraordinary acceleration for high-performing, elastic data centers running AI, data analytics, and HPC applications, Puget adds.

The AI Training and Inference server also comes with AMD's Epyc line of processors, offering up to 128 cores, support for 1.5TB of DDR5 ECC RAM, and 128 PCIe Gen 5 lanes. The server uses a pair of these CPUs to enable up to eight dual-width GPUs in a 4U rack-mount chassis.

The server can be configured for a wide range of generative AI applications. Pricing is based on configuration.