Nvidia Tesla - AbsoluteAstronomy.com

Graphics processing unit

A graphics processing unit or GPU is a specialized circuit designed to rapidly manipulate and alter memory in such a way so as to accelerate the building of images in a frame buffer intended for output to a display...

(GPU) is nVidia's

NVIDIA

Nvidia is an American global technology company based in Santa Clara, California. Nvidia is best known for its graphics processors . Nvidia and chief rival AMD Graphics Techonologies have dominated the high performance GPU market, pushing other manufacturers to smaller, niche roles...

third brand of GPUs. It is based on high-end GPUs from the G80

GeForce 8 Series

The GeForce 8 Series, is the eighth generation of NVIDIA's GeForce line of graphics processing units. The third major GPU architecture developed at NVIDIA, the GeForce 8 represents the company's first unified shader architecture.-Naming:...

(and on), as well as the Quadro

NVIDIA Quadro

The Nvidia Quadro series of AGP, PCI, and PCI Express graphics cards comes from the NVIDIA Corporation. Their designers aimed to accelerate CAD and DCC , and the cards are usually featured in workstations....

lineup. Tesla is nVidia's first dedicated General Purpose GPU

GPGPU

General-purpose computing on graphics processing units is the technique of using a GPU, which typically handles computation only for computer graphics, to perform computation in applications traditionally handled by the CPU...

. The Tesla series takes its name from pioneering Serbian electrical engineer Nikola Tesla

Nikola Tesla

Nikola Tesla was a Serbian-American inventor, mechanical engineer, and electrical engineer...

Tesla overview

Because of their very high computational power (measured in floating point operations per second or FLOPS

FLOPS

In computing, FLOPS is a measure of a computer's performance, especially in fields of scientific calculations that make heavy use of floating-point calculations, similar to the older, simpler, instructions per second...

) compared to previous microprocessors, the Tesla products target the high performance computing market.
The lack of ability to output images to a display is the main difference between Tesla products and ordinary video cards. For equivalent single precision output, Fermi-based nVidia Geforce cards have four times less dual-precision performance. Tesla products primarily operate:

in simulations and in large scale calculations (especially floating-point calculations)
for high-end image generation for applications in professional and scientific fields
with the use of OpenCL
OpenCL
OpenCL is a framework for writing programs that execute across heterogeneous platforms consisting of CPUs, GPUs, and other processors. OpenCL includes a language for writing kernels , plus APIs that are used to define and then control the platforms...

or CUDA
CUDA
CUDA or Compute Unified Device Architecture is a parallel computing architecture developed by Nvidia. CUDA is the computing engine in Nvidia graphics processing units that is accessible to software developers through variants of industry standard programming languages...

.

nVidia Teslas power the second-fastest supercomputer in the world, Tianhe-1A, in Tianjin

Tianjin

' is a metropolis in northern China and one of the five national central cities of the People's Republic of China. It is governed as a direct-controlled municipality, one of four such designations, and is, thus, under direct administration of the central government...

, China

China

Chinese civilization may refer to:* China for more general discussion of the country.* Chinese culture* Greater China, the transnational community of ethnic Chinese.* History of China* Sinosphere, the area historically affected by Chinese culture...

Specifications and configurations

Configuration	Model	# of GPUs	Core clock in MHz (each)	Shaders		Memory					Processing Power (peak) GFLOPs			Compute capability⁴	TDP Thermal Design Power The thermal design power , sometimes called thermal design point, refers to the maximum amount of power the cooling system in a computer is required to dissipate. For example, a laptop's CPU cooling system may be designed for a 20 watt TDP, which means that it can dissipate up to 20 watts of heat... watts	Form factor and features
Configuration	Model	# of GPUs	Core clock in MHz (each)	Thread Processors (total)	Clock in MHz (each)	Bandwidth max (GB Gigabyte The gigabyte is a multiple of the unit byte for digital information storage. The prefix giga means 109 in the International System of Units , therefore 1 gigabyte is... /s)	Bus type	Bus width (bit Bit A bit is the basic unit of information in computing and telecommunications; it is the amount of information stored by a digital device or other physical system that exists in one of two possible distinct states... , each GPU)	Total size (MiB)	Clock (MHz)	Single Precision(SP) Single precision floating-point format Single-precision floating-point format is a computer number format that occupies 4 bytes in computer memory and represents a wide dynamic range of values by using a floating point.... Total(MUL+ADD+SF)	Single Precision(SP) Single precision floating-point format Single-precision floating-point format is a computer number format that occupies 4 bytes in computer memory and represents a wide dynamic range of values by using a floating point.... MAD(MUL+ADD)	Double Precision(DP) FMA	Compute capability⁴		Form factor and features
GPU Computing Processor¹	C870	1	600	128	1350	76.8	GDDR3	384	1536	1600	518.4	345.6	0	1.0	170.9	Full-height video card Video card A video card, Graphics Card, or Graphics adapter is an expansion card which generates output images to a display. Most video cards offer various functions such as accelerated rendering of 3D scenes and 2D graphics, MPEG-2/MPEG-4 decoding, TV output, or the ability to connect multiple monitors...
Deskside Supercomputer¹	D870	2	600	2 × 128 (256)	1350	153.6	GDDR3	384	3072	1600	1036.8	691.2	0	1.0		Deskside system NVIDIA Quadro Plex The Nvidia Quadro Plex is an external graphics processing unit designed for large-scale 3D visualizations. The system consists of a box containing a pair of high-end Nvidia Quadro graphics cards featuring a variety of external video connectors... or Rack unit Rack unit A rack unit or U is a unit of measure used to describe the height of equipment intended for mounting in a 19-inch rack or a 23-inch rack...
GPU Computing Server¹	S870	4	600	4 × 128 (512)	1350	307.2	GDDR3	384	6144	1600	2073.6	1382.4	0	1.0		1U Rack 19-inch rack A 19-inch rack is a standardized frame or enclosure for mounting multiple equipment modules. Each module has a front panel that is wide, including edges or ears that protrude on each side which allow the module to be fastened to the rack frame with screws.-Overview and history:Equipment designed...
C1060 Computing Processor ²	C1060	1	602	240	1300	102.4	GDDR3	512	4096	1600	933.12	622.08	77.76	1.3	187.8	2 slot video card
S1075 1U GPU Computing Server^3,4	S1070	4	602	4 × 240 (960)	1440	409.6	GDDR3	512	16384	1600	4147.2	2764.8	345.6	1.3		1U Rack 19-inch rack A 19-inch rack is a standardized frame or enclosure for mounting multiple equipment modules. Each module has a front panel that is wide, including edges or ears that protrude on each side which allow the module to be fastened to the rack frame with screws.-Overview and history:Equipment designed... IEEE 754-2008 capabilities
C2050/C2070 GPU Computing Processor	C2050/C2070	1	575	448	1150	144	GDDR5	384	3072/6144⁵	3000	1288	1030.4⁶	515.2	2.0	238/247	Full-height video card Video card A video card, Graphics Card, or Graphics adapter is an expansion card which generates output images to a display. Most video cards offer various functions such as accelerated rendering of 3D scenes and 2D graphics, MPEG-2/MPEG-4 decoding, TV output, or the ability to connect multiple monitors... IEEE 754-2008 FMA capabilities
M2050 GPU Computing Module	M2050	1	575	448	1150	148.4	GDDR5	384	3072⁵	3092	1288	1030.4⁶	515.2	2.0	225	Computing Module IEEE 754-2008 FMA capabilities
M2070/M2070Q GPU Computing Module	M2070/M2070Q	1	575	448	1150	150.336	GDDR5	384	6144⁵	3132	1288	1030.4⁶	515.2	2.0	225	Computing Module IEEE 754-2008 FMA capabilities
M2090 GPU Computing Module	M2090	1	650	512	1300	177	GDDR5	384	6144⁵	1850	1331	?	665	2.0	225	Computing Module IEEE 754-2008 FMA capabilities
S2050 1U GPU Computing System	S2050	4	575	4 × 448 (1792)	1150	4 × 148.4 (593.6)	GDDR5	384	12288⁵	3092	5152	4121.6⁶	2060.8	2.0	900	1U Rack 19-inch rack A 19-inch rack is a standardized frame or enclosure for mounting multiple equipment modules. Each module has a front panel that is wide, including edges or ears that protrude on each side which allow the module to be fastened to the rack frame with screws.-Overview and history:Equipment designed... IEEE 754-2008 FMA capabilities

Notes

¹ Specifications not specified by NVIDIA are assumed to be based on the GeForce 8800GTX
² Specifications not specified by NVIDIA are assumed to be based on the GeForce GTX 285
³ A host system/server is required to connect to the 1U GPU computing server by the PCI Express
PCI Express
PCI Express , officially abbreviated as PCIe, is a computer expansion card standard designed to replace the older PCI, PCI-X, and AGP bus standards...

card (similar set-up as the Nvidia Quadro Plex
NVIDIA Quadro Plex
The Nvidia Quadro Plex is an external graphics processing unit designed for large-scale 3D visualizations. The system consists of a box containing a pair of high-end Nvidia Quadro graphics cards featuring a variety of external video connectors...

)
⁴ Core architecture version according to the CUDA programming guide.
⁵ With ECC on, a portion of the dedicated memory is used for ECC bits, so the available user memory is reduced by 12.5%. (e.g. 3 GB total memory yields 2.625 GB of user available memory.)
⁶ Fermi implements the new fused multiply–add (FMA) instruction for both 32-bit single-precision and 64-bit double-precision floating point numbers (GT200 supported FMA only in double precision) that improves upon multiply-add by retaining full precision in the intermediate stage.
For the basic specifications of Tesla, refer to the GPU Computing Processor specifications.
Performance figures are for single-precision except where noted.
NVIDIA Tesla Supercomputers are also available with up to 8x Fermi GPUs from Manufacturers.

External links

NVIDIA Product Overview and Technical Brief

The source of this article is wikipedia, the free encyclopedia. The text of this article is licensed under the GFDL.

Tesla overview

Specifications and configurations

See also

External links