Nvidia Tesla
Encyclopedia
The Tesla graphics processing unit
Graphics processing unit
A graphics processing unit or GPU is a specialized circuit designed to rapidly manipulate and alter memory in such a way so as to accelerate the building of images in a frame buffer intended for output to a display...

 (GPU) is nVidia's
NVIDIA
Nvidia is an American global technology company based in Santa Clara, California. Nvidia is best known for its graphics processors . Nvidia and chief rival AMD Graphics Techonologies have dominated the high performance GPU market, pushing other manufacturers to smaller, niche roles...

 third brand of GPUs. It is based on high-end GPUs from the G80
GeForce 8 Series
The GeForce 8 Series, is the eighth generation of NVIDIA's GeForce line of graphics processing units. The third major GPU architecture developed at NVIDIA, the GeForce 8 represents the company's first unified shader architecture.-Naming:...

 (and on), as well as the Quadro
NVIDIA Quadro
The Nvidia Quadro series of AGP, PCI, and PCI Express graphics cards comes from the NVIDIA Corporation. Their designers aimed to accelerate CAD and DCC , and the cards are usually featured in workstations....

 lineup. Tesla is nVidia's first dedicated General Purpose GPU
GPGPU
General-purpose computing on graphics processing units is the technique of using a GPU, which typically handles computation only for computer graphics, to perform computation in applications traditionally handled by the CPU...

. The Tesla series takes its name from pioneering Serbian electrical engineer Nikola Tesla
Nikola Tesla
Nikola Tesla was a Serbian-American inventor, mechanical engineer, and electrical engineer...

.

Tesla overview

Because of their very high computational power (measured in floating point operations per second or FLOPS
FLOPS
In computing, FLOPS is a measure of a computer's performance, especially in fields of scientific calculations that make heavy use of floating-point calculations, similar to the older, simpler, instructions per second...

) compared to previous microprocessors, the Tesla products target the high performance computing market.
The lack of ability to output images to a display is the main difference between Tesla products and ordinary video cards. For equivalent single precision output, Fermi-based nVidia Geforce cards have four times less dual-precision performance. Tesla products primarily operate:
  • in simulations and in large scale calculations (especially floating-point calculations)
  • for high-end image generation for applications in professional and scientific fields
  • with the use of OpenCL
    OpenCL
    OpenCL is a framework for writing programs that execute across heterogeneous platforms consisting of CPUs, GPUs, and other processors. OpenCL includes a language for writing kernels , plus APIs that are used to define and then control the platforms...

     or CUDA
    CUDA
    CUDA or Compute Unified Device Architecture is a parallel computing architecture developed by Nvidia. CUDA is the computing engine in Nvidia graphics processing units that is accessible to software developers through variants of industry standard programming languages...

    .


nVidia Teslas power the second-fastest supercomputer in the world, Tianhe-1A, in Tianjin
Tianjin
' is a metropolis in northern China and one of the five national central cities of the People's Republic of China. It is governed as a direct-controlled municipality, one of four such designations, and is, thus, under direct administration of the central government...

, China
China
Chinese civilization may refer to:* China for more general discussion of the country.* Chinese culture* Greater China, the transnational community of ethnic Chinese.* History of China* Sinosphere, the area historically affected by Chinese culture...

.

Specifications and configurations

Configuration Model # of GPUs Core clock
in MHz (each)
Shaders Memory Processing Power (peak)
GFLOPs
Compute capability4 TDP
Thermal Design Power
The thermal design power , sometimes called thermal design point, refers to the maximum amount of power the cooling system in a computer is required to dissipate. For example, a laptop's CPU cooling system may be designed for a 20 watt TDP, which means that it can dissipate up to 20 watts of heat...

 watts
Form factor
and features
Thread Processors (total) Clock in MHz (each) Bandwidth max (GB
Gigabyte
The gigabyte is a multiple of the unit byte for digital information storage. The prefix giga means 109 in the International System of Units , therefore 1 gigabyte is...

/s)
Bus type Bus width (bit
Bit
A bit is the basic unit of information in computing and telecommunications; it is the amount of information stored by a digital device or other physical system that exists in one of two possible distinct states...

, each GPU)
Total size (MiB) Clock (MHz) Single Precision(SP)
Single precision floating-point format
Single-precision floating-point format is a computer number format that occupies 4 bytes in computer memory and represents a wide dynamic range of values by using a floating point....

 Total(MUL+ADD+SF)
Single Precision(SP)
Single precision floating-point format
Single-precision floating-point format is a computer number format that occupies 4 bytes in computer memory and represents a wide dynamic range of values by using a floating point....

 MAD(MUL+ADD)
Double Precision(DP) FMA
GPU Computing
Processor1
C870 1 600 128 1350 76.8 GDDR3 384 1536 1600 518.4 345.6 0 1.0 170.9 Full-height video card
Video card
A video card, Graphics Card, or Graphics adapter is an expansion card which generates output images to a display. Most video cards offer various functions such as accelerated rendering of 3D scenes and 2D graphics, MPEG-2/MPEG-4 decoding, TV output, or the ability to connect multiple monitors...

Deskside Supercomputer1 D870 2 600 2 × 128 (256) 1350 153.6 GDDR3 384 3072 1600 1036.8 691.2 0 1.0 Deskside system
NVIDIA Quadro Plex
The Nvidia Quadro Plex is an external graphics processing unit designed for large-scale 3D visualizations. The system consists of a box containing a pair of high-end Nvidia Quadro graphics cards featuring a variety of external video connectors...

 or Rack unit
Rack unit
A rack unit or U is a unit of measure used to describe the height of equipment intended for mounting in a 19-inch rack or a 23-inch rack...

GPU Computing
Server1
S870 4 600 4 × 128 (512) 1350 307.2 GDDR3 384 6144 1600 2073.6 1382.4 0 1.0 1U Rack
19-inch rack
A 19-inch rack is a standardized frame or enclosure for mounting multiple equipment modules. Each module has a front panel that is wide, including edges or ears that protrude on each side which allow the module to be fastened to the rack frame with screws.-Overview and history:Equipment designed...

C1060
Computing Processor 2
C1060 1 602 240 1300 102.4 GDDR3 512 4096 1600 933.12 622.08 77.76 1.3 187.8 2 slot video card
S1075 1U
GPU Computing
Server3,4
S1070 4 602 4 × 240 (960) 1440 409.6 GDDR3 512 16384 1600 4147.2 2764.8 345.6 1.3 1U Rack
19-inch rack
A 19-inch rack is a standardized frame or enclosure for mounting multiple equipment modules. Each module has a front panel that is wide, including edges or ears that protrude on each side which allow the module to be fastened to the rack frame with screws.-Overview and history:Equipment designed...


IEEE 754-2008 capabilities
C2050/C2070
GPU Computing Processor
C2050/C2070 1 575 448 1150 144 GDDR5 384 3072/61445 3000 1288 1030.46 515.2 2.0 238/247 Full-height video card
Video card
A video card, Graphics Card, or Graphics adapter is an expansion card which generates output images to a display. Most video cards offer various functions such as accelerated rendering of 3D scenes and 2D graphics, MPEG-2/MPEG-4 decoding, TV output, or the ability to connect multiple monitors...


IEEE 754-2008 FMA capabilities
M2050
GPU Computing Module
M2050 1 575 448 1150 148.4 GDDR5 384 30725 3092 1288 1030.46 515.2 2.0 225 Computing Module
IEEE 754-2008 FMA capabilities
M2070/M2070Q
GPU Computing Module
M2070/M2070Q 1 575 448 1150 150.336 GDDR5 384 61445 3132 1288 1030.46 515.2 2.0 225 Computing Module
IEEE 754-2008 FMA capabilities
M2090
GPU Computing Module
M2090 1 650 512 1300 177 GDDR5 384 61445 1850 1331 ? 665 2.0 225 Computing Module
IEEE 754-2008 FMA capabilities
S2050 1U
GPU Computing
System
S2050 4 575 4 × 448 (1792) 1150 4 × 148.4 (593.6) GDDR5 384 122885 3092 5152 4121.66 2060.8 2.0 900 1U Rack
19-inch rack
A 19-inch rack is a standardized frame or enclosure for mounting multiple equipment modules. Each module has a front panel that is wide, including edges or ears that protrude on each side which allow the module to be fastened to the rack frame with screws.-Overview and history:Equipment designed...


IEEE 754-2008 FMA capabilities

Notes
  • 1 Specifications not specified by NVIDIA are assumed to be based on the GeForce 8800GTX
  • 2 Specifications not specified by NVIDIA are assumed to be based on the GeForce GTX 285
  • 3 A host system/server is required to connect to the 1U GPU computing server by the PCI Express
    PCI Express
    PCI Express , officially abbreviated as PCIe, is a computer expansion card standard designed to replace the older PCI, PCI-X, and AGP bus standards...

     card (similar set-up as the Nvidia Quadro Plex
    NVIDIA Quadro Plex
    The Nvidia Quadro Plex is an external graphics processing unit designed for large-scale 3D visualizations. The system consists of a box containing a pair of high-end Nvidia Quadro graphics cards featuring a variety of external video connectors...

    )
  • 4 Core architecture version according to the CUDA programming guide.
  • 5 With ECC on, a portion of the dedicated memory is used for ECC bits, so the available user memory is reduced by 12.5%. (e.g. 3 GB total memory yields 2.625 GB of user available memory.)
  • 6 Fermi implements the new fused multiply–add (FMA) instruction for both 32-bit single-precision and 64-bit double-precision floating point numbers (GT200 supported FMA only in double precision) that improves upon multiply-add by retaining full precision in the intermediate stage.
  • For the basic specifications of Tesla, refer to the GPU Computing Processor specifications.
  • Performance figures are for single-precision except where noted.
  • NVIDIA Tesla Supercomputers are also available with up to 8x Fermi GPUs from Manufacturers.

See also

  • Nvidia Tesla Personal Supercomputer
    Nvidia Tesla Personal Supercomputer
    The Tesla Personal Supercomputer is a desktop computer that is backed by Nvidia and built by Dell, Lenovo and other companies...

  • GeForce 8 series
    GeForce 8 Series
    The GeForce 8 Series, is the eighth generation of NVIDIA's GeForce line of graphics processing units. The third major GPU architecture developed at NVIDIA, the GeForce 8 represents the company's first unified shader architecture.-Naming:...

  • GeForce 200 Series
    GeForce 200 Series
    The GeForce 200 Series is the 10th generation of Nvidia's GeForce graphics processing units. The series also represents the continuation of the company's unified shader architecture introduced with the GeForce 8 Series and the GeForce 9 Series. Its primary competition came from ATI's Radeon HD 4000...

  • GeForce 400 Series
    GeForce 400 Series
    The GeForce 400 Series is the 11th generation of Nvidia's GeForce graphics processing units. The series was originally slated for production in November 2009, but, after a number of delays, launched on March 26, 2010 with availability following in April 2010....

  • GeForce 500 Series
    GeForce 500 Series
    The GeForce 500 Series is a family of graphics processing units developed by Nvidia, based on the refreshed Fermi architecture. Nvidia officially announced the GeForce 500 series on 9 November 2010 with the launch of the GeForce GTX 580.- Overview :...

  • CUDA
    CUDA
    CUDA or Compute Unified Device Architecture is a parallel computing architecture developed by Nvidia. CUDA is the computing engine in Nvidia graphics processing units that is accessible to software developers through variants of industry standard programming languages...

  • GPGPU
    GPGPU
    General-purpose computing on graphics processing units is the technique of using a GPU, which typically handles computation only for computer graphics, to perform computation in applications traditionally handled by the CPU...

  • OpenCL
    OpenCL
    OpenCL is a framework for writing programs that execute across heterogeneous platforms consisting of CPUs, GPUs, and other processors. OpenCL includes a language for writing kernels , plus APIs that are used to define and then control the platforms...

  • Stream Processing
    Stream processing
    Stream processing is a computer programming paradigm, related to SIMD , that allows some applications to more easily exploit a limited form of parallel processing...


External links

NVIDIA Product Overview and Technical Brief
The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK