25
Silicon Graphics, Inc. Linux64 im CAE Umfeld Linux64 im CAE Umfeld Josef Hellauer [email protected] 3. LS 3. LS - - DYNA Forum DYNA Forum 14. 14. - - 15. Oktober 2004 in Bamberg 15. Oktober 2004 in Bamberg

Linux64 im CAE Umfeld - DYNAmore GmbH

  • Upload
    others

  • View
    6

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Linux64 im CAE Umfeld - DYNAmore GmbH

Silicon Graphics, Inc.

Linux64 im CAE UmfeldLinux64 im CAE Umfeld

Josef [email protected]

3. LS3. LS--DYNA ForumDYNA Forum

14.14.--15. Oktober 2004 in Bamberg15. Oktober 2004 in Bamberg

Page 2: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 2LS-DYNA Forum, Bamberg

AgendaAgenda

• HPC im CAE-Bereich

• Systemanforderung CAE

• Linux64-Server SGI Altix

Page 3: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 3LS-DYNA Forum, Bamberg

CAE Application Segments CAE Application Segments in Automotive 2003in Automotive 2003

CFD - Computational Fluid DynamicsNHV - Noise, Vibration, HarshnessMFG Processes - Stamping, Casting, Forging

-Crash is the application segment #1

-The fastest growing application is CFD, won 3% shares from 2002

-NVH is the third application segment. However, it is the most demanding in terms of memory and IO bandwidth.

-10 CAE Applications drive 89% of the installed compute power

Crash58%

Mfg Processes2%

NVH12%

Structural8%

Others0%

CFD20%

Source: Top20Auto, Ch.Tanasescu, SC’2003, www.top500.orghttp://www.top500.org/lists/2003/11/Top20Auto_Top500V2.pdf

Page 4: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 4LS-DYNA Forum, Bamberg

Key CAE Applications in AutomotiveKey CAE Applications in Automotive

1160

0

7060

6940

4861

2974

2943

2429

537

528

3068

0

2000

4000

6000

8000

10000

12000

LS-Dyna

Pam-Cras

hNast

ranStar

CD

Fluent

Radioss

AbaqusPowerf

low Fire

Vectis

Inst

alle

d GF

lops

20022003

Other applications: Marc, Ansys, Madymo, AMLS, Pam-Stamp, Magma, Autoform, Permas

79% growth

Source: Top20Auto, Ch.Tanasescu, SC’2003, www.top500.orghttp://www.top500.org/lists/2003/11/Top20Auto_Top500V2.pdf

Fire1,2%

LS-Dyna23,3%

Nastran15,8%

Pam-Crash19,4%

Powerflow6,3%

Abaqus7,0%

Fluent7,4%

StarCD11,5%

Radioss7,1%

Vectis1,1%

LS-DynaNastranPam-CrashRadiossStarCDFluentAbaqusPowerflowFireVectis

Page 5: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 5LS-DYNA Forum, Bamberg

SMP35%

CONSTELLATION27%

CLUSTER33%

MPP5%

System Architectures in AutomotiveSystem Architectures in AutomotiveWorld Wide World Wide -- 20032003

NotesCONSTELLATION - cluster of SMP nodesCLUSTER - cluster of IA32 (1 or 2p)

Ex: SGI Altix 3000, IBM p690, HP Superdome, NEC SX-6,SGI Origin3000

Ex: Cluster of SMP nodes likeSGI Altix 3000, IBM p690, CPQ E45,HP Superdome, SGI Origin3000

Ex: IBM SP3,Fujitsu VPP5000

Ex: Cluster of IA32 ( 1-2p per node) 62% are SMP based systems

Source: Top20Auto, Ch.Tanasescu, SC’2003, www.top500.orghttp://www.top500.org/lists/2003/11/Top20Auto_Top500V2.pdf

Page 6: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 6LS-DYNA Forum, Bamberg

A History of Innovation of SGIA History of Innovation of SGI

R

20011982 1984 1988 1994 1995 1996 20042003

Power Series™, multi-processing systems provide compute power for high-end graphics applications

NASA Ames and Altix® set world record for STREAMS benchmark

First 512p Altix cluster drives ocean research at NASA Ames

SGI introduces its first 64-bit operating system

First systems deployed in Stephen Hawking’s COSMOS system

First generation NUMA System: Origin® 2000

1997

Dockside engineering analysis on Origin®

2000 and Indigo2™ helps Team New Zealand win America’s Cup

Altix®, first scalable 64-bit Linux® Server

1998

DOE deploys 6144p Origin 2000 to monitor and simulate nuclear stockpile

Challenge® XL media server fuels Steven Spielberg’s Shoahproject to document Holocaust survivor stories

Introduced modularNUMAflex™architecturewith Origin® 3000

Jim Clarkfounded SGI onthe vision ofComputerVisualization

IRIS® Workstations become first integrated 3D graphics systems

Images courtesy of Team New Zealand and the University of Cambridge

Page 7: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 7LS-DYNA Forum, Bamberg

SGI Family of Scalable LinuxSGI Family of Scalable Linux®®

SolutionsSolutions

Other vendors’ Linux® Solutions

Image courtesy: NASA Ames

SGI® Altix® 3000Servers and Superclusters

SGI® Altix® 350Servers and Clusters

Desktops Edge servers Mid-range Divisional CapabilityDesktops Edge servers

AA

Page 8: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 8LS-DYNA Forum, Bamberg

System Interconnect

CPU & Memory

Storage Expansion

System I/O

High-Bandwidth I/O Expansion

Standard I/O Expansion

Graphics Expansion

Scalability: No central bus or switch; just modules and NUMAlink™ cables

Scalability: No central bus or switch; just modules and NUMAlink™ cables

Investment protection: Add new technologies as they evolve

Investment protection: Add new technologies as they evolve

Flexibility: Tailored configurations for different dimensions of scalability

Flexibility: Tailored configurations for different dimensions of scalability

Performance: High-bandwidth interconnect with very low latency

Performance: High-bandwidth interconnect with very low latency

Modular SGIModular SGI®® NUMAflexNUMAflex™™ArchitectureArchitecture

Page 9: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 9LS-DYNA Forum, Bamberg

Silicon Graphics PrismSilicon Graphics Prism™™Visualization System Visualization System Innovation that Innovation that SCALESSCALES

“Power”Power User Visualization

$2 - 4 GPUs, 2 - 8 CPUs

Delivers productivity

“Team”Small-Group Visualization

$75k - $250k4-8 GPUs, 8-16 CPUs

Delivers breakthroughs

“Extreme”Ultimate Visualization

Capability

$200k - $2,000k8-16 GPUs, 16-32 CPUs

Delivers ultimate advantage

Image courtesy of Laboratory for Atomospheres, NASA Goddard Space Flight Center

Page 10: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 10LS-DYNA Forum, Bamberg

ComposItor

Network

ScalableGraphics

ScalableCompute

Globally Addressable Memory

Scalable Software DevelopmentScene Graphs, Toolkits and Transparent Scaling

Large DataSets

Architecture Makes A Difference

Silicon Graphics PrismSilicon Graphics Prism™™ Visualization Visualization System System SGI® NUMAflex™ independently scales all resources

Page 11: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 11LS-DYNA Forum, Bamberg

SGISGI AltixAltix 3700 and 350 Servers3700 and 350 ServersWith Global Addressable MemoryWith Global Addressable Memory

Partition 2Partition 1

• Supercomputer, supercluster configurations• SSI up tp 512 processors,• Supercluster up to 2048p via NUMAlink• Up to 8TB of shared memory• 6.4GB/sec dual plane fat tree → 12.8GB/sec with NUMAlink 4 router• 1.3GHz / 3.0MB to 1.6GHz / 9.0MB Intel Itanium2 processors• Infinitely scalable using commercial interconnects

Altix™ 350

Altix™ 3700

Image courtesy: NASA Ames

Page 12: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 12LS-DYNA Forum, Bamberg

InfinibandInfiniband®®/Altix 350 Clusters/Altix 350 Clusters• Competitive advantages:

– large host size (16 P)– SGI ProPack software features and support– MPT MPI

InfinibandSwitch

Page 13: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 13LS-DYNA Forum, Bamberg

SGI ProPack™ HPC Value-Add Enhancements Optimized high-productivity computing

System management: Partitioning (Altix 3000), Performance Co-Pilot

Resource management: CPU sets/memory placement, MPT, array services, SCSL math libraries

Data management: XVM

SGI® Open-Source Enhancements Enabling features and functionality

Contributing SGI expertise in scalability, NUMA

XFS® high-performance filesystem

Standard Linux® Distribution Runs standard 64-bit Linux applications

Red Hat® Enterprise Linux® AS 3.0* compatible for highest performance

Certified SUSE Linux 9 option for highest certified support levels

Easy to develop and administer

Intel® compilers and tools

SGI Production Linux Solution StackSGI Production Linux Solution Stack

Page 14: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 14LS-DYNA Forum, Bamberg

HPC Resource Demands for MCAEHPC Resource Demands for MCAE

MCAE Segment

IFEAStatics

IFEADynamics

EFEA

CFDUnstructured

CFDStructured

Software

ABAQUS®

ANSYS®

MSC.NASTRAN™

ABAQUS®

ANSYS®

MSC.NASTRAN™

LSLS--DYNADYNA®®

….…..

FLUENT®

STAR-CD™PowerFLOW®

FLOWEROVERFLOW

System Resource Benefits/Requirements

Generally Five Types of MCAE Computational Behavior to Consider::

CPU Memory BW I/O Bandwidth Latency Scalability

H H M L L < 10p

M H H H L < 10p

HH M M LL MM HH ~ 64p~ 64p

M H M H H ~ 100p

H H L M M > 100p

Page 15: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 15LS-DYNA Forum, Bamberg

LDLD--DynaDyna mpp970mpp970Communication MatrixCommunication Matrix

90% or messages shorter than 3kbyte , Latency is important for scalability

0 2 4 6 8 10 12 14 16 18 20 22 24 26 28

0

2

4

6

8

10

12

14

16

18

20

22

24

26

28

Sender

Rec

eive

r

LS-Dyna Profile for 30 CPUs - Total MBytes

25-3020-2515-2010-155-100-5

1D

2D

3D

Page 16: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 16LS-DYNA Forum, Bamberg

Interconnect Contention depends on Interconnect Contention depends on TopologyTopologyMPI Communication trace

concurrent communicationslink saturation ?

Communication time slices

Topo

ATo

po B

InterconnectionTopology

BW --

BW ++

Page 17: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 17LS-DYNA Forum, Bamberg

S h a r e d M e m o r y

Total system bisection bandwidth actually scales as system grows!

C-BrickC-BrickC-BrickC-BrickC-BrickC-BrickC-BrickC-Brick

C-BrickC-BrickC-BrickC-Brick

C-BrickC-BrickC-BrickC-Brick

Bisection Bandwidth

C-BrickC-BrickC-BrickC-Brick

C-BrickC-BrickC-BrickC-Brick

C-BrickC-BrickC-BrickC-Brick

C-BrickC-BrickC-BrickC-Brick

16 Processor System32 Processor System64 Processor System128-Processor System

Scalable CPU/Memory Interconnect Scalable CPU/Memory Interconnect FabricFabric

Scalable Compute

andLarge

Memory

Scalable Compute

Page 18: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 18LS-DYNA Forum, Bamberg

Skalierbare Leistung im VergleichSkalierbare Leistung im Vergleich

sec hours sec hours4 56479 15:41:19 1.00 48611 13:30:11 1.00 1.168 30461 8:27:41 1.85 25463 7:04:23 1.91 1.2016 18157 5:02:37 3.11 13012 3:36:52 3.74 1.4032 14610 4:03:30 3.87 7220 2:00:20 6.73 2.0264 13162 3:39:22 4.29 4456 1:14:16 10.9 2.95

CPUsElapsed Time Scalability Elapsed Time PCcluster/ AltixScalability

PC Cluster 3.06GHz Altix 1.3GHz

0

10000

20000

30000

40000

50000

60000

CPUs

Elap

sed

Tim

e (s

ec)

PC Cluster 3.06GHz Altix 1.3GHz

4 8 16 32 64 1.00

3.00

5.00

7.00

9.00

11.00

13.00

4 24 44 64CPUs

Scal

abilit

y

PC Cluster 3.06GHz Altix 1.3GHz

Page 19: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 19LS-DYNA Forum, Bamberg

Beispiel: LSBeispiel: LS--DYNA DYNA StampingStamping> 800.000 > 800.000 ElementeSimulation Time = 21 msec

16,713,8

11,8

22,8

10,1

7,9

6,8

0 4 8 12 16 20 24Elapsed Times Total [h]

8

16

CPU

s

Altix 3700, 1500MHz, MPP970

Altix 3700, 1300MHz, MPP970

XX Itanium2, 1.5 GHz

Athlon, 2.2 GHz, Myrinet

Page 20: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 20LS-DYNA Forum, Bamberg

Performance Relation vs. [email protected], Athlon

2,3

1,9

3,3

1,6

2,9

1,41,0

1,0

1,5

2,0

2,5

3,0

3,5

4 8 12 16

Altix 3700, 1500MHz, MPP970

Altix 3700, 1300MHz, MPP970

XX Itanium2, 1.5 GHz

Athlon, 2.2 GHz, Myrinet

Beispiel: LSBeispiel: LS--DYNA DYNA StampingStamping> 800.000 > 800.000 ElementeSimulation Time = 21 msec

Page 21: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 22LS-DYNA Forum, Bamberg

SGISGI®® AltixAltix™™ und und ccNUMA ccNUMA im CAEim CAE

Der 64-bit Adressraum der Itanium®2 CPUs erlaubt CAE-Anwendungen mit Speicherbedarf >> 4 GBSpeicherbedarf >> 4 GB• kurzfristige Erweiterung (höhere Diskretisierungen, größere Modelle) des

virtuellen und physikalischen Speichers ohne Einschränkungen und Systemwechsel (HW+SW),

• die performante Nutzung von In-Core-Solver >> 4 GB

• das Testen und Debuggen von „Problem-Modellen“ auf 1 CPU im SMP und MPP-Modus:

Hohe I/O-Bandbreiten + SGI-Filesystem XFS beschleunigen Out-of-Core Solver im NVH-Bereich.

Numaflex + optimierte OS-System Funktionen ergeben beste Durchsatzleistungen (s. SPECfp_rate2000)

Page 22: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 23LS-DYNA Forum, Bamberg

New Discoveries with NASA ECCONew Discoveries with NASA ECCOEstimating the Circulation and Climate of the Estimating the Circulation and Climate of the OceanOcean

The Challenge• Assimilate real-time

satellite data directly into into global ocean circulation models

• Drive resolution from today’s .25 degree (~25km) to .1 degree

The Solution• 512p Altix®, 28TB

storage• Reducing typical

ocean simulation times from months to 2-3 days

• World Record STREAM Triad benchmark result, first system to break 1,000 GB/sec

AA

Page 23: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 24LS-DYNA Forum, Bamberg

Optimizing Decisions at Optimizing Decisions at ŠŠkoda Autokoda Auto

The Solution• Deployed 96p Altix®

3700 cluster, 16TB InfiniteStorage disk array

• Reduced processing times allow Skoda to optimize decisions around car development and design efficiency, quality and safety

Škoda AutoMladá Boleslav, Czech Republic

The ChallengeReduce processing times for crash analysis and fluid dynamics analysis

Images courtesy of Škoda Auto

AA

Page 24: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 25LS-DYNA Forum, Bamberg

AltixAltix®®: Breakthrough Deployments: Breakthrough DeploymentsBreakthrough

applicationsChemistry and BiologyGaussian, Amber, Blast, Charm, Accelrys and more

ManufacturingLS-Dyna, Abaqus, Star-CD, NASTRAN, Ansys and more

Physical SciencesModeling of oceans, weather, earth, astronomy, energy, and more

Oak Ridge National Lab, University of Minnesota Supercomputing Institute, National Cancer Institute, many universities

DaimlerChrysler,VW, Skoda, many more

COSMOS, NASA, University of Queensland, Total, Japan Institute of Statistical Mathematics, many more

Large shared memory Oak Ridge National Labs (2TB), Japan Institute of Statistics and Mathematics (1.96TB), NRL (1TB), and others

NASA Ames (512p), NRL (384p), SARA (416p), Total (552p), and others

COSMOS, Netherlands National Supercomputing Facility at SARA, and others

Wichita State University, BP, and others

Total, NASA Ames, NRL, and othersProven performance

Large processor scalability

Grid computing

Hybrid solutions

Early adopters expanding their deployments

Images courtesy of the University of Cambridge, Molecular Simulations Inc., Skoda Auto, and NASA Ames

Breakthrough technologies

Image courtesy: NASA Ames

Page 25: Linux64 im CAE Umfeld - DYNAmore GmbH

14. - 15. Oktober 2004 26LS-DYNA Forum, Bamberg

• Leistungsfähigstes Linux® Computersystem der Welt• Intel® Itanium®2 CPUs mit SGI® NUMAflex™ Architektur• Single System Image mit bis zu 512 CPUs• Shared-Memory Skalierbarkeit bis zu 8TB • MPI mit Shared Memory Features• Linux Erweiterungen für HPC Workflow optimiert• Höchste Bandbreite und kürzeste Latenzzeiten • Standard Linux Distribution und Entwicklungsplatform• Differenzierte “Middleware” und Treiber• Hohe Verfügbarkeit von 64bit Anwendungen• Integration von skalierbarer Graphik möglich

Zusammenfassung Zusammenfassung