Algorithm Engineering: "Parallel Algorithms"

Stefan Edelkamp

Overview

Parallel External Search: parallel delayed duplicate detection, parallel expansion, distributed sorting

Parallel Structured Duplicate Detection: disjoint duplicate-detection scopes, "locks"

Parallel Algorithms: matrix multiplication, list ranking, Euler tour

Distributed Search

A distributed setting provides more space. Experiments show that internal time dominates.

Exploiting Independence

Since the states in a bucket are independent of one another, they can be expanded in parallel.

Duplicate removal can be distributed across different processors.

Bulk (streamed) transfers are much better than single ones.
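A minimal sketch of this idea, assuming hashable states and a user-supplied `successors` function (both hypothetical, as are the names `expand_bucket` and `workers`). Each worker expands its share of the bucket and hands back its results in bulk; the generated states are then partitioned by hash value so that duplicate removal is also split among the workers. Threads stand in for processors here.

```python
from concurrent.futures import ThreadPoolExecutor


def expand_bucket(states, successors, workers=4):
    """Expand one bucket in parallel, then remove duplicates per hash partition."""
    # Independent states: give every worker its own share of the bucket.
    chunks = [states[i::workers] for i in range(workers)]

    def expand(chunk):
        out = []
        for s in chunk:
            out.extend(successors(s))
        return out  # returned as one bulk list, not state by state

    with ThreadPoolExecutor(max_workers=workers) as pool:
        generated = [t for part in pool.map(expand, chunks) for t in part]

        # Partition by hash value: equal states always land in the same partition,
        # so each partition can be deduplicated independently.
        partitions = [[] for _ in range(workers)]
        for t in generated:
            partitions[hash(t) % workers].append(t)

        deduped = list(pool.map(lambda p: sorted(set(p)), partitions))

    return [t for part in deduped for t in part]
```

For example, `expand_bucket(list(range(10)), lambda s: [s + 1, s + 2])` yields each of the states 1 to 11 exactly once, in partition order rather than globally sorted.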

Distributed Queue for Parallel Best-First Search

Queue entries: <g, h, start byte, size>

<15, 34, 0, 100>

<15, 34, 20, 100> (TOP)

<15, 34, 40, 100>

<15, 34, 60, 100>

Beware of the mutual exclusion problem!
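The mutual exclusion problem can be illustrated with a small sketch, assuming the queue is a shared list of `(g, h, start_byte, size)` entries and that a single lock protects the TOP position. The class `ChunkQueue` and the helper `drain` are hypothetical names, and threads stand in for the processors.

```python
import threading


class ChunkQueue:
    """Shared queue of (g, h, start_byte, size) entries with a protected TOP index."""

    def __init__(self, entries):
        self._entries = list(entries)
        self._top = 0
        self._lock = threading.Lock()

    def pop(self):
        # Without the lock, two workers could read the same TOP entry.
        with self._lock:
            if self._top == len(self._entries):
                return None
            entry = self._entries[self._top]
            self._top += 1
            return entry


def drain(queue, n_workers=4):
    """Let n_workers threads take entries until the queue is empty."""
    taken = [[] for _ in range(n_workers)]

    def work(i):
        while True:
            entry = queue.pop()
            if entry is None:
                break
            taken[i].append(entry)

    threads = [threading.Thread(target=work, args=(i,)) for i in range(n_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return taken
```

Every entry is handed to exactly one worker; which worker gets which entry depends on scheduling.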

Multiple Processors - Multiple Disks Variant

[Figure: processors P1 to P4 each hold buffers sorted w.r.t. the hash value and write sorted files. The sorted files are divided w.r.t. the hash ranges h0 ... hk-1, hk ... hl-1, and the sorted buffers from every processor are combined into one sorted file.]
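The scheme in the figure can be sketched as follows, assuming hash values are plain integers, the range boundaries are given, and lists stand in for files on disk. The function names `split_by_ranges` and `distributed_sort` are hypothetical, and dropping duplicates during the merge is an assumption of this sketch.

```python
import heapq
from bisect import bisect_left


def split_by_ranges(sorted_buf, bounds):
    """Cut one sorted buffer into len(bounds) + 1 pieces at the hash-range boundaries."""
    cuts = [0] + [bisect_left(sorted_buf, b) for b in bounds] + [len(sorted_buf)]
    return [sorted_buf[cuts[i]:cuts[i + 1]] for i in range(len(cuts) - 1)]


def distributed_sort(buffers, bounds):
    """Return one sorted, duplicate-free file per hash range."""
    # Step 1: every processor sorts its own buffer w.r.t. the hash value.
    sorted_bufs = [sorted(b) for b in buffers]
    # Step 2: every sorted buffer is divided w.r.t. the hash ranges.
    pieces = [split_by_ranges(b, bounds) for b in sorted_bufs]
    # Step 3: range j collects piece j from every processor and merges them.
    files = []
    for j in range(len(bounds) + 1):
        out = []
        for h in heapq.merge(*(p[j] for p in pieces)):
            if not out or out[-1] != h:  # duplicates are adjacent after merging
                out.append(h)
        files.append(out)
    return files
```

Because the ranges are disjoint and ordered, concatenating the resulting files gives the globally sorted sequence, and each range can be merged by a different processor.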

Parallel External A*

Distributed Heuristic Evaluation

Assume one child processor for each tile and one master processor.

[Figure: 4x4 grid with cells B0 to B15, shown twice]

Distributed Pattern Database Search

Only pattern databases (PDBs) that include the client's tile need to be loaded on the client.

Because a pattern contains multiple tiles, from a bird's-eye view each PDB is loaded multiple times.

In the 15-Puzzle with corner and fringe PDBs, this saves RAM on the order of a factor of 2 on each machine, compared to loading all PDBs.

In the 36-Puzzle with 6-tile pattern databases, this saves RAM on the order of a factor of 6 on each machine, compared to loading all PDBs.

Extends to additive pattern databases
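A minimal sketch of the selection rule, assuming each PDB is described by the set of tiles in its pattern. The pattern names and tile sets in `PATTERNS` are hypothetical and are not the corner, fringe, or 6-tile databases mentioned above; `pdbs_for_client` and `clients_loading` are illustrative helper names.

```python
# Hypothetical disjoint patterns over the tiles 1..15 (illustration only).
PATTERNS = {
    "P1": {1, 2, 3, 4, 5},
    "P2": {6, 7, 8, 9, 10},
    "P3": {11, 12, 13, 14, 15},
}


def pdbs_for_client(tile, patterns):
    """Names of the PDBs that the client responsible for `tile` has to load."""
    return sorted(name for name, tiles in patterns.items() if tile in tiles)


def clients_loading(name, patterns):
    """Clients (identified by tile) that load the PDB `name`."""
    return sorted(patterns[name])


# The client for tile 7 loads one of three PDBs instead of all of them.
print(pdbs_for_client(7, PATTERNS))
# From a bird's-eye view, P2 is loaded by every client whose tile it contains.
print(clients_loading("P2", PATTERNS))
```

With these three equally sized patterns, each client holds one third of the total PDB data, while every PDB resides on five clients at once.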

Distributed Heuristic Evaluation

Same bottleneck in external-memory search

Bottleneck: duplicate detection. Duplicate paths cause parallelization overhead.

[Figure: internal memory and external memory]

Disjoint duplicate-detection scopes

[Figure: 4x4 grid of abstract nodes B0 to B15 with the highlighted groups {B0, B1, B4}, {B2, B3, B7}, {B8, B12, B13}, and {B11, B14, B15}]

Finding disjoint duplicate-detection scopes

[Figure: abstract nodes annotated with counters (all 0); highlighted groups {B0, B1, B4}, {B2, B3, B7}, {B8, B12, B13}, {B11, B14, B15}, and {B1, B5, B6}]
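One simple way to find such scopes is a greedy pass, sketched here under stated assumptions: the abstract graph is the 4x4 grid B0 to B15 with 4-neighborhood, and the duplicate-detection scope of an abstract node is the set of its neighboring abstract nodes. The greedy selection is an illustration, not necessarily the procedure used in the lecture, and `scope` and `greedy_disjoint` are hypothetical names.

```python
def scope(b, n=4):
    """Duplicate-detection scope of abstract node b: its neighbors in the n x n grid."""
    r, c = divmod(b, n)
    out = set()
    if r > 0:
        out.add(b - n)
    if r < n - 1:
        out.add(b + n)
    if c > 0:
        out.add(b - 1)
    if c < n - 1:
        out.add(b + 1)
    return out


def greedy_disjoint(candidates, n=4):
    """Pick abstract nodes, in the given order, whose scopes are pairwise disjoint."""
    used = set()    # union of the scopes picked so far
    chosen = []
    for b in candidates:
        s = scope(b, n)
        if s.isdisjoint(used):
            chosen.append(b)
            used |= s
    return chosen
```

For the four corners, `greedy_disjoint([0, 3, 12, 15])` keeps all of them, since their scopes {1, 4}, {2, 7}, {8, 13}, and {11, 14} do not overlap. By contrast, `greedy_disjoint([0, 5])` keeps only node 0, because the scope of node 5 also contains 1 and 4.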

Implementation of Parallel SDD

Hierarchical organization of hash tables: one hash table for each abstract node; top-level hash function = state-space projection function.

Shared-memory management: minimum memory-allocation size m; memory wasted is bounded by O(m · #processors).

External-memory version: I/O-efficient order of node expansions; I/O-efficient replacement strategy.

Requires only one mutex ("lock").
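A minimal sketch of the first and last point, assuming states are tuples with 0 as the blank and that the projection function returns the blank position. The class `ParallelSDDTables` and its methods are hypothetical. The single lock guards only the bookkeeping of which scopes are in use; insertion into a per-node hash table is lock-free and assumes the caller holds a scope that contains that node.

```python
import threading


class ParallelSDDTables:
    """One hash table per abstract node, plus a single mutex for scope bookkeeping."""

    def __init__(self, project, scopes):
        self.project = project                 # top-level hash function (projection)
        self.scopes = scopes                   # abstract node -> set of abstract nodes
        self.tables = {b: set() for b in scopes}
        self.in_use = set()                    # abstract nodes inside an acquired scope
        self.lock = threading.Lock()           # the only mutex

    def try_acquire(self, b):
        """Reserve the scope of b if it is disjoint from all scopes in use."""
        with self.lock:
            if self.scopes[b] & self.in_use:
                return False
            self.in_use |= self.scopes[b]
            return True

    def release(self, b):
        with self.lock:
            self.in_use -= self.scopes[b]

    def insert(self, state):
        """Insert without locking; returns False if the state is a duplicate."""
        table = self.tables[self.project(state)]
        if state in table:
            return False
        table.add(state)
        return True
```

A worker would call `try_acquire(b)`, expand the nodes of `b` while inserting successors with `insert`, and finally call `release(b)`.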

Parallel Matrix Multiplication

Exclusive Write

Parallel Copies

Conclusion: Matrix Multiplication

Parallel List Ranking

List Ranking

First Algorithm

Principle

Complexity

Improvements

Strategy

Independent Sets

2-Coloring

Reduction

Restoration

Example

Variables

Example (ctd.)

Pseudocode

Next Step

Analysis

Backup

Memory

Analysis

Outlook: Randomized in O(n) w.h.p.?
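List ranking assigns every list element its distance to the end of the list. A minimal sketch of pointer jumping, a standard parallel technique for this problem, is given here under the assumption that the list is stored as a successor array in which the tail points to itself. Whether this matches the first algorithm of the lecture is an assumption. Each round is simulated sequentially; on a parallel machine all elements would be updated at once.

```python
def list_rank(succ):
    """Rank every element by pointer jumping; succ[i] == i marks the tail."""
    n = len(succ)
    rank = [0 if succ[i] == i else 1 for i in range(n)]
    nxt = list(succ)
    # After k rounds every pointer spans up to 2**k links, so about log2(n) rounds suffice.
    rounds = max(1, (n - 1).bit_length())
    for _ in range(rounds):
        # Both arrays are rebuilt from the old values, as in a synchronous parallel step.
        new_rank = [rank[i] + rank[nxt[i]] for i in range(n)]
        new_nxt = [nxt[nxt[i]] for i in range(n)]
        rank, nxt = new_rank, new_nxt
    return rank
```

For the list 3 → 0 → 2 → 1, stored as `succ = [2, 1, 1, 0]`, the call `list_rank(succ)` gives `[2, 0, 1, 3]`. With n processors this takes O(log n) rounds but O(n log n) total work.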

Problems with DFS

Idea: Euler Tour

Parallel DFS

DFS Numbers

In General

Example

One Cycle or Several?

Correctness

Example

Construction of the Euler Tour

Conclusion: Euler Tours

GPU Architecture

Effectiveness

Hierarchical Memory

Hash-based Partitioning

Kernel Functions
