Large Scale Computing for Semantic Web Technologies

  • View
    477

  • Download
    1

Embed Size (px)

Text of Large Scale Computing for Semantic Web Technologies

  • Large Scale Processing for Semantic Web Technologies

    SeminarDr. Harald Sack / Dr. Peter Trger

    Jrg Waitelonis / Magnus Knuth / Nadine LudwigHasso-Plattner-Institut fr Softwaresystemtechnik

    Universitt PotsdamWintersemester 2010/11

    Die nichtkommerzielle Vervielfltigung, Verbreitung und Bearbeitung dieser Folien ist zulssig (Lizenzbestimmungen CC-BY-NC).

    Dienstag, 19. Oktober 2010

    http://creativecommons.org/licenses/by-nc/3.0/deed.dehttp://creativecommons.org/licenses/by-nc/3.0/deed.de

  • 1. Dozenten / Tutoren

    2. Semantic Web und Linked Data

    3. Large Scale Processing im FutureSOC Lab

    4. Administratives

    Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    2

    Large Scale Processing for Semantic Web Technologies

    Dienstag, 19. Oktober 2010

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    Dr. Harald Sack seit 1.1.2009 Senior Researcher am HPI und Leiter

    der Forschungsgruppe ,Semantische Technologien

    Forschungsschwerpunkte:

    Semantic Web Technologien

    Multimedia Retrieval

    Wissensreprsentation

    Videosuchmaschine yovisto.com

    3

    Large Scale Processing for Semantic Web TechnologiesDozenten / Tutoren

    Dienstag, 19. Oktober 2010

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    Dr. Peter Trger

    Seit Februar 2010 Senior Researcher am HPIim Bereich Verlssliche Many-Core Systeme

    Forschungsschwerpunkte:

    Verlssliche Systeme, Fehlervorhersage

    Skalierbare Programmierung paralleler Systeme

    Intel Single Chip Cloud Computer (SCC)

    CiteMaster.net

    4

    Large Scale Processing for Semantic Web TechnologiesDozenten / Tutoren

    MC0

    MC1

    MC2

    MC3

    System InterfaceVRC

    Router

    IA-32 Core0

    L2$0256KB

    L2$1256KB

    IA-32 Core1

    MPB16KB

    Router Tile

    2 core clusters in 6x4 2-D mesh

    16B

    Dienstag, 19. Oktober 2010

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    Dipl. Inform. Jrg Waitelonis

    Studium Informatik Uni-Jena bis 2006

    2006-2007 Exist-Seed Projekt Osotis

    seit 2007 Grnder von yovisto.com

    Entwickler von REPLAY (ETH-Zrich)

    Forschung: Semantic Web, Multimedia-Retrieval, Suchmaschinen Technologien

    5

    Large Scale Processing for Semantic Web TechnologiesDozenten / Tutoren

    Dienstag, 19. Oktober 2010

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    Dipl.-Inf. Nadine Ludwig

    Studium Informatik TU Ilmenau bis 2005

    2005-2010 TU Berlin:

    kooperative Lernszenarien

    Integration von Semantic Web Technologien in kooperative Lernplattformen

    seit 05/2010 HPI:

    Semantische Analyse, Entity Mapping, Disambiguierung

    6

    Large Scale Processing for Semantic Web TechnologiesDozenten / Tutoren

    Dienstag, 19. Oktober 2010

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    Dipl. Inform. Magnus Knuth

    Studium Informatik Uni Leipzig bis 2007

    2007-2010 Institut fr Medizinische Informatik, Statistik und Epidemiologie Leipzig

    Forschung: Semantic Web, Multimedia-Retrieval, Suchmaschinen Technologien

    7

    Large Scale Processing for Semantic Web TechnologiesDozenten / Tutoren

    Dienstag, 19. Oktober 2010

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    8

    Large Scale Processing for Semantic Web TechnologiesDozenten / Tutoren

    Bereitstellung der wissenschaftlichen Prsentation im Internet

    yovisto.com Videosuchmaschine mit dem

    Schwerpunkt akademischer Lehrveranstaltungen

    aktuell mehr als 10.000 Vorlesungen und wissenschaftliche Vortrge aus der ganzen Welt

    automatische Segmentierung und Videoanalyse

    benutzergenerierte Co-Annotation

    Social Tagging Diskussionen Rezensionen Wikis Lernmaterialien

    Zielgenauer Zugriff auf gesuchte Videoinhalte

    www.yovisto.com

    Dienstag, 19. Oktober 2010

    http://www.yovisto.comhttp://www.yovisto.com

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    9

    THESEUS Forschungsprogramm: Neue internetbasierte Wissensinfrastruktur.

    UseCase Contentus: Technologien fr die Mediathek der Zukunft.

    Projekt Mediaglobe: Effizientes Arbeiten mit Mediadaten in Medienarchiven und Rundfunkanstalten.

    effiziente Suche nach/in AV-Inhalten in Medienarchiven und Rundfunkanstalten

    Arbeitsprozesslsung fr die effiziente Erfassung, Aufbereitung und Verwertung von AV-Inhalten

    Large Scale Processing for Semantic Web TechnologiesDozenten / Tutoren

    Dienstag, 19. Oktober 2010

  • 1. Dozenten / Tutoren

    2. Semantic Web und Linked Data

    3. Large Scale Processing im FutureSOC Lab

    4. Administratives

    Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    10

    Large Scale Processing for Semantic Web Technologies

    Dienstag, 19. Oktober 2010

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    11

    The Web is huge....

    To be more precise, the WWW is rather huge...more than 25 x 109 documents in

    Search engine indexes (TNL Blog: Google has 24 billion items index, considers MSN search nearest competitor, September 2005)

    Google Web Crawler found more than 1012 documents(The Official Google Blog: We knew the Web was Big....., Juli 25, 2008)

    New Google Search Index Caffeine comprises 100 Million Gigabytes of datai.e. 1017 Byte (SMX Video: Googles Matt Cutts On Caffeine Launch, June 9, 2010,http://searchengineland.com/smx-video-googles-matt-cutts-on-caffeine-launch-43933)

    And then, there is also the DeepWeb (Darkweb) ...and it is supposed to be up to 500 time larger than the Surface Web(Bergman, 2001)

    Dienstag, 19. Oktober 2010

    http://www.tnl.net/blog/2005/09/27/google-has-24-billion-items-index-considers-msn-search-nearest-competitor/http://www.tnl.net/blog/2005/09/27/google-has-24-billion-items-index-considers-msn-search-nearest-competitor/http://www.tnl.net/blog/2005/09/27/google-has-24-billion-items-index-considers-msn-search-nearest-competitor/http://www.tnl.net/blog/2005/09/27/google-has-24-billion-items-index-considers-msn-search-nearest-competitor/http://googleblog.blogspot.com/2008/07/we-knew-web-was-big.htmlhttp://googleblog.blogspot.com/2008/07/we-knew-web-was-big.html

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    12

    The Web is growing...

    Multimedia, Real-Time Data, Sensor Data, ....

    in 06/2010: 7 TB/day

    in 05/2010: 24 h of video upload / minute2 billion streamed videos per day

    in 06/2010: 7 TB/dayDienstag, 19. Oktober 2010

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    12

    The Web is growing...

    Multimedia, Real-Time Data, Sensor Data, ....

    in 06/2010: 7 TB/day

    in 05/2010: 24 h of video upload / minute2 billion streamed videos per day

    in 06/2010: 7 TB/dayDienstag, 19. Oktober 2010

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    13

    How to find something on the Web?

    Dienstag, 19. Oktober 2010

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    14

    The Web of Data

    Semantic Web Technologies

    Interoperable and machine understandabledata semantics

    Based on formal knowledge representations Creating a Web of Data

    Dienstag, 19. Oktober 2010

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    15

    Semantic Web and Linked Data

    From World Wide Web to Web of DataThe Web was designed as an information space, with the goal that it should be useful not only for human-human communication, but also that machines would be able to participate and help

    Prerequisites:

    Content can be read and interpreted correctly (=understood) by machines

    Tim Berners-Lee, Semantic Web Roadmap, Sept 1998

    Semantic Web (natural language) web content is

    explicitely annotated with semantic metadata

    semantic metadata encode the meaning (semantics) of web content and can be read andinterpreted correctly my machine

    Natural Language Processing Technology from traditional Information

    Retrieval (WWW Search Engines)

    Dienstag, 19. Oktober 2010

  • Seminar: Large Scale Computing 4 Semantic Web Technologies, Dr. Harald Sack et. al., Hasso-Plattner-Institut, Universitt Potsdam

    16

    Semantic Web and Linked Data

    Understanding Web Content - I

    Natural Language Processing Technology from traditional Information

    Retrieval (WWW Search Engines)

    Dienstag, 19. Oktober 2010

  • S

Recommended

View more >