- Authors
- Dipl. Inform. Philipp Große
- Dr. Norman May
- Prof. Dr. Wolfgang Lehner
- Title
- A Study of Partitioning and Parallel UDF Execution with the SAP HANA Database
- Citable URL
- https://nbn-resolving.org/urn:nbn:de:bsz:14-qucosa-144026
- Series
- Technische Berichte
- Volume number
- 2014,03 (TUD-FI14-03 May 2014)
- First published
- 2014
- ISSN
- 1430-211X
- Abstract (EN)
- Large-scale data analysis relies on custom code both for preparing the data and for the core analysis algorithms. The map-reduce framework offers a simple model for parallelizing custom code, but it does not integrate well with relational databases. Likewise, the literature on query optimization in relational databases has largely ignored user-defined functions (UDFs). In this paper, we discuss annotations for user-defined functions that facilitate optimizations considering both relational operators and UDFs. We believe this approach to be superior to merely linking map-reduce evaluation to a relational database because it enables a broader range of optimizations. We focus on optimizations that enable the parallel execution of relational operators and UDFs for a number of typical patterns. A study on real-world data investigates the opportunities for parallelizing complex data flows that contain both relational operators and UDFs.
- Keywords (DE)
- UDF, Datenbanken, Optimierung, SAP HANA, benutzerdefinierte Funktionen
- Keywords (EN)
- UDF, Database, Optimization, SAP HANA, user-defined functions
- Classification (DDC)
- 004
- Classification (RVK)
- SS 5514
- Publishing institution
- Technische Universität Dresden, Dresden
- Funding / project information
- URN Qucosa
- urn:nbn:de:bsz:14-qucosa-144026
- Publication date (Qucosa)
- 8 July 2014
- Document type
- Research report
- Document language
- English
- License / rights notice