Philipp M. Grulich

Query Processing on Heterogeneous Hardware

Scalable Data Management for Future Hardware | Anastasiia Kozar, Janis von Bleichert, Sebastian Breß, Philipp M Grulich, Clemens Lutz, Tilmann Rabl, Viktor Rosenfeld, Jonas Traub, Steffen Zeuch, Volker Markl

Paper

Abstract: In modern processor design, power efficiency has become the primary constraint, prompting manufacturers to develop processors that balance energy consumption with the growing demand for speed. This shift has initiated an era of heterogeneous multi-core computing, characterized by machines utilizing various processors such as GPUs, MICs, and FPGAs. These processors significantly enhance performance due to their computational capabilities and memory bandwidth, essential for optimizing query processing performance. However, executing database queries efficiently across diverse processors presents challenges due to architectural differences, leading to varied performance outcomes for different operator implementations. This chapter explores methodologies for executing database queries on any processor with maximum efficiency without manual adjustments. We propose compiling database queries into optimized code that can adapt continuously to achieve optimal performance across a wide array of processors. Key areas of focus include the use of GPUs in database systems, addressing challenges such as workload distribution and data transfer bottlenecks, and introducing a classification scheme for strategies developed to tackle these issues. Additionally, we examine NVLink 2.0 technology’s potential to improve data transfer efficiency between GPUs and CPUs, enhancing GPU-accelerated query processing. Furthermore, we present a novel adaptive query compilation-based stream processing engine (SPE) that surpasses traditional interpretation-based SPEs by incorporating runtime optimizations and task-based parallelization. This approach allows for dynamic adjustments to data characteristics, significantly improving query execution efficiency and throughput. Through these explorations, we aim to provide insights into current systems and highlight areas for future research, ultimately contributing to the advancement of heterogeneous query processing systems.

Scalable Data Management for Future Hardware

Query compilation without regrets

Sigmod'24 | Philipp M. Grulich, Aljoscha Lepping, Dwi Prasetyo Adi Nugroho, Bonaventura Del Monte, Varun Pandey, Steffen Zeuch, Volker Markl

Bridging the Gap: Complex Event Processing on Stream Processing Systems

EDBT'24 | Ariane Ziehn, Philipp M. Grulich, Steffen Zeuch, Volker Markl

Benchmarking Stream Join Algorithms on GPUs: A Framework and its Application to the State-of-the-art

EDBT'24 | Dwi PA Nugroho, Philipp M. Grulich, Steffen Zeuch, Clemens Lutz, Stefano Bortoli, Volker Markl

Towards Unifying Query Interpretation and Compilation

CIDR'23 | Philipp M. Grulich, Aljoscha Lepping, Dwi Prasetyo Adi Nugroho, Bonaventura Del Monte, Varun Pandey, Steffen Zeuch, Volker Markl

Survey of window types for aggregation in stream processing systems

VLDB Journal | Juliane Verwiebe, Philipp M. Grulich, Jonas Traub, Volker Markl

Babelfish: Efficient Execution of Polyglot Queries

VLDB'22 | Philipp M. Grulich, Steffen Zeuch, Volker Markl

Scotty: General and Efficient Open-source Window Aggregation for Stream Processing Systems.

TODS | Jonas Traub, Philipp M. Grulich, Alejandro Rodriguez Cuellar, Sebastian Breß, Asterios Katsifodimos, Tilmann Rabl, Volker Markl

An Energy-Efficient Stream Join for the Internet of Things.

DaMoN 21 | Adrian Michalke, Philipp M. Grulich, Clemens Lutz, Steffen Zeuch, Volker Markl

Parallelizing Intra-Window Join on Multicores: An Experimental Study

SIGMOD 21 | Shuhao Zhang, Yancan Mao, Jiong He, Philipp M. Grulich, Steffen Zeuch, Bingsheng He, Richard T. B. Ma, Volker Markl

ExDRa: Exploratory Data Science on Federated Raw Data.

SIGMOD 21 | Sebastian Baunsgaard, Matthias Boehm, Ankit Chaudhary, Behrouz Derakhshan, Stefan Geißelsöder, Philipp M. Grulich, Michael Hildebrand, Kevin Innerebner, Volker Markl, Claus Neubauer, Sarah Osterburg, Olga Ovcharenko, Sergey Redyuk, Tobias Rieger, Alireza Rezaei Mahdiraji, Sebastian Benjamin Wrede, Steffen Zeuch

Grizzly: Efficient Stream Processing Through Adaptive Query Compilation

SIGMOD'20 | Philipp M. Grulich, Sebastian Breß, Steffen Zeuch, Jonas Traub, Janis von Bleichert, Zongxiong Chen, Tilmann Rabl, Volker Markl

Disco: Efficient Distributed Window Aggregation

EDBT'20 | Philipp M. Grulich, Sebastian Breß, Steffen Zeuch, Jonas Traub, Janis von Bleichert, Zongxiong Chen, Tilmann Rabl, Volker Markl

The NebulaStream Platform: Data and Application Management for the Internet of Things

CIDR'20 | Steffen Zeuch, Ankit Chaudhary, Bonaventura Del Monte, Haralampos Gavriilidis, Dimitrios Giouroukis, Philipp M. Grulich, Sebastian Bress, Jonas Traub, Volker Markl

Generating Reproducible Out-of-Order Data Streams

DEBS'19 | Philipp M. Grulich, Jonas Traub, Sebastian Breß, Asterios Katsifodimos, Volker Markl, Tilmann Rabl

Efficient Window Aggregation with General Stream Slicing

EDBT'19 | Jonas Traub, Philipp M. Grulich, Alejandro Rodriguez Cuellar, Sebastian Breß, Asterios Katsifodimos, Tilmann Rabl, Volker Markl

Collaborative Edge and Cloud Neural Networks for Real-Time Video Processing

VLDB'18 | Philipp M. Grulich, Faisal Nawab

Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive Windowing

EDBT'18 | Philipp M. Grulich, René Saitenmacher, Jonas Traub, Sebastian Breß, Tilmann Rabl, Volker Markl

Scotty: Efficient Window Aggregation for Out-of-Order Stream Processing.

ICDE'18 | Jonas Traub, Philipp Marian Grulich, Alejandro Rodriguez Cuellar, Sebastian Breß, Asterios Katsifodimos, Tilmann Rabl, Volker Markl

I2: Interactive Real-Time Visualization for Streaming Data.

EDBT'17 | Jonas Traub, Nikolaas Steenbergen, Philipp M. Grulich, Tilmann Rabl, Volker Markl