Brytlyt makes ground-breaking progress on the TPC-H benchmark!

15th April 2019 by Brytlyt

The world’s fastest and most advanced GPU database company tackles the TPC-H benchmark record and sets the bar for GPU database vendors.

The most recent version of our GPU database, Brytlyt 3.1 Alpha, has successfully run the industry-recognised TPC-H benchmark, and in doing so looks capable of beating the previous record set in 2014. For many of the queries that make up the benchmark, the Brytlyt development team are achieving results that are at least four times faster than the record. These results are likely to be of great interest to companies looking at GPU solutions for ultra-fast data analytics with complex queries.

Our team worked closely with NVIDIA, using just one of their DGX-2 machines to run the queries. The results were announced last month at NVIDIA’s annual GTC technology conference in Silicon Valley.

Using the DGX-2 for our TPC-H queries was an obvious choice, because this hardware is a step-change in the amount of GPU compute contained in a single server. The DGX-2 is made up of 16 NVIDIA V100 GPUs, each with 32 GB of VRAM. This provides a total of 512 GB of VRAM and 2 petaflops of compute. The GPUs are connected through NVIDIA’s NVSwitch, which provides 2.4 TB/s of data transfer between GPUs. This is important because many of the TPC-H queries contain JOINs that need a data shuffle to happen before the query can be processed.

When the benchmark record was last set in 2014, it was considered much faster than anything else around at the time. The hardware used back in 2014 was also impressive and required a cluster of 20 servers to be coupled together. Five years down the line, Brytlyt can now run the same benchmark using just one machine, the DGX-2.
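To illustrate why a JOIN needs a data shuffle when tables are spread across several GPUs, here is a minimal Python sketch of a hash-partitioned join. It is not Brytlyt's implementation: the function names, the toy tables and the use of plain Python lists to stand in for GPU memory are all assumptions made for illustration. The only figure taken from the text is the 16 GPUs of the DGX-2.

```python
from collections import defaultdict

NUM_GPUS = 16  # the DGX-2 has 16 GPUs; everything else here is illustrative


def shuffle(rows, key_index, num_partitions=NUM_GPUS):
    """Route each row to the partition chosen by hashing its join key."""
    partitions = [[] for _ in range(num_partitions)]
    for row in rows:
        partitions[hash(row[key_index]) % num_partitions].append(row)
    return partitions


def local_hash_join(left, right, left_key, right_key):
    """Join two co-located partitions using an in-memory hash table."""
    table = defaultdict(list)
    for row in left:
        table[row[left_key]].append(row)
    return [l + r for r in right for l in table.get(r[right_key], [])]


def shuffle_join(left, right, left_key, right_key, num_partitions=NUM_GPUS):
    """Shuffle both inputs on the join key, then join partition by partition."""
    left_parts = shuffle(left, left_key, num_partitions)
    right_parts = shuffle(right, right_key, num_partitions)
    result = []
    for lp, rp in zip(left_parts, right_parts):
        result.extend(local_hash_join(lp, rp, left_key, right_key))
    return result


# Hypothetical toy tables: (order_id, customer) and (order_id, price).
orders = [(1, "alice"), (2, "bob"), (3, "carol")]
lineitems = [(1, 10.0), (1, 5.5), (3, 7.25), (4, 1.0)]
joined = shuffle_join(orders, lineitems, 0, 0)
```

After the shuffle, rows with the same key sit in the same partition, so each partition can be joined independently. On real hardware the shuffle step is the part that moves data between GPUs, which is why interconnect bandwidth matters for JOIN-heavy queries.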

Demonstrating this ability to set new query times for the TPC-H benchmark is a measure of our maturity in the GPU database space. The TPC-H benchmark examines large volumes of data by executing queries with a high degree of complexity, providing answers that inform real-world business decisions. Its twenty-two queries are run both in single-user mode and concurrently. They are based on a typical retail use case, which makes the benchmark meaningful to real-life business demands. All but two of the queries contain joins, and all of them include aggregations and complex expressions, along with nested and correlated queries.

Some of the largest retailers in the UK and the US regularly run analytics on two years of Electronic Point of Sale (EPOS) data. While the total dataset is massive, 99% of queries are run using a 10% sample. This 10% sample can be around 400 GB and contain 4 billion rows. What is so relevant about NVIDIA’s DGX-2 hardware is that this typical retail use case can fit onto just one machine. A query that might have taken up to an hour can now be executed almost immediately. For the first time, users can, in just a couple of seconds, gain a full suite of insights, such as category management, repeat purchases, behavioural analytics, information on where sales are coming from and going to, and weekly key measures over time.
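As a rough check of the claim that this sample fits on one machine, the sketch below compares the sample size with the DGX-2 VRAM figures quoted earlier in this article. It is back-of-the-envelope arithmetic only. It assumes decimal gigabytes and that the data is held uncompressed in VRAM, and it ignores the working memory that intermediate query results would need.

```python
# Figures quoted in the article.
GPUS = 16
VRAM_PER_GPU_GB = 32
SAMPLE_GB = 400
SAMPLE_ROWS = 4_000_000_000

total_vram_gb = GPUS * VRAM_PER_GPU_GB    # total VRAM across the machine
fits = SAMPLE_GB <= total_vram_gb         # does the sample fit in VRAM?
headroom_gb = total_vram_gb - SAMPLE_GB   # VRAM left over for working space

# Average row width, assuming 1 GB = 10**9 bytes (an assumption).
bytes_per_row = SAMPLE_GB * 10**9 / SAMPLE_ROWS

print(f"total VRAM: {total_vram_gb} GB, fits: {fits}, headroom: {headroom_gb} GB")
print(f"average row width: {bytes_per_row:.0f} bytes")
```

Under these assumptions the sample occupies 400 GB of the 512 GB available, which leaves 112 GB of headroom, and each row averages about 100 bytes.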

Richard Heyns, Brytlyt’s CEO and founder, had the following comments: “We are very excited with how well our GPU database performed on NVIDIA’s hardware. For quite a while now, I’ve had my eye on the TPC-H benchmark. Our success here proves the immense value of our product to many industries including telecommunications, retail and finance. Having software that we have built, breaking records using NVIDIA’s machines, is very satisfying. We are incredibly lucky to have some of the world’s finest engineers on the Brytlyt team, who continue to prove that they are true innovators in the GPU database space.”

There are no results for the full TPC-H benchmark from other GPU vendors in the public domain.

Brytlyt is based in the UK and has spent the last 5 years developing innovative analytics software. The suite of products includes the web visualisation analytics tool, SpotLyt. Data scientists and analysts can get the most out of the Brytlyt GPU database with real-time analysis and interactive exploration. SpotLyt allows users to visually discover correlations and anomalies in billion-row datasets. Richard Heyns commented: “We designed SpotLyt after realising that popular visualisation packages on the market were not designed to cope with the processing speeds that Brytlyt delivers. This meant that delivering the whole package took longer, but it has definitely been worth the wait!”

More details on our work on the TPC-H benchmark can be found here.

About Brytlyt

Brytlyt was founded in 2013. Its GPU database acceleration technology, with its patent-pending IP, features:

  • Astonishing Performance:
    Brytlyt’s GPU-accelerated database technology is transforming the way businesses use data. With Brytlyt, companies can query multi-billion-row datasets in milliseconds.
  • Easy integration with existing systems:
    There’s no need for businesses to give up their current investments in code, analytics, and visualisation. Instead, they can accelerate them with Brytlyt with little to no effort.
  • Smooth scalability:
    Businesses can add and remove GPU resources at will, scaling their processing capability to suit their needs, which lets them massively reduce data processing costs.
  • Functionality-rich and easy to use:
    Brytlyt is built on PostgreSQL with a full SQL editor, and its deep functionality is complemented by outstanding ease of use.

Brytlyt’s mission is to empower organisations through Speed of Thought Analytics.

  • The world’s fastest database according to independent benchmarking.
  • Four years in research and development.
  • Only vendor to have patent-pending IP for JOINs.
  • Fourth generation GPU Manager bridges the gap between SQL and AI.

The true value of Brytlyt lies in how this extreme performance is packaged for the end user.