VLDB 2020: TiDB, A Raft-based HTAP Database

Author: PingCAP

Editors: Queeny Jin, Tom Dewan

At PingCAP, our goal is to build a database that is a single, real-time source of truth so that the applications and the businesses on top of it can adapt to uncertainty. As we built TiDB, we benefited from databases and distributed systems research such as Raft and Spanner. Along the journey, we’ve developed some novel ideas, and we are thrilled to contribute them back to the academic community.

A group of our engineers have summarized what we’ve learned in a new research paper, “TiDB: A Raft-based HTAP Database.” In this paper, we propose TiDB as a Hybrid Transactional and Analytical Processing (HTAP) database, a “one size fits many” solution for OLTP and OLAP applications.

HTAP databases require processing transactional and analytical queries in isolation to remove the interference between them. To achieve this, it is necessary to maintain different replicas of data specified for the two types of queries. To this end, we propose extending replicated state machine-based consensus algorithms to provide consistent replicas for HTAP workloads. Based on this novel idea, we present a Raft-based HTAP database: TiDB.

A multi-Raft storage system consisting of a row store and a column store

The figure above shows the key architecture of TiDB. TiDB is built on a multi-Raft storage system which consists of a row store (TiKV) and a column store (TiFlash). The row store is built based on the Raft algorithm to materialize updates from transactional requests. In particular, it asynchronously replicates Raft logs to learners which transform row format to column format for tuples, forming a column store. This column store allows analytical queries to read fresh and consistent data in real time. On the top of this storage system, we build an SQL engine to optimally access TiKV and TiFlash for HTAP queries. Comprehensive experiments show that TiDB achieves isolated high performance under CH-benCHmark, a benchmark focusing on HTAP workloads.

TiDB presents an innovative solution that helps consensus-algorithm-based NewSQL systems evolve into HTAP systems. NewSQL systems ensure high availability, scalability, and data durability for OLTP requests by replicating their database like Google Spanner and CockroachDB. They synchronize data across data replicas via replication mechanisms typically from consensus algorithms. Based on the log replication, NewSQL systems can provide a columnar replica dedicated to OLAP requests so that they can support HTAP requests in isolation like TiDB.

We believe that the future of the database is about unification, adaptiveness, and intelligence. When we face uncertainty in the future, we will have the unified infrastructure to adapt intelligently. We hope that “TiDB: A Raft-based HTAP Database” can inspire researchers and software engineers to make the future come earlier. The paper has been published in August, 2020, in PVLDB, Volume 13, Issue 12. Download the paper here.

We would like to offer a big “thank you” to authors and reviewers of the paper. We also want to thank the many contributors and users in our TiDB community. You have made TiDB what it is today, and you will continue to make it better tomorrow.

HTAP VLDB

Thought Leadership

May 21, 2025

10年前の登場から再び脚光を浴びる「HTAP」は何がスゴイのか？──野心的なTiDBの構成から紐解く

Engineering

March 29, 2024

Web3Bench: A New HTAP Benchmark for Web3 Workloads

Community

October 5, 2023

Introducing Data in the Hallway: A New Podcast Series for Open Source Data Enthusiasts

Thought Leadership

May 21, 2025

10年前の登場から再び脚光を浴びる「HTAP」は何がスゴイのか？──野心的なTiDBの構成から紐解く

Engineering

March 29, 2024

Web3Bench: A New HTAP Benchmark for Web3 Workloads

Community

October 5, 2023

Introducing Data in the Hallway: A New Podcast Series for Open Source Data Enthusiasts

View All

Have questions? Let us know how we can help.

TiDB Cloud Dedicated

TiDB Cloudのエンタープライズ版。
専用VPC上に構築された専有DBaaSでAWSとGoogle Cloudで利用可能。

サインアップ詳細を見る

TiDB Cloud Starter

TiDB Cloudのライト版。
TiDBの機能をフルマネージド環境で使用でき無料かつお客様の裁量で利用開始。

無料で始める詳細を見る

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

VLDB 2020: TiDB, A Raft-based HTAP Database

Related Resources

10年前の登場から再び脚光を浴びる「HTAP」は何がスゴイのか？──野心的なTiDBの構成から紐解く

Web3Bench: A New HTAP Benchmark for Web3 Workloads

Introducing Data in the Hallway: A New Podcast Series for Open Source Data Enthusiasts

10年前の登場から再び脚光を浴びる「HTAP」は何がスゴイのか？──野心的なTiDBの構成から紐解く

Web3Bench: A New HTAP Benchmark for Web3 Workloads

Introducing Data in the Hallway: A New Podcast Series for Open Source Data Enthusiasts

Have questions? Let us know how we can help.

TiDB Cloud Dedicated

TiDB Cloud Starter