In this comprehensive paper, Turing Award recipient Michael Stonebraker and his student Andrew Pavlo examine the evolution of database technology over the past two decades. They cover the continued dominance of relational models and SQL, the emergence of NoSQL systems, and the rise of new technologies such as column-store systems, cloud databases, and data lakes.
The authors argue that despite the emergence of alternative solutions, relational models and SQL remain the primary choice for database management systems (DBMS). This is attributed to SQL's ability to incorporate best practices from other models and the ongoing influence of hardware advancements.
The paper also delves into the evolution of NoSQL systems, noting their gradual integration into SQL/RM systems. For example, document databases and vector databases are evolving to support SQL and ACID transactions.
Looking ahead, the authors predict continued growth in cloud databases, data lakes, and NewSQL systems. They also anticipate a deeper integration of vector databases with AI tools, enabling applications like semantic search and recommendation systems. Relational databases, they suggest, will continue to adapt to new application scenarios through ongoing expansion.