TypeDB is an open-source, distributed database management system that relies on a user-defined type system to model, manage, and query data.
Original author(s) | Haikal Pribadi |
---|---|
Developer(s) | TypeDB |
Initial release | 9 September 2016 |
Stable release | 2.28.3
/ 10 June 2024[1] |
Repository | github |
Written in | Java[2] |
Operating system | Cross-platform |
License | AGPL 3.0 |
Website | www |
Overview
editThe data model of TypeDB is based on primitives from conceptual data modeling, which are implemented in a type system (see § Data and query model). The type system can be extended with user-defined types, type dependencies, and subtyping, which together act as a database schema. The model has been mathematically defined under the name polymorphic entity-relation-attribute model.[3]
To specify schemas and to create, modify, and extract data from the TypeDB database, programmers use the query language TypeQL. The language is noteworthy for its intended resemblance to natural language, following a subject-verb-object statement structure for a fixed set of “key verbs” (see § Examples).
History
editTypeDB has roots in the knowledge representation system Grakn (a portmanteau of the words "graph" and "knowledge"), which was initially developed at the University of Cambridge Computer Science Department.[4] Grakn was commercialized in 2017, and development was taken over by Grakn Labs Ltd.[4] Later that year, Grakn was awarded the "Product of the Year" award by the University of Cambridge Computer Science Department.[5]
In 2021, the first version of TypeDB was built from Grakn with the intention of creating a general-purpose database.[6] The query language of Grakn, Graql, was incorporated into TypeDB's query language, TypeQL, at the same time.
TypeDB Cloud, the database-as-a-service edition of TypeDB, was first launched at the end of 2023.[7]
Grakn version history
editThe initial version of Grakn, version 0.1.1, was released on September 15, 2016.[8]
Grakn 1.0.0 was released on December 14, 2017.[9]
Grakn 2.0.0 was released on April 1, 2021.[10]
TypeDB version history
editTypeDB 2.1.0, the first public version of TypeDB, was released on May 20, 2021.[6]
Features
editTypeDB is offered in two editions: an open-source edition, called TypeDB Core, and a proprietary edition, called TypeDB Cloud, which provides additional cloud-based management features.
TypeDB features a NoSQL data and querying model, which aims to introduce ideas from type systems and functional programming to database management.[11]
Database architecture
editGeneral database features include the following.
- ACID-compliance[2]
- Static type-checking of queries[2]
- Graphical user interface (TypeDB Studio)[2]
- Storage engine based on RocksDB[12]
- Synchronous replication through RAFT for scalability[2]
- TLS support
- Unicode support
Data and query model
editTypeDB's data and query model differs from traditional relational database management systems in the following points.
- Instead of tables and columns, TypeDB employs types, subtypings between types, and type dependencies to describe the database schema. It is argued that this may facilitate schema extensions and normalization, and may help clarify data dependencies.[13]
- Instead of formulating queries with algebraic operators as in SQL, TypeQL queries are sequences of statements that represent composite types. It is argued that this yields a “more declarative” querying style (see § Examples).[14]
- TypeDB provides support for Datalog-like functions (based on the correspondence of logical implication to function types), which can be defined recursively. This can have advantages for graph data workloads, as most graph algorithms are formulated recursively.[15]
- TypeDB's data model, based on subtyping and type dependencies, is aimed at modeling a variety of data structures. This subsumes relational data, structured tree-like data, structured graph-like data, data with inheritance, and hypergraph-like data.[16][17]
Limitations
editBy relying on a non-standard data and query model, TypeDB (at present) has no support for the integration of established relational or column-oriented database standards, file formats (such as CSV, Parquet), or the query language SQL. Moreover, TypeDB has no direct facility for working with unstructured data or vector data.
Query language
editTypeQL, the query language of TypeDB, acts both as data definition and data manipulation language.
The query language builds on well-known ideas from conceptual modeling, referring to independent types holding objects as entity types, dependent types holding objects as relation types, and types holding values as attribute types.[18] The language is composed of query clauses comprising statements. Statements, especially for data manipulation, usually follow a subject-verb-object structure.
The formal specification of the query language was presented at ACM PODS 2024, where it received the "Best Newcomer" Award.[19]
Examples
editThe following (incomplete) query creates a type schema using a define
query clause.
define
person sub entity,
owns name,
plays booking:passenger;
booking sub relation,
relates passenger,
relates flight,
owns booking_date;
name sub attribute,
value string;
...
The following query retrieves objects and values from the database that match the pattern given in the match
clause.[20]
match
$j isa person, has name $n;
$n contains "Jane";
$b isa booking,
links (passenger: $j, flight: $f);
has booking_date >= 2024-01-01;
$f has flight_time < 120;
$f links (destination: $c);
$c has name "Santiago de Chile";
Licensing
editThe open-source edition of TypeDB is published under the Mozilla Public License.[12]
References
edit- ^ "Releases · vaticle/typedb". GitHub.
- ^ a b c d e "TypeDB System Properties". DB Engines.
- ^ Dorn & Pribadi 2024
- ^ a b "TypeDB". Database of Databases.
- ^ "Hall of Fame". Department of Computer Science and Technology. 23 January 2018.
- ^ a b "TypeDB 2.1.0". Github.
- ^ "New Foundations for Building with TypeDB". TypeDB Blog. 27 March 2024.
- ^ "Grakn 0.1.1". Github.
- ^ "Grakn 1.0.0". Github.
- ^ "Grakn 2.0.0". Github.
- ^ "Functional Database Programming Paradigm". TypeDB.
- ^ a b "TypeDB Github". GitHub. June 2024.
- ^ Dorn & Pribadi 2024, §1.7
- ^ Dorn & Pribadi 2024, §1.5
- ^ Dorn & Pribadi 2024, §3.2
- ^ Sijs & Fletcher, 2022
- ^ Dorn & Pribadi 2024, App. A
- ^ "TypeDB Lecture Course". TypeDB. June 2024.
- ^ "PODS Awards". ACM SIGMOD/PODS. June 2024.
- ^ "TypeQL PODS 2024 Talk". ACM Digital Library. June 2024. doi:10.1145/3651611.
Bibliography
edit- Dorn, Christoph; Pribadi, Haikal (2024), "TypeQL: a Type-Theoretic and Polymorphic Query Language", Proc. ACM Manag. Data, 2 (2), New York, NY, USA: Association for Computing Machinery: 1–27, doi:10.1145/3651611
- Sijs, Joris; Fletcher, James (2022), "On a hypergraph structuring semantic information for robots navigating and conducting their task in real-world, indoor environments", 2022 26th International Conference on Methods and Models in Automation and Robotics (MMAR), IEEE, pp. 430–435, doi:10.1109/MMAR55195.2022.9874265, ISBN 978-1-6654-6858-9