Lesson 9.7: Avoiding interface redundancies

In the previous lesson, we learned how to use interface hierarchies in our schema to achieve different behaviours. In this lesson, we’ll see how having multiple interfaces that fulfill the same behaviour damages the querying capabilities of the model.

Data table

ISBN-13	ISBN-10	Title	Format	Authors	Editors	Illustrators	Other contributors	Publisher	Year	City	State	Country	Page count	Genres	Price	Stock
9780008627843	0008627843	The Hobbit	ebook	J.R.R. Tolkien		J.R.R. Tolkien		Harper Collins	2023	New York City	New York	United States	310	fantasy;fiction	16.99
9780060929794	0060929790	One Hundred Years of Solitude	paperback	Garcia Marquez, Gabriel				Perennial	1998	New York City	New York	United States	458	fiction;historical fiction	6.12	4
9780195153446	0195153448	Classical Mythology	paperback	Lenardon, Robert J.;Morford, Mark P. O.				Oxford University Press	2002	New York City	New York	United States	820	history;nonfiction	34.98	12
9780375801679	0375801677	The Iron Giant	ebook	Hughes, Ted		Davidson, Andrew		Knopf Books for Young Readers	1999	New York City	New York	United States	79	fiction;children’s fiction	33.97
9780387881355	0387881352	Electron Backscatter Diffraction in Materials Science	hardback		Schwartz, Adam J.;Kumar, Mukul;Adams, Brent L.;Field, David P.			Springer	2009	New York City	New York	United States	425	nonfiction;technology	230.37	9
9780393045215	0393045218	The Mummies of Urumchi	paperback	Barber, Elizabeth Wayland				W.W. Norton & Company	1999	New York City	New York	United States	240	history;nonfiction	21.6	1
9780393634563	0393634566	The Odyssey	ebook	Homer			Wilson, Emily	W.W. Norton & Company	2017	New York City	New York	United States	656	fiction;classics	13.99
9780446310789	0446310786	To Kill a Mockingbird	paperback	Harper Lee				Grand Central Publishing	1988	New York City	New York	United States	281	fiction;historical fiction	21.64	16
9780451162076	0451162072	Pet Sematary	paperback	King, Stephen				Signet	1984	New York City	New York	United States	374	horror;fiction	93.22	1
9780500026557	0500026556	Hokusai’s Fuji	paperback	Wada, Kyoko		Katsushika, Hokusai		Thames & Hudson	2024	London		United Kingdom	416	art;nonfiction	24.47	11
9780500291221	0500291225	Great Discoveries in Medicine	paperback		Bynum, William;Bynum, Helen			Thames & Hudson	2023	London		United Kingdom	352	history;nonfiction	12.05	18
9780553212150	055321215X	Pride and Prejudice	paperback	Austen, Jane				Bantam Classics	1983	New York City	New York	United States	295	fiction;historical fiction	17.99	15
9780575104419	0575104414	Dune	ebook	Herbert, Frank				Hachette Book Group	2010	New York City	New York	United States	624	fiction;science fiction	5.49
9780671461492	0671461494	The Hitchhiker’s Guide to the Galaxy	paperback	Adams, Douglas				Pocket	1982	New York City	New York	United States	215	fiction;science fiction	91.47	9
9780679425601	0679425608	Under the Black Flag: The Romance and the Reality of Life Among the Pirates	hardback	Cordingly, David				Random House	1996	New York City	New York	United States	296	history;nonfiction	34.73	13
9780740748479	0740748475	The Complete Calvin and Hobbes	hardback	Watterson, Bill		Watterson, Bill		Andrews McMeel Publishing	2005	Kansas City	Missouri	United States	1451	comics;fiction	128.71	6
9781098108274	1098108272	Fundamentals of Data Engineering	ebook	Reis, Joe;Housley, Matt				O’Reilly Media	2022	Sevastopol	California	United States	450	nonfiction;technology;children’s fiction	47.99
9781489962287	148996228X	Interpretation of Electron Diffraction Patterns	paperback	Keown, Samuel Robert;Andrews, Kenneth William;Dyson, David John				Springer	1967	New York City	New York	United States	199	nonfiction;technology	47.17	15
9781859840665	1859840663	The Motorcycle Diaries: A Journey Around South America	paperback	Guevara, Ernesto			Wright, Ann	Verso	1996	London		United Kingdom	160	biography;nonfiction	14.52	4
9783319398778		Physical Principles of Electron Microscopy: An Introduction to TEM, SEM, and AEM	ebook	Egerton, R.F.				Springer	2016	London		United Kingdom	196	nonfiction;technology	19.5
9798691153570		Business Secrets of The Pharoahs	paperback	Crorigan, Mark				British London Publishing	2020	London		United Kingdom	260	business;nonfiction	11.99	8

Lesson 9.6 schema

define
book sub entity,
    abstract,
    owns isbn-13,
    owns isbn-10,
    owns title,
    owns page-count,
    owns genre,
    owns price,
    plays contribution:work,
    plays publishing:published;
paperback sub book,
    owns stock;
hardback sub book,
    owns stock;
ebook sub book;
contributor sub entity,
    owns name,
    plays contribution:contributor;
publisher sub entity,
    owns name,
    plays publishing:publisher;
place sub entity,
    abstract,
    owns name,
    plays locating:location,
    plays locating:located;
city sub place,
    plays publishing:location;
state sub place;
country sub place;
contribution sub relation,
    relates contributor,
    relates work;
authoring sub contribution;
editing sub contribution;
illustrating sub contribution;
publishing sub relation,
    relates publisher,
    relates published,
    relates location,
    owns year;
locating sub relation,
    relates located,
    relates location;
isbn sub attribute, abstract, value string;
isbn-13 sub isbn;
isbn-10 sub isbn;
title sub attribute, value string;
page-count sub attribute, value long;
genre sub attribute, value string;
price sub attribute, value double;
stock sub attribute, value long;
name sub attribute, value string;
year sub attribute, value long;

Identifying redundancies

Let’s begin by adding the following rule to our schema to make locating relations transitive, which we first encountered in Lesson 5.4.

define
rule transitive-location:
    when {
        (location: $parent-place, located: $child-place) isa locating;
        (location: $child-place, located: $x) isa locating;
    } then {
        (location: $parent-place, located: $x) isa locating;
    };

With this rule defined, we can use the following patterns to describe things that are located in a particular country:

To describe states located in the country:

$country isa country, has name ?country-name;
$state isa state;
(location: $country, located: $state) isa locating;

To describe cities located in the country:

$country isa country, has name ?country-name;
$city isa city;
(location: $country, located: $city) isa locating;

To describe publications that were located in the country:

$country isa country, has name ?country-name;
(location: $country, located: $city) isa locating;
$publishing (location: $city) isa publishing;

It’s very easy to write a polymorphic pattern that describes states or cities located in a particular country.

$country isa country, has name ?country-name;
(location: $country, located: $x) isa locating;

Here the type of $x is determined to be locating:located, so instances of state and city are returned as they can be cast into this role. Now, what happens if we want to polymorphically query for everything located in the country? Well, we must use the following pattern instead.

$country isa country, has name ?country-name;
{
    (location: $country, located: $x) isa locating;
} or {
    (location: $country, located: $city) isa locating;
    $x (location: $city) isa publishing;
};

Here we need a disjunction. This is due to the fact that publishing cannot be cast into locating:located, so we need to make a special case for it using the second branch of the disjunction. When querying them individually, the patterns for states and cities were structurally identical, while the pattern for publications was different. This means that we cannot query all three together using a structurally common pattern.

Essentially, we are using two different interfaces to represent common behaviour: the locating:location interface and the publishing:location interface. This is a bad practice in PERA model design, as it would be in application model design. While it is not too much of a problem at this stage, it has the potential to grow much more prevalent if we do not build the model’s interfaces in an extensible manner.

Eliminating redundancies

In order to eliminate the interface redundancy, we must choose which to keep. Between locating:location and publishing:location, the former is the more general-purpose interface. We will keep that interface, and change the way we record the locations where books are published to use it as well. Let’s begin by examining the section of the current schema used for publishing information.

define
book sub entity,
    plays publishing:published;
publisher sub entity,
    plays publishing:publisher;
city sub place,
    plays publishing:location;
publishing sub relation,
    relates publisher,
    relates published,
    relates location,
    owns year;

We need to choose an object type to play the locating:located role. The entity type publisher would be a very poor choice.

define
book sub entity,
    plays publishing:published;
publisher sub entity,
    plays publishing:publisher,
    plays locating:located;
publishing sub relation,
    relates publisher,
    relates published,
    owns year;

Publishers can publish different books in different cities, and if we record the publishing location on the publisher, we won’t be able to tell which book was published in which city. Consider if we chose this approach and then inserted, for example, the following data from the dataset.

match
$book-1 isa book, has isbn-13 "9780387881355";
$springer isa publisher, has name "Springer";
$nyc isa city, has name "New York City";
insert
(published: $book-1, publisher: $springer) isa publishing;
(located: $springer, location: $nyc) isa locating;

match
$book-2 isa book, has isbn-13 "9783319398778";
$springer isa publisher, has name "Springer";
$london isa city, has name "London";
insert
(published: $book-2, publisher: $springer) isa publishing;
(located: $springer, location: $london) isa locating;

If we were to then query the location that one of the books was published, we’d get two results back.

match
$book-1 isa book, has isbn-13 "9780387881355";
$publisher isa publisher;
$city isa city, has name $city-name;
(published: $book-1, publisher: $publisher) isa publishing;
(located: $publisher, location: $city) isa locating;
fetch
$city-name;

{ "city-name": { "value": "New York City", "type": { "label": "name", "root": "attribute", "value_type": "string" } } }
{ "city-name": { "value": "London", "type": { "label": "name", "root": "attribute", "value_type": "string" } } }

This occurs because both publisher and city have functional dependencies on book, but there is no functional dependency between city and publisher. As a result, we can’t use publisher as the roleplayer because the data model will store the data in a lossy manner, as we have just seen.

Moving to the next option for the new roleplayer of locating:located, the entity type book is a reasonable choice.

define
book sub entity,
    plays publishing:published,
    plays locating:located;
publisher sub entity,
    plays publishing:publisher;
publishing sub relation,
    relates publisher,
    relates published,
    owns year;

We won’t run into the issue we do with publisher as there is a functional dependency of city on book. But this feels semantically odd. If we chose this approach, then the above query for a book’s publication city would become the following.

match
$book-1 isa book, has isbn-13 "9780387881355";
$city isa city, has name $city-name;
(located: $book-1, location: $city) isa locating;
fetch
$city-name;

While this model would not incur data loss, it is strange to talk about the location of a book as if it is somehow fixed, and the intent of the query is not immediately obvious as there is no mention of the book’s publication. In fact, there is a better choice.

In this case, the best option is to make the relation type publishing the roleplayer of location:located.

define
book sub entity,
    plays publishing:published;
publisher sub entity,
    plays publishing:publisher;
publishing sub relation,
    relates publisher,
    relates published,
    owns year,
    plays locating:located;

Because publishing is an object type, it can implement any interface like an entity type can. And because it depends functionally on book (by definition), doing so does not incur information loss. Here we have created a nested relation type: a relation type in which another relation type plays a role. Now we can query for a book’s publication city in the following manner.

match
$book-1 isa book, has isbn-13 "9780387881355";
$publishing (published: $book-1) isa publishing;
$city isa city, has name $city-name;
(located: $publishing, location: $city) isa locating;
fetch
$city-name;

With this model, we do not incur information loss, and the intent of the query is clear. If we return to our polymorphic pattern from earlier, it will now match instances of city, state, and publishing for $x, as they can all be upcast to location:located!

$country isa country, has name ?country-name;
(location: $country, located: $x) isa locating;

When we extend the model to include more concepts with locations, we should always make them play locating:located. In this way, this polymorphic query will always be able to find everything in a given location. If we would like to restrict the list of things returned, we can always place further constraints on the type of $x.

Nested relation types are an advanced feature of the PERA model. They can be difficult to deal with, and so should be used with caution. Notably, the strategies we explored in Lesson 4.3 for deleting entities and relations cannot be applied universally if there are nested relations in the schema. However, nested relations are also an extremely powerful tool, as we will see in Lesson 12.1, where we will explore the full type-theoretic capabilities of the PERA model.

The final schema

define
book sub entity,
    abstract,
    owns isbn-13,
    owns isbn-10,
    owns title,
    owns page-count,
    owns genre,
    owns price,
    plays contribution:work,
    plays publishing:published;
paperback sub book,
    owns stock;
hardback sub book,
    owns stock;
ebook sub book;
contributor sub entity,
    owns name,
    plays contribution:contributor;
publisher sub entity,
    owns name,
    plays publishing:publisher;
place sub entity,
    abstract,
    owns name,
    plays locating:location,
    plays locating:located;
city sub place;
state sub place;
country sub place;
contribution sub relation,
    relates contributor,
    relates work;
authoring sub contribution;
editing sub contribution;
illustrating sub contribution;
publishing sub relation,
    relates publisher,
    relates published,
    owns year,
    plays locating:located;
locating sub relation,
    relates located,
    relates location;
isbn sub attribute, abstract, value string;
isbn-13 sub isbn;
isbn-10 sub isbn;
title sub attribute, value string;
page-count sub attribute, value long;
genre sub attribute, value string;
price sub attribute, value double;
stock sub attribute, value long;
name sub attribute, value string;
year sub attribute, value long;

rule transitive-location:
    when {
        (location: $parent-place, located: $child-place) isa locating;
        (location: $child-place, located: $x) isa locating;
    } then {
        (location: $parent-place, located: $x) isa locating;
    };

Services

Platform

Tools

What is TypeDB?

Try cloud free

Pricing and plans

Learn

Content

Lesson 9.7: Avoiding interface redundancies

Identifying redundancies

Eliminating redundancies