The Project
We started by exploring the Whitechapel Murders, committed by an unidentified serial killer known by the pseudonym "Jack the Ripper". Our starting points were the Wikipedia articles on this topic and the cultural institutions that provide metadata about related items.
After putting the events, related people, places and dates into context, we connected them to works inspired by the Victorian-era serial killer and the murders associated with him.
Theoretical and E/R Models
An entity-relationship model (or E/R model) describes interrelated things of interest in a specific domain of knowledge (source).
A basic E/R model is composed of entity types, which classify the things of interest, and specifies the relationships that can exist between entities (instances of those entity types) (source).
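As a rough illustration of the definition above, entity types and relationships can be sketched in a few lines of Python. This is not the project's actual model: the class names, the entity type names ("Concept", "Event", "Letter", "Film") and the relationship labels are assumptions chosen for the example.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Entity:
    """A thing of interest; entity_type classifies it."""
    name: str
    entity_type: str


@dataclass
class ERModel:
    """A set of entities plus named relationships between them."""
    entities: set = field(default_factory=set)
    relationships: list = field(default_factory=list)

    def relate(self, source: Entity, label: str, target: Entity) -> None:
        # Register both endpoints so every relationship joins known entities.
        self.entities.update({source, target})
        self.relationships.append((source, label, target))

    def related_to(self, entity: Entity) -> list:
        # Collect (label, other entity) pairs for relationships touching `entity`.
        found = []
        for source, label, target in self.relationships:
            if source == entity:
                found.append((label, target))
            elif target == entity:
                found.append((label, source))
        return found


# Hypothetical instances; type names and labels are illustrative assumptions.
ripper = Entity("Jack the Ripper", "Concept")
murders = Entity("Whitechapel Murders", "Event")
letter = Entity("Dear Boss", "Letter")
film = Entity("From Hell", "Film")

model = ERModel()
model.relate(ripper, "is associated with", murders)
model.relate(letter, "is attributed to", ripper)
model.relate(film, "is about", ripper)

for label, other in model.related_to(ripper):
    print(f"{label}: {other.name} ({other.entity_type})")
```

Placing one entity at the centre and hanging every relationship off it mirrors a hub-style model, where a single lookup on the central entity returns everything connected to it.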
We decided that the concept of "Jack the Ripper" has a central position, so we connected the related items and assets to it.
The first model below is an E/R model that represents the ideas connected to the topic in more specific and detailed terms.
Metadata alignment
The following table shows the chosen items and their providing institutions, along with the standards these institutions use. Where the metadata standard was not specified, we chose the most sensible option. Firstly, Getty Images does not specify any metadata standard, so we used IPTC (Photo Metadata Standard), which is commonly used by photographers and archivists to describe images. Secondly, SBN ubo-Polo Bolognese does not specify its metadata standard either, but based on our studies we assumed that it uses REICAT (Regole italiane di catalogazione). This assumption was confirmed by a similar LOD project whose creators contacted SBN ubo-Polo Bolognese. Like them, we decided to use ISBD instead of REICAT, since REICAT is based on ISBD.
Lastly, Google Books describes data with various standards; here we chose ISBD because it is an important standard and doing so gave us a chance to deepen our knowledge of it.
The next step was the alignment of the various metadata standards. The properties were divided into four categories: person, time, place and concept/subject. As the tables show, we looked up many properties from various standards in order to become more familiar with the standards and the properties they offer. Even though we did not use all of them in our project, we can re-use these tables in future projects. Most of the metadata is descriptive and structural, but we have also included administrative metadata, although to a lesser extent. To follow the principles of 5-star linked data by Tim Berners-Lee, CSV files are provided for all tables to make the data available in non-proprietary open formats as well. Click here to download the CSV files.
| PERSON | DC | FOAF | EDM | CIDOC-CRM | MARC21 | ISBD | IPTC | EAD | SKOS | OWL |
|---|---|---|---|---|---|---|---|---|---|---|
Agent | dcterms: agent | foaf:agent | edm:agent | e39_Actor | | | | <agent> | | |
Author | dcterms: creator/agent | | | | | | 6.5. Creator | <author> | | |
Contributor | dc: contributor | | | p11_ participated in | | | | | | |
Creator | dc: creator | foaf:made | | p94_has created | | | | <origination> | | |
Owner | dcterms: provenance | | edm:provider | p52_is current owner of/p75_possesses | | | 6.20. Source/10.14 Image Supplier | <origination> | | |
Publisher | dc: publisher | | | | $b -- Name of publisher, distributor, etc. | has name of publisher, producer, distributor | | <publisher> | | |
Relation | dc: relation | foaf: knows | edm:hasMet/wasPresentAt | p136_was based on/p15_was influenced by/p69_has association with | | has note on relationship to other resources | | | skos:related | owl:sameAs |
Type | dcterms: type | foaf: type | edm:hasType | e55_type | | | | | | |
Name | | foaf: familyName/firstName/foaf:name | | | 1 - Surname/3 -- Family name | | 11.10.4. Name | <name> | | |
Person | | foaf:person | | e21_person | | | | | | |
Topic of | | foaf:IsPrimaryTopicOf | | | | | | | skos:topConceptOf | |
Death | | | | p100_died in/death of/P100 was death of (died in) | | | | | | |
Birth | | foaf:birthday | | P98 brought into life (was born) | | | | | | |
Killed | | | | P93 took out of existence (was taken out of existence by) | | | | | | |
PLACE | GEONAMES | EDM | CIDOC CRM | IPTC | EAD | MARC21 | DCMI | ISBD |
---|---|---|---|---|---|---|---|---|
Place | gn:name | edm:Place | | 11.9. Location structure | <geogname> | 752 Added entry--hierarchical place name | | has place of publication, production, distribution statement |
Country | | edm:country | | 11.9.3. Country Name | | | | |
City | | | | 11.9.1. City | | | | |
(Current) Location | | edm:currentLocation | P55 has current location (currently holds) | 10.19. Location created | | | dcterms:location | |
Location of originals | | | | | <originalsloc> | 535 Location of Originals/Duplicates | | |
Location of event | | | P7 took place at (witnessed) | | | | | 4.1 Place of publication, distribution, etc. |
| | | | | | | | |
TIME | | | | | | | | |
Timespan | | edm:TimeSpan | P4 has time-span (is time-span of): E52 Time-Span | 11.1.11. Date Created | | | dcterms:extent | |
Year | | edm:year | | | | | | |
Events that happen simultaneously | | | P117 occurs during (includes)/P119 meets in time with (is met in time by) | | | | | |
Beginning | | | P116 starts (is started by) | | <fromdate> | | | |
Ending | | | P115 finishes (is finished by) | | <todate> | | | |
Period | | | E4 Period | 11.1.16. Style Period | <daterange> | | dcterms:PeriodOfTime | |
Date | | | | | <date> | 260$c Date | dcterms:date | 4.4 Date of publication, distribution, etc. |
| CONCEPT/SUBJECT | DC | FOAF | EDM | CIDOC-CRM | MARC21 | ISBD | IPTC | EAD | ISBD | GEONAMES | SKOS | OWL |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Subject | dcterms:subject | foaf:person | | P129 is about (is subject of) | | | | <subject> | | | skos:Concept | |
Author | dcterms: creator/agent | foaf:person | | | | | 6.5. Creator | <author> | | | | |
Contributor | dc: contributor | foaf:person | dc:contributor | p11_ participated in | | | 10.11. Image Creator/11.1.3. Contribution Description | | | | | |
Creator | dc: creator | foaf:made | dc:creator | P94 has created (was created by) | | | | <origination> | 4.6 Name of manufacturer | | | |
Owner | dcterms: provenance | foaf:person | edm:provider | P51 has former or current owner (is former or current owner of)/p75_possesses | | | 6.20. Source/10.14 Image Supplier | | | | | |
Publisher | dcterms: publisher | | dc:publisher | | | | | <publisher> | 4.2 Name of publisher, distributor, etc. | | | |
Relation | dcterms: relation/dcterms:isReferencedBy | foaf: primaryTopicOf/foaf:topic | edm:isRelatedTo/edm:isSimilarTo/edm:wasPresentAt/dcterms:isPartOf/dcterms:isReferencedBy | p136_was based on/p15_was influenced by/p69_has association with/P121_was_present_at/P67 refers to (is referred to by)/P69 is associated with | $b -- Name of publisher, distributor, etc. (R) | | | <relation>/<ref>/<relatedmaterial> | | | | |
Type | dcterms: type/dcterms:MediaType | foaf: type/foaf:Document/foaf:Image | edm:type/edm:object | P2 has type (is type of): E55 Type/P65 shows visual item (is shown by) | | | | | 5.1 Specific material designation and extent of resource | | | |
Material | | | | | | | | | | | | |
Medium | dcterms: medium | | dcterms:medium | | 340 Physical medium | | | <phystech> | | | | |
Represents | | foaf:depicts/depiction | edm:incorporates/edm:isRepresentationOf | P62 depicts (is depicted by)/P70 documents (is documented in)/P138 represents (has representation) | skos:ifTopConceptOf | | 10.2. Artwork or Object in the Image/10.28. Person Shown in the Image | | | | | |
location | dcterms:location | | | P55 has current location (currently holds) | | | | | | | | |
description | dcterms:description | | dc:description | | | | 11.1.2. Content Description/11.10.2. Description | <did> | | | | |
title | dcterms:title | | | P102 has title (is title of) | 130 Main entry--uniform title OR 240 Uniform title | | 11.1.17. Title | <title> | 1.1 Title proper | | | |
alternative title | dcterms:alternative | | | | | | | | | | | |
language | dcterms: language | | edm:language | P72 has language (is language of) | 546 Language | | | <langmaterial> | | | | |
happened at | | | edm:happenedAt/edm:occurredAt/edm:isShownAt | P7 took place at (witnessed) | 752 Added entry--hierarchical place name | | 10.19. Location created | <geogname>/<originalsloc> | | | | |
website | | foaf:homepage | edm:WebResource | P4_has_time_span | | | | <href> | | gn:wikipediaArticle | | |
format | dc:format/dcterms:hasFormat | | dc:format | | | | | | | | | |
event | dcmiterms: event | | edm:Event | E5 Event | | | 10.9. Event | <event> | | | | |
identifier | dcterms:identifier | | | | | | | <identifier> | | | | |
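The alignment tables above can be read as a crosswalk: for a given concept and a given standard, they return the matching property, or nothing where the cell is empty. The sketch below shows one way to hold a few of those rows in Python and export them as CSV. The names `CROSSWALK`, `lookup` and `to_csv` are hypothetical, and only three rows are copied from the tables; it is not how the project's own CSV files were produced.

```python
import csv
import io

# A few rows copied from the alignment tables; empty cells are simply omitted.
CROSSWALK = {
    "Creator": {
        "DC": "dc:creator",
        "FOAF": "foaf:made",
        "CIDOC-CRM": "P94 has created (was created by)",
        "EAD": "<origination>",
    },
    "Publisher": {
        "DC": "dc:publisher",
        "EAD": "<publisher>",
        "ISBD": "4.2 Name of publisher, distributor, etc.",
    },
    "Title": {
        "DC": "dcterms:title",
        "CIDOC-CRM": "P102 has title (is title of)",
        "IPTC": "11.1.17. Title",
        "EAD": "<title>",
        "ISBD": "1.1 Title proper",
    },
}


def lookup(concept, standard):
    """Return the aligned property, or None where the table cell is empty."""
    return CROSSWALK.get(concept, {}).get(standard)


def to_csv(crosswalk, standards):
    """Serialise the crosswalk as CSV text, one row per concept."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["Concept", *standards])
    for concept, row in crosswalk.items():
        writer.writerow([concept, *(row.get(s, "") for s in standards)])
    return buffer.getvalue()


print(lookup("Creator", "FOAF"))
print(to_csv(CROSSWALK, ["DC", "FOAF", "ISBD"]))
```

Storing only the filled cells keeps the sparse tables compact, and the `csv` module takes care of quoting values that themselves contain commas, such as the ISBD area names.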
Conceptual model and data description
Creating a visual conceptual model was useful for data description, as the color codes indicate which model provided each predicate. We used these predicates in the following steps of the project. The mind map-like representation is presented below:
We have chosen three items to describe based on the following criteria:
- the items must be of different types,
- the items must come from different institutions,
- and the items must be able to connect different items, people and places.
These criteria were necessary in order to create a rich and varied knowledge graph that describes relationships between items, people and places. Thus, we chose three different items that we found capable of describing a large variety of data:
- Dear Boss, one of the letters written by Jack the Ripper;
- From Hell, a movie about the serial killer;
- and an artwork called Tomb of Jack the Ripper.
Links and interconnections
Interconnections and links between data were expressed by applying properties from various vocabularies. As the chart shows, rdfs:seeAlso and foaf:isPrimaryTopicOf were the most common ones, as they are suitable for connecting concepts and people. For example, in the case of the letters attributed to Jack the Ripper, we included only one of them (Dear Boss) in the collection of items. However, the data description enabled us to connect this letter with the other two letters attributed to Jack the Ripper (From Hell and Saucy Jack).
Apart from these two predicates, gn:wikipediaArticle and dc:relation were used the most. The latter can express a wide variety of connections, so it is suitable for most types of data connections. The former, on the other hand, is quite specific, connecting a concept, person, date or place with its Wikipedia page. This also provides a sort of meta-level connection, since Wikipedia articles are rich sources of related materials.
CIDOC-CRM’s "occurs during" predicate provides an interesting perspective, as we can use it to connect events and time periods that took place simultaneously.
Similarly, CIDOC-CRM’s "has association with" predicate was useful for connecting people, concepts and events that are related, even if not in very obvious ways. A good example is the movie "From Hell", which is loosely based on a graphic novel of the same title, which was in turn inspired by the letter of the same title attributed to Jack the Ripper.
Lastly, skos:related and edm:isRelatedTo were used less often, but since we can easily align them with DCMI’s relation, these three predicates express fairly general concepts and can be used interchangeably.
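The idea of treating the three general predicates as interchangeable, and of counting how often each predicate is used, can be sketched as follows. The triples are a small hand-made sample, not the project's data: the subject identifiers such as `letter/dear_boss` are hypothetical, and the choice of dc:relation as the common label is an assumption made for the example.

```python
from collections import Counter

# Predicates that express a fairly general "is related to" connection.
GENERAL_RELATIONS = {"dc:relation", "skos:related", "edm:isRelatedTo"}


def normalise(predicate):
    """Map every general relation predicate onto dc:relation."""
    return "dc:relation" if predicate in GENERAL_RELATIONS else predicate


# Hypothetical sample triples (subject, predicate, object).
triples = [
    ("letter/dear_boss", "rdfs:seeAlso", "letter/from_hell"),
    ("letter/dear_boss", "rdfs:seeAlso", "letter/saucy_jack"),
    ("film/from_hell", "crm:P69_has_association_with", "letter/from_hell"),
    ("object/Statue", "edm:isRelatedTo", "concept/jack_the_ripper"),
    ("concept/jack_the_ripper", "skos:related", "popular culture"),
]

# Count predicate usage after normalisation, most frequent first.
usage = Counter(normalise(p) for _, p, _ in triples)
for predicate, count in usage.most_common():
    print(predicate, count)
```

A count like this is one way to derive the kind of predicate-frequency chart mentioned above, although the project's own chart may have been produced differently.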
RDF TURTLE
Semantic information helps computers understand what we put on the web. RDF stands for "Resource Description Framework."
What is RDF exactly?
RDF is a standard model for data interchange on the Web. RDF has features that facilitate data merging even if the underlying schemas differ. It specifically supports the evolution of schemas over time without requiring all the data consumers to be changed.
RDF extends the linking structure of the Web by using URIs to name the relationships between things. It can describe any concept or thing that exists in the universe. The idea behind RDF is simple. Every RDF statement has three parts:
SUBJECTS refer to the thing that we are describing
PREDICATES refer to an attribute of the thing that we are describing
OBJECTS refer to the value of that attribute, that is, the thing the predicate points to (W3schools).
A Turtle document is a textual representation of an RDF graph (W3schools). A graph is made up of triples consisting of a subject, predicate, and an object. So the simplest triple statement is a sequence of (subject, predicate, object) terms, separated by whitespace and terminated by '.' after each triple (W3schools).
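The simple triple statement described above can be illustrated with a short Python sketch that writes each (subject, predicate, object) tuple as one Turtle line. The helper names `literal` and `to_turtle` are hypothetical, and the sketch covers only plain string literals and already-formed terms; it omits prefix declarations and is not a full Turtle serializer.

```python
def literal(value):
    """Quote a plain string literal, escaping backslashes and double quotes."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def to_turtle(triples):
    """Emit one 'subject predicate object .' statement per triple."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


# Two statements about the artwork, using values from the RDF listing below.
triples = [
    ("<object/Statue>", "dc:title", literal("Tomb of Jack the Ripper")),
    ("<object/Statue>", "dc:publisher", literal("Tate,London")),
]

print(to_turtle(triples))
```

Turtle also allows statements that share a subject to be grouped with ';', which is the form used in the RDF listing that follows; the one-statement-per-line form shown here is equivalent.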
Click here to view the RAW file.
RDF
@base <http://jacktheripper-1888.github.io/>.
@prefix crm: <http://www.cidoc-crm.org/cidoc-crm/>.
@prefix dbc: <http://dbpedia.org/resource/Category:>.
@prefix dbr: <http://dbpedia.org/resource/>.
@prefix dc: <http://purl.org/dc/terms/>.
@prefix ead: <http://www.loc.gov/ead/>.
@prefix edm: <http://www.europeana.eu/schemas/edm/>.
@prefix foaf: <http://xmlns.com/foaf/0.1/>.
@prefix owl: <http://www.w3.org/2002/07/owl#>.
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>.
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#>.
@prefix skos: <http://www.w3.org/2004/02/skos/core#>.
@prefix xsd: <http://www.w3.org/2001/XMLSchema#>.
@prefix wn: <http://www.w3.org/2006/03/wn/wn20/schema/>.
@prefix gn: <http://www.geonames.org/ontology#>.
<production/Statue>
rdf:type crm:E12_Production;
crm:entity "artwork";
gn:name "London,England";
dc:title "Tomb of Jack the Ripper";
gn:name "London";
edm:provider "Tate";
dc:creator "Marcel Marien";
dc:isPartOf "http://kulturarvsdata.se/GSM/samling/7";
ead:date "1976";
ead:year "2002".
<object/Statue>
dc:medium "5 tin cans, shell, fur, chipboard, wood and Perspex." ;
crm:entity "artwork";
dc:title "Tomb of Jack the Ripper";
ead:genreform "Surrealism";
rdfs:seeAlso "https://en.wikipedia.org/wiki/Surrealism";
dc:publisher "Tate,London";
edm:isRelatedTo "Jack the Ripper".
<person/marcel_marien>
rdf:type foaf:Person;
owl:sameAs dbr:Marcel_Marien;
dc:creator "Marcel_Marien";
gn:wikipediaArticle "https://en.wikipedia.org/wiki/Marcel_Mari%C3%Abn".
<concept/jack_the_ripper>
rdf:type crm:E73_Information_Object;
crm:P15_was_influenced_by "Whitechapel Murders";
rdfs:subClassOf skos:Concept;
skos:related "popular culture";
dc:date "1888";
rdfs:label "Jack the Ripper";
rdfs:comment "Artwork created by Belgian Surrealist artist, Marcel Marien in 1976, inspired by the Jack the Ripper murders.";
dc:creator "Marcel Marien";
foaf:isPrimaryTopicOf "https://www.tate.org.uk/art/artworks/marien-the-tomb-of-jack-the-ripper-t12048";
gn:wikipediaArticle "https://en.wikipedia.org/wiki/Marcel_Mari%C3%Abn".