## A gentle introduction to the SCTA data model ### The challenging of organizing our data --- Jeffrey C. Witt (Loyola University Maryland) ---

The Projectdata file

The basic task we are undertaking here is the traditional task of building a table of contents.

But this table of contents is also going to determine the ids and relationships between resources in our data model.

It is also going to provide us with guidance as to how we structure our files and directories.

While the immediate purpose of doing things this way may not seem immediately obvious, just keep the following rule of in mind.

"Strong organization enables automation."

Sample projectdata files:

Tip: you can load any of these files into Oxygen and play around with them by simply coping the url, selecting open url from the file menu, and pasting the url into the provided field.

Looking at the projectdata file

A word of caution

It is important to reiterate that the standardization of the projectdata file is still a work in progress.

As our community grows, its logic and standardization will improve

For the moment, this means we simply need to build the file in the same way as I have done, so that the automatic scripts will still work, even if this is not always the best way.

What's exciting about the prospect of working together, is that as we had more data, we discover more flaws or bugs in the system and logic, and then improve it together.

The Header

The header is not something you need to be too worried about. Its function is simply to collect metadata about this text that helps us link the text to its parent work group, to its author, and to the various witnesses to the text.

Div@id=body

This is simply the body tag that will contain your table of contents. (In the future, this mostly likely be changed to simply <body>.)

Items

Within this body tag, there are two main divisions you can make: Items and Divisions.

Items are the privileged divisions in your text. In the case of the Peter of Spain Tractatus, these seem to be Tractatus 1, Tractatus 2, etc, etc.

Items will correspond to the file divisions of your text. We expect there to be a separate directory and file for each section of the text that is identified as an Item. If your text is very small, it is possible that you will only have one Item. But in very large texts, it is extremely helpful to be able to break the text up into multiple files.

Divisions

Even though Items are privileged sections, it is also quite possible that these sections should be grouped into larger Divisions.

This is very common in Sentences Commentaries, where questions are grouped within distinction Divisions, and distinction Divisions are grouped with book Divisions.

hasWitnesses->witness->folio

The hasWitnesses section is designed to tell us whether or not a particular Item appears in a particular manuscript. The folio references go further to tell us on what folios this section of the text falls.

It is this data that allows us to build a table of contents for each manuscript as well as for the text as a whole.

The SCTA Database: the immediate result of processing a projectdata file

We can visualize what this data looks like at http://scta.info.

The LombardPress Web Client: consuming information pulled from the projectdata file

We can see this data being put to work at http://scta.lombardpress.org

The Mirador Web Client: consuming information pulled from the projectdata file

We can see this data being put to work at http://mirador.scta.info

Questions and Hands On Work