Jeffrey C. Witt (Loyola University Maryland) | @jeffreycwitt
2017 Universidad de Los Andes, Bogotá, Columbia, August 28th-30th, 2017
Slide Deck: http://lombardpress.org/slides/2017-08-28-los-andes-workshop | http://bit.ly/2wbCaJ9
Slide Deck: http://bit.ly/2wbCaJ9
Google Drive Workshop Files: http://bit.ly/2wi4acf
Google Doc for sharing notes and links: http://bit.ly/2wM76km
a. To be able to weigh the pros and cons of each method and to accurately assess the additional stages required by each approach to achieve different end goals.
a. To be able to understand the relationship between XML and semantic encoding
b. To be able to understand the relationship between XML and TEI
c. To understand the relationship between TEI and TEI Customizations
a. To understand various options for presenting and representing an XML/TEI text
b. To have a sense of the various tools available to help with the presentation of an XML/TEI document.
a. To understand how to do basic encoding of common features of a critical edition, such as a critical apparatus, an apparatus fontium, and diplomatic transcriptions
1) With a pen or pencil, draw a box around examples of these data types and give each data type a “label”.
2) Figure out how many different kinds of data are on this page.
3) Identify or explain how these data types are communicated to you as a human reader.
4) Identify what kinds of relationships between data types are communicated and how is this communicated.
Question: Were there any data types or relationships presented in ambiguous or unclear ways? What made these features ambiguous?
XML stands for eXtensible Markup Language and it is a specific technology or tool designed to allow us to semantically describe a text, document, or any kind of structured information.
XML semantically identifies pieces of data using opening and closing tags which are enclosed with angle brackets.
wrong
correct
Tags must be either siblings of another set of tags or children of a set of tags.
In addition to containing text and other elements, elements can take on "attributes".
The result of following these rules is a document whose content is nicely organized into a tree structure.
Use Oxygen to begin encoding our sample texts
To start a new file, open oxygen, select file->new, select "xml document"
Then start encoding. Remember you can create your own tags to describe your data. Just follow these basic rules: https://www.w3schools.com/xml/xml_syntax.asp
Because we have encoded our text semantically, we can now use this logic to programmatically style and restyle our text.
Styles can be added by linking a stylesheet to our XML document like so:
<?xml-stylesheet type="text/css" href="mystyles.css"?>
body {
display: block;
margin: 10px;
border: 1px black;
padding: 2px;
color: blue;
}
In Oxygen, open a new css file (file->new->css).
Save the file as mystyles.css
.
Add a link to this file at the top of your xml document like so: <?xml-stylesheet type="text/css" href="mystyles.css"?>
Use this resource https://www.w3schools.com/css/default.asp to start adding styles to your css stylesheet.
Open your xml document in your browser to see how your styles affect your document.
This means that XML actually does not specify any set of tags or element names. Anyone can make up their own set of elements and use them however they like.
It allows different industries and fields to create tags that meet their needs and their data.
If everyone can just make up their own tags, we can create confusion about what different tags mean and the datatypes they are encoding.
Someone might choose to tag something as <paragraph> and another person might choose to tag something as <para> and a third person might choose <p>.
How do we avoid this confusion?
Enter TEI
Explore the TEI Guidelines http://www.tei-c.org/release/doc/tei-p5-doc/en/html/index.html
Tutorials: http://teibyexample.org/
Question: What element should I use to encode a line in a poem?
Question: What element should I use to encode a new page and or a new column in a manuscript?
Question: What element (or elements) should I use to add an image?
See: http://teibyexample.org/modules/TBED01v00.htm?target=generalstructure#generalstructure
See: http://teibyexample.org/modules/TBED02v00.htm
See: http://teibyexample.org/modules/TBED03v00.htm?target=structure#structure
In Oxygen: select new->Framework Templates->TEI P5->All
Use the texts we worked with yesterday and begin to encode them according to the TEI guidlines, working to make a valid TEI document.
Use the guidines to look up new elements: http://www.tei-c.org/release/doc/tei-p5-doc/en/html/index.html and http://teibyexample.org/
Once, you have a valid document, create a new style sheet, and create a new presentation of your document.
Experiment with conversions of your file using Oxgarage: http://www.tei-c.org/oxgarage/
Open
Select the Simple TEI Edition Link
Create a new branch called "gh-pages"
Open the file xml/index.xml
Add a new pargraph
Save your changes
View your file on the web at: [you-user-name].github.io/simple-tei-edition
TEI Guidelines: http://www.tei-c.org/release/doc/tei-p5-doc/en/html/PH.html#PHST
Question: Why isn't TEI always enough?
EpiDoc: http://www.stoa.org/epidoc/gl/latest/app-alltrans.html
LombardPress-Schema Diplomatic: http://lombardpress.org/schema/docs/diplomatic/]
Using Page 2 of the "Billy Bud" manuscript, in the "examples" folder -- https://drive.google.com/open?id=0ByU6rbuuJtxGSFFYbjdWZjg1VzQ -- use one or all of the following guidelines to encode a diplomatic transcription of the manuscript page.
TEI has different elements to help you group different types of text classifications.
These lists can be added in various places within your TEI document.
For example:
<listPerson>
<person xml:id="Bob1">
<persName>Bob Smith</persName>
</person>
</listPerson>
<!-- -->
<p>Text dicussion a <name ref="#Bob1">Mr Smith</name></p>
Using the <cit>
element is a common way to group a quotation with bibliographic reference. For example:
<cit>
<quote>Quoted text</quote>
<bibl>Augustine, Book V</bibl>
</cit>
See http://lombardpress.org/schema/docs/critical/#apparatus-fontium
Using the TEI-WEB-EDITOR that we worked with yesterday...
/xml/lbp-1.0.0-critical-edition.xml
Using the <app>
a variation in witnesses can be recorded as follows:
<app>
<lem>fides</quote>
<rdg wit="#V">fidei</bibl>
</cit>
See the TEI Guidelines: http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html See the Lbp-Schema for an example of a customization of these guidelines http://lombardpress.org/schema/docs/critical/#apparatus-criticus
Using the TEI-WEB-EDITOR that we worked with yesterday...
/xml/lbp-1.0.0-critical-edition.xml