DAACS DTD documentationWritten by Sarah Wells and Daniel Pitti, March, 2004. IntroductionThe DAACS DTD is based on the TEI-Lite DTD and is TEI-compliant. All of these elements are in TEI-Lite, although they are not all used in the same way. The definitions here are intended for use in the DAACS DTD only. How to read the definitionsEach element has a description of its function and any special usage rules are explained. If other elements can be nested inside that element, they are listed as sub-elements. The following symbols are used to define how those nestable elements can be used:
If none of these symbols appear, assume that the element is required but cannot be repeated. The elements are placed inside sets of parentheses to clarify which symbols apply to which elements. For example, suppose that element <hello> can contain the following elements:
The only required sub-element is <foo>. It can also contain
one or zero <bar> elements, and zero or more <foo2> or <bar2>
elements (but not both). A proper usage might be:
Note that some non-textual characters, such as a copyright symbol, can be typed in as entities ("©"). Attributes are listed as either required or implied (i.e., optional). In some cases, they have a limited range of possible values. These values are listed in parenthesis. For example,
This means that there are two optional attributes, id and type. You can assign any value to id but type can only be set to "foo" or "bar". TypesThere are ten types of DAACS documents:
Elementsauthor This element contains the name of the author of a bibliographic
entry in the bibliography. The first author's name should be written last name
first. If there are multiple authors, the remaining names should be written
first name first. For example:
biblDescribes an item in the bibliography. It is nested in the <listBibl> element and contains the item's title and editor or author. Each <bibl> must have an id attribute, which is a unique name for each work listed in the bibliography. Any references to a work will refer to it by this name. The id should use one of the following formats: If the entry has one or more authors, the id should consists of
the primary author's last name plus the year of publication that is given in
the <date> element. If there are several publications from the same year,
the year should be followed by a letter. E.g.,
If there is no author or date, the id should
be the entry's title (or a reasonable abbreviation, if the title is long).
E.g.,
Either way, every <bibl> must contain either a <title> or <date>. The <date> is the year of publication. If available, the name(s) of either the editor(s)or author(s) can be included.
biblFullThis is bibliographic information about the source of the current XML file and is found inside the <sourceDesc> element in the <teiHeader>. It contains the file's title, its publication history, and how it was created.
bodyA standard TEI element that contains the body of an XML document. It is used in conjunction with the <front> element, which contains preliminary descriptive information. The division is similar to the division in a book between front matter, such as a table of contents, epigraph, and forward, and the main body of the text. It appears inside the <text> element. The <body> must include text inside a <p>, <listBibl>, or <list>; or a <div1>.
cellThis is used for data that will appear in a table cell. It is placed inside a <row> element. It can contain either text (i.e., PCDATA) or an <xref> (a cross-reference).
creationThis element is found inside the <profileDesc> element and
contains a statement about how the content was converted from its previous form
to XML and who was responsible for the conversion. For example,
dateThis can be found inside the <bibl> and <publicationStmt> elements. It contains a publication date. When used inside <bibl>, it indicates what year the work was
published. E.g.,
This date is used to assign the work a unique id, so it should
correspond to the <bibl> id (please see the
<bibl> element definition for more information). If an author published
more than one work in one year, the publication year should be followed by a
letter. For example,
When used inside the <publicationStmt>, the date is the year
that the XML document was published. To make the final HTML documents readable,
the date should be followed by a period. E.g.,
div1This is a standard TEI element used to mark a high-level section of a document. In the DAACS DTD, it is used inside the <front> and <body> elements. It can have an id or a type attribute. The id is used in
the history documents to identify sections of information. For example,
div2A standard TEI element used to mark a second-level section of a document. It can only be used in a <div1> element.
div3A standard TEI element used to mark a third-level section of a document. It can only be used in a <div2> element.
editionThis element appears inside <editionStmt> and describes
which edition of the document is contained inside the current XML file. The
documents in this site are all electronic editions and should therefore be
identified thus:
editionStmtThis element appears inside <fileDesc> and contains information in the current XML file. It must contain the <edition> element.
editor This element contains the name of the editor of a bibliographic
entry in the bibliography. The first editor's name should be written last name
first. If there are multiple editors, the remaining names should be have the first
name first. For example:
figDescThis element is used in the images documents. It is nested inside
a <figure> and provides a textual description of the image and/or an
<xptr> to a full-size version of the image. For example:
figureContains information about an image. It is used only in an images documents and is nested inside an <item> in a list of images. It contains a <figDesc> to describe the image. It must have an entity attribute, which
corresponds to an entity declaration at the top of the file. The entity
declaration, in turn, translates into a the location of the image file. For
example:
fileDescThis element contains file description information and is the first element inside the <teiHeader>. It holds bibliographic information about the XML file, including information about the origins and publication history of the file's content.
frontLike <body>, this is a standard TEI element. It is used in conjunction with the <body> element, and contains preliminary descriptive information. In DAACS documents, it contains pointers to other XML documents and images which appear in the backgroup or in side panels. The division is similar to the division in a book between front matter, such as a table of contents, epigraph, and forward, and the main body of the text. It appears inside the <text> element.
headThis is a heading for a <body>, <div1>, <div2>,
<div3>, or <list>. Use the <hi> element to render parts of a
heading in a specific style. For example,
Use the <xref> and <name> elements here as in another element.
hiIdentifies a portion of text that needs to be formatted in a specific manner. It can only be used inside the <head>, <item>, <name>, <p>, <title>, and <xref> elements. It must have a rend attribute, which describes how the text should be rendered. The description can be one or more options from a controlled list. Currently, the possible text renderings are:
<hi rend="vertical-align:text-top;font-size:xx-small">
itemMarks an item in a <list>. It can contain text, figures, and
cross-references to other files. Note that you can start a secondary
<list> inside the current list:
labelContains the title of an item in a <list>. It is nested
inside the <list> element and before the <item> tag. For example,
Be sure to be consistent about labeling items: do not label some but not others. The id attribute is not required, although it is used in the glossary to assign a unique id to every glossary entry.
listContains a list. It can be placed inside the <body>, <div1>, <div2>, or <div3> tags. It may contain a <head> and one or more <item> or <label> elements. It may have a type attribute, which identifies what kind of list it is. Currently, the possible types are:
listBiblThis is a list for the bibliography. It holds a list of <bibl> elements, which in turn contain information about the works.
nameThis is currently used to identify a building mentioned in a
background file. E.g.,
It can be found in <p>, <q>, and <head> elements.
noteContains a note about a bibliographic entry or about the file. It
can appear in the <teiHeader> in the <notesStmt> element or in the
<body> in the <bibl> element. It must use a type attribute to define the information as "technology" or
"bibliographic." When it is used in a <notesStmt>, it is technology. For
example,
When used in a <bibl> it is bibliographic. If it is a bibliographic note, it can contain one or more <title> elements.
notesStmtThis element contains notes about the XML file. It appears twice in the <teiHeader>, in the <fileDesc>, where it holds technical information about the file's encoding, and <biblFull>, where it describes what technology was used to encode the original content. In both cases, the information is in a nested <note type="technology"> element.
numThis is found inside the <title> element and is used to
specify the volume number of a work.
pThis is a standard TEI element. It can be used in several elements and is always used to mark the beginning and end of a paragraph of information.
profileDescContains information about how the XML file was created. It is located in the <teiHeader> element and contains a <creation> element.
publicationStmtContains information about where and when the file was published
and who published it. It is found inside the <biblFull> and
<fileDesc> elements and should look something like this:
publisherThe name of the publisher of the XML document. It is found in the <publicationStmt> element. To make the final HTML documents
readable, the publisher name should be followed by a comma and space. E.g.,
In all cases, the publisher is DAACS.
pubPlaceThe place where theXML document was published. It is found in the <publicationStmt> element. To make the final HTML documents
readable, this place name should be followed by a colon and space. E.g.,
Currently, all documents are published in Charlottesville.
qThis marks a quotation and is only used inside the <p> element. It can include an <xref> for citations.
rowMarks a row inside a <table>. It must contain one or more <cell> elements (be sure that each row in the table has the same number of cells). You may use the role attribute to mark the top row of a table (e.g., <row role="head">).
sourceDescDescribes the source of the document contained in the current XML file. It can be found in the <fileDesc> element and can contain either a <p> or <biblFull>. If the content is being encoded for the first time, use <p> and the word "Original." If the source of the content is another document (such as a Microsoft Word file), use <biblFull>.
tableThis marks a table of data in the text. It can only be found in <p> and <item> elements and must contain one or more <row> elements.
TEI.2This is the root element of a TEI-conformant file. It is used here because the XML markup we are using is essentially TEI. The root element is the first element in an XML file: all other elements in the file are nested inside. It comes after the XML and general entity declarations and must contain a <teiHeader> and <text> element.
teiHeaderThis contains header information about the XML file and is a required element in all TEI-based texts. It contains descriptive and bibliographic information about the file and is the first element in the <TEI.2> root element. It must contain a <fileDesc> and <profileDesc>. It must include an type and a n attribute. The type attribute indicates what type of document the file. As discussed above, there are currently ten types of DAACS documents, listed again here, and you must choose one of these types.
The n attribute associates the file with a given site or given type of information. XML attributes are case sensitive, so it is important it be spelled consistently. Currently, the following values are used:
termThis is used in the bibliography to associate a work with one or
more sites. It can be used in the <bibl> and <p> elements. The
type attribute must be used and must have the value of
“sitename." The site name used should be drawn from the same list used by the
<teiHeader> n attribute:
textThis is a standard TEI element and indicates the content of an XML file. It follows the <teiHeader> and can hold <front> and <body> elements.
titleContains the title of a bibliographic element, note, or the document. It can occur in the <bibl>, <note>, and <titlestmt> elements. When used in the <bibl> and <note> elements, it must use the level attribute. This attribute indicates what type of resource the bibliographic work is. There are five possible values for the level attribute: a, meaning an article (i.e., something that was published as part of a larger work); m, meaning a book,
manuscript, or other kind of monographic title (including single volumes of multi-volume works); j, meaning a journal;
s, meaning a series; and u, meaning an
unpublished paper or poster (including theses and dissertations unless published by a commercial press). An article title
should include the title of the
journal, book, or series in which it was published, in the <note>
element. For example,
titleStmtA standard TEI element that contains information about the title of the XML document. It is used inside the <fileDesc> and <biblFull> elements and contains a <title> element.
xptrThis is an empty element, meaning that it does not contain text or any nested elements and has a single tage marked with an "/" at the end. It is a pointer to another, internal, location in the current document or to an external document. It can be used in the <figDesc>, <item>, and <p> elements. It must include the doc and type attributes. The doc attribute
specifies an entity reference listed at the top of the document and the
type attribute specifies what kind of data the entity
reference will resolve to. For example, a pointer to a .gif file of an full
image of Building S might look like this:
When the file is processed by the style sheet, the reference will be resolved and the path of the .gif file will be inserted into the HTML. The type must be one of the choices listed below. Pointers to XML files can be used to associate the content of other files with the current file.
The n attribute can be used in conjunction
with dxf, gif, and
jpg to specify the size of the referenced file. It
should be in kilobytes and should include the abbreviation “K". E.g.,
xrefThis element is very similar to the <xptr> element. It is not an empty tag, though, and contains text. It is generally used to mark a portion of text, such as a cross-reference, and can be used in the <cell>, <head>, <item>, <p>, and <q> elements. It can also contain nested <xref> elements. There are no required attributes. The doc attribute can be used to provide an entity reference or to specify that the text is a cross-reference to the bibliography or glossary. The from attribute is used in cross-references and should correspond to either a <bibl> id in the bibliography or a <label> id in the glossary. The type attribute indicate what kind of data the reference is pointing to. Currently, possible values are:
|