|none (empty element)|
What is it?
<lb/> element is an empty element that marks a line break, and so we put one at the end of every typographic line in the document. The differences between our different types of line breaks allow some of these line breaks to be made visible in some of the edition's views, while remaining invisible in others. Please note that the word 'line' in 'line break' does not refer to verse lines. Verses should be tagged using the
<l> element, not the
We distinguish 3 types of
- hard returns for all views (without a
- soft returns for the image/text view (
- and hyphenated soft returns (
This tag is reserved for those rare cases where the author has clearly broken off a line for his own reasons, rather than being obliged to do so because of the document's limited dimensions. It is not necessary, however, to put such a bare
<lb/> at the end of each paragraph: in those cases, closing the
<p> tag will suffice.
An example of this could be a prose passage with some lines of dialogue. Each "turn" in the dialog is not a paragraph in itself, there are
<lb/> tags between turns to encode the hard return.
The letters of the
@rend attribute's value stand for 'image/text', meaning that the line break is to be visualized in the 'image/text' view, but not in the 'text' view. This is the most current type of line break, that takes place every time the author reaches the end of a line, and continues to write on the next one.
Hyphenated soft returns
In those cases of
@rend="it" where the author splits a word in two, using a hyphen, we use the
"-it" value. This way, the word will be hyphenated in the 'image-text' view, but merged back together in the 'text' view.
The hyphen is part of the word
When the hyphen at the end of the line is part of the word or phrase itself (such as in the French phrase: "c'est-à-dire"), we hardcode the hyphen, and end the line with a regular
@rend="it" line break. This way, the hyphen will be visible in both the 'text' view and the 'image/text' view. Please make sure not to write a space before or after the line break in those cases.
c'est-<lb rend="it">à-dire is CORRECT
c'est-<lb rend="it"> à-dire is INCORRECT
The end of the sentence coincides with the end of a line
In those cases where the end of the sentence and the end of the line are one and the same, please put the
<lb/> tag inside the
<seg> tag, as this will make sure that the line break will be visible in the 'image/text' view. For example:
<seg>This sentence ends at the end of the line<lb rend="it"/></seg>
We recommend you to transcribe the manuscript as follows:
- transcribe a line of text,
- leave a space,
- put in the linebreak,
- take a hard return (or two),
- and continue coding on the next line:
<p><seg>An example sentence.</seg> <seg>A se<lb rend="-it"/> cond sentence.</seg> <seg>Followed by a third <lb rend="it"/> sentence, that ends exactly at the line break.<lb rend="it"/> </seg> <seg>And then a last one.<seg></p>
Our basic principle applies: if you leave out all of the tags and hard returns, you will have a normally spaced text.
An other option (that makes it a little easier to encode, but a little harder to compare your transcription with the manuscript) would be to use a hard return for every
<seg>, rather than after every
<lb/>. For example:
<p> <seg>An example sentence.</seg> <seg>A se<lb rend="-it"/>cond sentence.</seg> <seg>Followed by a third <lb rend="it"/>sentence, that ends exactly at the line break.<lb rend="it"/></seg> <seg>And then a last one.<seg> </p>
Of course, you don't have to take a hard return after every
<seg>, you can also transcribe the text on a single line of code. For example:
<p><seg>An example sentence.</seg> <seg>A se<lb rend="-it"/>cond sentence.</seg> <seg>Followed by a third <lb rend="it"/>sentence, that ends exactly at the line break.<lb rend="it"/></seg> <seg>And then a last one.<seg></p>
However, this makes your transcription very difficult to read, both for human readers, and for computers: an XML document that uses fewer hard returns will take longer to load than a document that uses more of them.