====== TEI tagging ====== Each diplomatic transcription is encoded in TEI XML format. Each TEI file consists of a '''' tag and a '''' tag. In the '''', the most important tag to fill out is the '''' tag, which specifies the siglum of the witness which will be used in any generated apparatus. See the template for more details. The '''' tag is where the transcription will go. Each '''' should consist of a '''' with a number of paragraphs (''

'') and/or verses ('''' and ''''). Each paragraph or verse should have a unique ''xml:id''. These ''xml:id'' identifiers will be used to collate like sections of text together. For transcribing, a subset of TEI is used to mark features such as additions, deletions, and marginal annotations. ===== , , and ===== Indicates text that has been substituted. * the ''rend'' attribute indicates how the addition or deletion is marked. * the ''place'' attribute indicates where the addition is noted. sarvavidyāśabdānām > sarva~~vidyā~~śabdānām ===== , , and ===== Indicates text that the transcriber has corrected. vaṃtartatamānaḥ > va~~ṃta~~rtamānaḥ ===== ===== Indicates a caesura in a line of verse. suvarṇarucakādi yathā yuktaṃ svakair ākāraiḥ > suvarṇarucakādi ya-
thā yuktaṃ svakair ākāraiḥ ===== ===== Indicates a section of the text that is unreadable by the transcriber. * the ''reason'' attribute gives a reason for the text being illegible. * the ''unit'' attribute is the unit of measure. * the ''quantity'' attirbute is the length of the gap in ''unit''s. sārvārthyam > sā.......rvārthyam ===== ===== Indicates text that has been marked in some way for emphasis. * the ''rend'' attribute indicates how the text has been marked. ātma vastu svabhāvaś ca > ātma vastu svabhāvaś ca ===== ===== Indicates a new line beginning. * the ''n'' attribute indicates the line number. * if a line beginning corresponds to a word break, then the space appears before the '''' tag. vya-vahāre, ātmā tattvaṃ > vya-⸤vahāre, ātmā ⸤tattvaṃ ===== ===== Indicates a character that is not part of the content. * the ''function'' attribute indicates the function of the character. tat siddhiḥ✗ > tat siddhiḥ✗ ===== ===== Indicates a location in a document. * The ''n'' attribute indicates a page or folio number. * The ''unit'' attribute is the unit of measure. jātir vā dravyaṃ vā padārthāv iti > (From folio 34r4) jātir vā dravyaṃ vā padārthāv iti ===== ===== Indicates a note. * The ''place'' attribute indicates where the note is placed, such as in the ''top-margin'', ''footer'', ''appendix'', or ''inline''. * The ''xml:lang'' attribute indicates the language of the note. The main use of this is to write notes in English; if ''xml:lang="en"'' is set, then the note text will not be transcribed into other scripts. ‘ktaktavatū niṣṭhā’(pa॰ 1|1|16) > ‘ktaktavatū niṣṭhā’(pa॰ 1|1|16) ===== ===== Indicates a page beginning. * the ''n'' attribute indicates the folio or page number of the new page. anekaviṣayanihitapadānām > anekaviṣayanihiLtapadānām ===== ===== Used to split compounds words. aneka viṣaya nihita padānām > ankeaviṣayanihitapadānām The spaces are not displayed, but variant readings will be split using those spaces as guides. This manual procedure is only needed occasionally, to clarify the critical apparatus in the case of very long or complex compounds. ===== ===== Indicates text that has been retraced. gṛhītaṃ gṛhaśabdena śuddham evābhidhīyate > gṛhītaṃ gṛhaśabdena śuddham evābhidhīyate ===== ===== Indicates text that has been transcribed as found in the document, without correction. viśeṣopadhiḥ > viśeṣopadhiḥ ===== ===== Indicates a blank space in the text. * the ''unit'' attribute is the unit of measure. * the ''quantity'' attirbute is the length of the space in ''unit''s. ity arthaḥ atha ca > ity arthaḥ__ atha ca ===== ===== Indicates text that has been supplied by the transcriber. tad dravyam > tad dravyam ===== ===== Indicates text that the transcriber believes is superfluous. * the ''reason'' attribute indicates the transcriber's reason for marking the text as superfluous. iti teṣān darśanaṃnaṃ > iti teṣān darśanaṃ⸤naṃ ===== ===== Indicates a passage that is unclear to the transcriber. * the ''reason'' attribute indicates why the text is unclear. svarūpānyathāt tānāpapaptiḥ > svarūpānyathāt tānāpapaptiḥ