File /Humanist.vol22.txt, message 223


Date:         Mon, 29 Sep 2008 08:39:25 +0100
From: Humanist Discussion Group <willard.mccarty-AT-MCCARTY.ORG.UK>
Subject: 22.231 how to design an online database
To: humanist-AT-Princeton.EDU


               Humanist Discussion Group, Vol. 22, No. 231.
       Centre for Computing in the Humanities, King's College London
                        www.princeton.edu/humanist/
                     Submit to: humanist-AT-princeton.edu



         Date: Mon, 29 Sep 2008 08:33:21 +0100
         From: Humanist Discussion Group <willard.mccarty-AT-mccarty.org.uk>
         Subject: Re: 22.227 how to design an online database?

> The online databse I envision is a mixture of Google Book Search and
> the Corpus del Espanol. That is, I'd like for the site to offer a
> visual, facsimile (with the option of viewing the text in an
> expanded, plain text version)as well as to have word search (exact
> word or wild card)options.

The best way to approach this, I would think, is first to create a
transcription of the texts in TEI/XML. That transcription is independent
from the operating system (unix/microsoft) and also independent from
any program that you use, at a second stage, to create a web
representation of your texts. One of the advantages of this approach is
that you can create multiple presentations, tailored to different needs.

Samples of editions of parallel texts that were created based on a TEI
encoding:

Shorter Chinese Samyukta Agama:
http://buddhistinformatics.chibs.edu.tw/BZA/cluster.xql?base=bza001

Preface to Ancrene Wisse
http://www.tei-c.org.uk/Projects/EETS/AW-tt.html

Freising Manuscripts
http://nl.ijs.si/e-zrc/bs/index-en.html
(a number of versions in parallel: http://nl.ijs.si/e-zrc/bs/html/bsPA.html)

Another sample, this one of TEI encoded texts integrating advanced
search, facsimile display and a textual rendering:
Proceedings of the Old Bailey, 1674-1913: http://www.oldbaileyonline.org/

But there are many more.

The Text Encoding Initiative (http://www.tei-c.org/), as many on this
listserv will know, is a consortium of libraries, scholars, and others,
that has been working for 20+ years on creating guidelines for text
encoding in the humanities. If you need more information, please mail me
off-list.

Peter Boot
pboot-AT-xs4all.nl

From - Mon Sep 29 08:55:26 2008
X-Mozilla-Status: 0001
X-Mozilla-Status2: 00000000
   

Humanist Main Page

 

Display software: ArchTracker © Malgosia Askanas, 2000-2005