@PhilosTEI: Building Corpora for Philosophers

In J. Odijk & A. Van Hessen (eds.), CLARIN in the Low Countries. London, United Kingdom: pp. 379-392 (2017)

Arianna Betti
University of Amsterdam
Hein Van Den Berg
University of Amsterdam
The step to e-research in philosophy depends on the availability of high-quality, easily and freely accessible corpora in a sustainable format, composed of multi-language, multi-script books from different historical periods. Corpora matching these needs are at the moment virtually non-existent. Within @PhilosTEI, we have addressed this corpus-building problem by developing an open-source, web-based, user-friendly workflow from textual images to TEI, based on state-of-the-art open-source OCR software, namely Tesseract, and a multi-language version of TICCL, a powerful OCR post-correction tool. We have demonstrated the utility of the tool by applying it to a multilingual, multi-script corpus of important eighteenth- to twentieth-century European philosophical texts.

Similar books and articles

Why African Philosophers Should Build Systems: An Exercise in Conversational Thinking. Ojah Uti Egbai - 2018 - Filosofia Theoretica: Journal of African Philosophy, Culture and Religions 7 (1):34-52.
Construction Area (No Hard Hat Required). Karen Bennett - 2011 - Philosophical Studies 154 (1):79-104.
Theory and Experiment in Philosophy. Piet Hut - 1999 - Journal of Consciousness Studies 6 (2-3):2-3.
Machine Readable Corpora. S. Bernardini - 2006 - In Keith Brown (ed.), Encyclopedia of Language and Linguistics. Elsevier. pp. 358-375.
On Architecture. Fred Rush - 2008 - Routledge.
Corpora in Studies of Variation. Robert Sigley - 2006 - In Keith Brown (ed.), Encyclopedia of Language and Linguistics. Elsevier. pp. 220-226.

