T-PEN (Transcription for Paleographical and Editorial Notation)

T-PEN (Transcription for Paleographical and Editorial Notation) is a web-based tool for working with images of manuscripts. Users attach transcription data (new or uploaded) to the actual lines of the original manuscript in a simple, flexible interface.




T-PEN automatically recognizes columns and lines. Users can modify this automatic layout segmentation before transcribing.
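
The text above does not say how T-PEN detects lines, so the following is only an illustrative sketch of one common approach (a horizontal projection profile), not T-PEN's actual algorithm. It assumes a page already binarized into rows of 0 (background) and 1 (ink); the function name `find_text_lines` and the parameter `min_ink` are hypothetical.

```python
from typing import List, Tuple


def find_text_lines(image: List[List[int]], min_ink: int = 1) -> List[Tuple[int, int]]:
    """Return inclusive (top, bottom) row ranges for each band of inked rows.

    Assumption: `image` is a binarized page where 1 = ink and 0 = background.
    A row counts as part of a text line when it holds at least `min_ink` ink pixels.
    """
    lines: List[Tuple[int, int]] = []
    start = None  # first row of the band currently being scanned, if any
    for y, row in enumerate(image):
        has_ink = sum(row) >= min_ink
        if has_ink and start is None:
            start = y  # a new band begins
        elif not has_ink and start is not None:
            lines.append((start, y - 1))  # the band ended on the previous row
            start = None
    if start is not None:
        lines.append((start, len(image) - 1))  # band runs to the bottom edge
    return lines


# Toy page: two bands of ink separated by blank rows.
page = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [1, 0, 1, 1],
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
detected = find_text_lines(page)
```

Because the result is a plain list of row ranges, a user correction of the kind described above amounts to editing, adding, or removing entries in that list before any transcription text is attached to them.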



T-PEN:
  • is an open and general tool for scholars of any level of technical expertise
  • allows transcriptions to be created, manipulated, and viewed in many ways
  • lets users collaborate with others through simple project management
  • exports transcriptions as PDF or XML (plaintext) for further processing, or contributes them to a collaborating institution with a click
  • respects existing and emerging standards for text, image, and annotation data storage
  • avoids prejudice in data, allowing users to find new ways to work

As of April 2014, T-PEN provides access to more than 4000 manuscripts (e.g. through links with e-codices), either publicly available or on restricted access within specific projects.


T-PEN version 2.0 was launched in May 2012 with four new features: (1) users can upload their own image sets for transcription; (2) T-PEN fully supports crowd-sourcing projects; (3) T-PEN provides access to support tools for transcribers; and (4) an additional, still experimental feature, glyph matching, adds a paleographical analysis tool to T-PEN.