These pages offer example scripts for preprocessing tasks such as renaming annotations, selecting portions of a document, splitting a document, correcting encoding errors, and splitting long lines into multiple lines.
Beyond preprocessing, scripts can also report useful information, such as the number of annotations in a document.
Perl and the GNU/Unix text tools are best suited to this kind of task.
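As a rough illustration of the tasks listed above, here is a minimal shell sketch. It assumes, purely for illustration, that annotations are stored as inline XML-style tags such as `<PERSON>` and `</PERSON>`; the tag names, the sample sentence, and the 40-column limit are all hypothetical and are not taken from the scripts on these pages. It also assumes a `grep` that supports `-o` (GNU and BSD grep both do).

```shell
#!/bin/sh
# Hypothetical sample text with inline XML-style annotations (assumed format).
sample='<PERSON>Ada Lovelace</PERSON> met <PERSON>Charles Babbage</PERSON> in <LOCATION>London</LOCATION>.'

# Count annotations: grep -o prints each match on its own line, wc -l counts them.
person_count=$(printf '%s\n' "$sample" | grep -o '<PERSON>' | wc -l | tr -d ' ')
echo "PERSON annotations: $person_count"

# Rename an annotation: rewrite both the opening and the closing LOCATION tag.
renamed=$(printf '%s\n' "$sample" \
  | sed -e 's#<LOCATION>#<PLACE>#g' -e 's#</LOCATION>#</PLACE>#g')
echo "$renamed"

# Split long lines: fold -s breaks at blanks so no line exceeds 40 columns.
wrapped=$(printf '%s\n' "$sample" | fold -s -w 40)
echo "$wrapped"
```

Note that `fold` knows nothing about annotations, so it may break a line in the middle of an annotated span; whether that matters depends on the format your tools expect. Run anything like this on a copy of your data first.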
- We will do our best to answer questions about how these scripts work, but you first need a good knowledge of the programming language used, as the scripts are not user-friendly.
- There is a chance that they will destroy all your data (and your home), so be warned.
- GNU/Unix scripts
- Perl scripts