The Bangor Autoglosser allows CHAT files to be glossed automatically in Welsh, Spanish and English.
The code (licensed under the GPL v3) is available here. Install Git and then run:
Note that cloning the repository will download over 100Mb, and will take around 12 minutes. Note also that the autoglosser is work-in-progress, and liable to substantial change.
For publications about the autoglosser, see the publications page.
The databundle referred to in the ISB8 presentation is available here.
bilingualism@bangor.ac.uk
The Siarad corpus
The Patagonia corpus
The Miami corpus
The support of the Arts and Humanities Research Council (AHRC), the Economic and Social Research Council (ESRC), the Higher Education Funding Council for Wales (HEFCW) and the Welsh Government is gratefully acknowledged.