The Corpus of Contemporary Irish is a monolingual collection of Irish-language texts in digital format. It consists of edited texts which have been published from the beginning of the 21st century onwards. The corpus currently includes texts from Beo!, the Cló Iar-Chonnacht archive, the Cois Life archive, Comhar, Éabhlóid, Feasta, The Irish Times, Cló Mhaigh Eo, Meon Eile, NÓS, Nuacht RTÉ, Seachtain, Tuairisc.ie, and An tUltach. It contains 15 million words.
The corpus has been used as an internal terminological resource in Fiontar agus Scoil na Gaeilge for some time but it is now being made freely available to the general public. Fiontar agus Scoil na Gaeilge is very grateful to the publishers and copyright holders who have given permission to use their material.
The search interface is very simple. There is a specific search (‘This phrase as is’) and a broad search. Results can be filtered according to collection in the bar to the right.
We intend to expand the content and improve the functionality of the corpus over time. We greatly appreciate feedback from our users and we are particularly interested in hearing from copyright holders who have digital material in Irish that would be suitable for the scope of this corpus. We can be contacted at firstname.lastname@example.org.