HomePublications ➤ upeksha2015implementing

Implementing a Corpus for Sinhala Language

Dimuthu Upeksha, Chamila Wijayarathna, Maduranga Siriwardena, Lahiru Lasandun, Chinthana Wimalasuriya, N. H. N. D. De Silva, Gihan Dias
Symposium on Language Technology for South Asia 2015

This paper presents the project we did to develop a corpus, which is continuously updating, dynamic and covers a wide range of topics for Sinhala language. This paper will introduce the technologies we have used in the project and will discuss its design features.

Keywords: Natural Language Processing | Big Data | Sinhala | Corpus | Corpus Linguistics | Language Corpus |