START Conference Manager    

Building and Evaluating a Distributional Memory for Croatian

Jan Snajder, Sebastian Pado and Zeljko Agic

The 51st Annual Meeting of the Association for Computational Linguistics - Short Papers (ACL Short Papers 2013)
Sofia, Bulgaria, August 4-9, 2013


Abstract

We report on the first structured distributional semantic model for Croatian, dm.hr. It is constructed after the model of the English Distributional Memory (Baroni and Lenci, 2010), from a dependency-parsed Croatian web corpus, and covers around 2M lemmas. We give details on the linguistic processing and the design principles. An evaluation shows state-of-the-art performance on a semantic similarity task with particularly good performance on nouns. The resource is freely available.


START Conference Manager (V2.61.0 - Rev. 2792M)