Web site Classification TrackOverviewThe purpose of this track is to evaluate methods of Web site topic classification.For this track the standard procedure is used. Test CollectionThe source dataset consists of BY.web collection and DMOZ collection. The latter is used as a training set.Task Description for Participating SystemsEach participant is granted access to the training set, DMOZ collection, a set of web sites (not single web pages!) from BY.web collection. The task is to assign topics from training set to each web to each web site. Valid number of topics per site is from 0 to 5. Topics should be returned as an ordered list for each web site.
The training set is based on a subset of the Russian-language categories from the
Evaluation Methodology
Data Formats |