Tool for interactive redescription set exploration

Redescription mining [5] is a field of Knowledge Discovery that aims to find different characterizations of the same or similar subsets of entities. In addition, it allows users to find connections between attributes from different attribute sets, called views. This is useful in many fields of science, such as biology, economics and medicine.


The main purpose of the InterSet tool is to allow interactive, comprehensive exploration of redescription sets. On this page you can test the features of the tool by exploring redescriptions created on two different datasets. The first dataset contains attributes describing world countries using general country information and country trading patterns for the year 2012 [2,7,8]. The second dataset contains attributes describing the co-authorship graph and the author-conference bipartite graph [1,10]. The tool is described in more detail in the paper "InterSet: Interactive redescription set exploration", published in the proceedings of the Discovery Science Conference (DS'16) [6].


General data information

Country data contains 199 world countries. The data contains two views:

DBLP data contains 6455 authors. The data contains two views:

We created 4150 redescriptions on the Country data and 3674 redescriptions on the DBLP data. The redescriptions were created with a redescription mining algorithm based on Predictive Clustering Trees [3]. The main idea of the redescription generation process is described in [4].


Redescription set exploration

The tool offers interactive redescription set exploration based on three aspects: the entities described by the redescriptions in the set, the attributes contained in the redescription queries, and general redescription properties. The properties commonly used in the literature are the Jaccard index, the p-value and the redescription support [1].
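
To make these properties concrete, the following is a minimal sketch of how they might be computed for a single redescription. It is not taken from InterSet: the function names, the toy entity sets, and the binomial form of the p-value (the probability of an overlap at least this large if the two queries were independent) are assumptions chosen for illustration.

```python
from math import comb


def redescription_support(supp_q1, supp_q2):
    """Entities described by both queries (intersection of the query supports)."""
    return supp_q1 & supp_q2


def jaccard_index(supp_q1, supp_q2):
    """Size of the intersection of the query supports divided by the size of their union."""
    union = supp_q1 | supp_q2
    if not union:
        return 0.0
    return len(supp_q1 & supp_q2) / len(union)


def binomial_p_value(supp_q1, supp_q2, n_entities):
    """Assumed formulation: binomial tail probability of observing an overlap
    at least as large as the actual one when the queries are independent."""
    p = (len(supp_q1) / n_entities) * (len(supp_q2) / n_entities)
    k = len(supp_q1 & supp_q2)
    return sum(
        comb(n_entities, i) * p**i * (1 - p) ** (n_entities - i)
        for i in range(k, n_entities + 1)
    )


# Hypothetical toy data: 10 entities, two queries from different views.
supp_q1 = {"e1", "e2", "e3", "e4"}
supp_q2 = {"e2", "e3", "e4", "e5"}

shared = redescription_support(supp_q1, supp_q2)
j = jaccard_index(supp_q1, supp_q2)
pval = binomial_p_value(supp_q1, supp_q2, n_entities=10)
print(sorted(shared), j, pval)
```

A high Jaccard index means the two queries describe nearly the same entities, while a low p-value suggests that the overlap is unlikely to be due to chance.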


Information and resources

In the information and resources section you can find instructions on how to set up and use the tool. It also provides resources that should help in understanding what redescription mining is, why it is used and how redescriptions are created.