Project Information


Community search, which aims to find the most likely community that contains the query node, has attracted great attention from both academic and industry areas. To facilite community retrieval algorithms, we designed C-Explorer. C-Explorer is a web-based platform and it can assist users in extracting, visualizing, and analyzing communities. The FPY project also involves further research based on the ACP community retrieval algorithm.


Various documentation

Project plan(PDF)
Interium report(PDF)
Final report(PDF)


Progress and Deliverables

The FYP has been finish. The total project is divided into two parts. We first developed a web-based system named "C-Explorer" as proposed. After that, we continued to study the problem of edge-attributed community search problem and designed a corresponding algorithm for retriving edge-attributed communities.

C-Explorer
Link to C-Explorer


Community search algorithms, which aim to find the most likely community containing the query node, have been one of the hottest research topics in graph mining. To facilitate community search algorithms, we have proposed C-Explorer, which has the following functions:

• Enables users to formulate community search queries to retrieve and view communities which they are interested in.

• Compares the efficiency of different community search algorithms.

• Supports attributed graphs, in which each vertex in attributed graphs is associated with a set of attributes. C-Explorer can help to look for attributed communities, in which vertices are cohesive both structurally and semantically.

• Provides interfaces are provided for researchers to plug in different algorithms for testing or visualization.

Exploration

Exploration

The “Exploration” page is for formulating attributed community search queries and viewing the communities returned.

• The left panel is for query formulation and the user interface is designed specifically for community search queries. After specifying the query names, degree and the set of attributes, users can click on the “Search” button at the bottom to send the query to the server.

• If the “Reset” button is clicked, the query names and attributes will be deleted to start a new search. The resulting communities will be displayed in the right panel of the page.

• The “Theme” section at the top shows the set of attributes that all vertex share and it should be a subset of the attributes specified by the user in the query.

• If the result contains more than one community, users can choose to view different communities by clicking on the community at the bottom.

Analysis

The layout of “Analysis” (see Figure 2.2) is similar to that of “Exploration”.

Because some of the community search algorithms do not support more than one query vertex, only one query name can be inputted. Once the text field loses its focus, the keywords of the name in the text field will be loaded and users can click on the “Compare” button after the selection of keywords.

Two metrics (CMF and CPJ) are used to evaluate the overall effectiveness of algorithms. The two metrics are considered effective in measuring the cohesiveness of a community in [1]. Usually, communities with better cohesiveness will score higher in these two metrics. The result is put into two charts and shown in the upper half of the right section(see Figure 2.2). The statistics of each algorithm are shown in the table under “Community Statistics” for users to check and compare.

EACQ

The previous attributed-community search problem treat attributes as the characteristic of vertices. However, in social networks these characteristics are often reflected by interactions between users. Hence we propose to explore the possibility of retriving communities based on edge-attributes.

We modified the ACQ algorithm to solve the edge-attributed community search problem. The details of methodology, implementation and results can be found in the final report.



Project Progress and schedule

1 October 2017

Deliverables of Phase 1

(Inception) 

•   Detailed project plan

•   Project web page

22 October 2017

Solve Efficiency problem of the program and deploy the program with ACQ embedded online

November 2017

Design and testing of APIs for researchers to plugin algorithms and datasets; Do literature collection and review of CR algorithms.

December 2017

Learn about key concepts and related algorithms in CR algorithms.

8-12 January 2018

First presentation

21 January 2018

Deliverables of Phase 2

(Elaboration)

•   Preliminary implementation

•   Detailed interim report 

January – February 2018

Design and test the algorithm

March – 15 April 2018

Combine the algorithm with the program for demonstration, write reports about the algorithm.

15 April 2018

Deliverables of Phase 3

(Construction)

•   Finalized tested implementation

•   Final report 

16-20 April 2018

Final presentation 

2 May 2018

Project exhibition