Abstract

Social recommendation algorithm is a common tool for recommending interesting or potentially useful items to users amidst the sea of online information. The users usually have various relationships, each of which has its unique impact on the recommendation results. It is unlikely to make accurate recommendations solely based on one relationship. Based on user-item bipartite graph, this paper establishes a multisubnet composited complex network (MSCCN) of multiple user relationships and then extends the mass diffusion (MD) algorithm into a novel intelligent recommendation algorithm. Two public online datasets, namely, Epinions and FilmTrust, were selected to verify the effect of the proposed algorithm. The results show that the proposed intelligent recommendation algorithm with two types of relationships made much more accurate recommendations than that with a single relationship and the traditional MD algorithm.

1. Introduction

The era of information technology is defined by the explosion of all sorts of information. The wealth of information not only provides human with more choices but also causes serious information overload [15]; amidst the sea of information, it is difficult for users to find the information that they really need. As a result, how to provide users with useful information from massive data has become a research hotspot. To solve information overload, many recommendation systems have been developed to predict user preference based on their historical data and recommend interesting or potentially useful information to them [68].

The common recommendation methods can be divided into collaborative filtering (CF) recommendation [9], content-based recommendation [10], spectrum analysis recommendation [11, 12], and hybrid recommendation [13, 14]. The recommendation methods based on user or item similarity are also quite popular, namely, mass diffusion (MD) algorithm and heat spreading (HeatS) algorithm. The MD is a classic algorithm capable of making accurate recommendations of items to users. By the principle of energy distribution, the algorithm spreads energy from the initial items to different items and recommend the items with relatively high energy at a high probability.

After predicting user preference, Bin et al. [15] evaluated the popularity of each item and used it to modify the MD algorithm. Based on mass diffusion and heat spreading in physical kinetics, Hu et al. [16] summarized the mechanism of energy distribution and realized a recommendation mechanism based on the tripartite of user, item, and topic. Li et al. [17] managed to improve recommendation quality through singular value decomposition (SVD) of the recommendation results.

The above studies mainly recommend items to users based on the item scores rated by users. However, none of them have considered the influence of multiple social relationships on user preference. In fact, the preference of a user is affected by his/her trusted friends. Therefore, this paper constructs a complex network with multiple social relationships. The network incorporates multiple user relationships into the recommendation system, which effectively improves the recommendation accuracy.

2. Multisubnet Composited Complex Network (MSCCN) Model

There are various systems that can be modeled in our daily life, such as the urban system, ecosystem, transport system, and recommendation system. In a complex system, each object can be treated as a node, and each relationship between objects can be represented as an edge, turning the system into a complex network [1820].

The MSCCN model [2023] can describe multiple relationships among heterogeneous individuals in complex systems. For example, users and items can be considered nodes, while the user relationships and user-item relationships can be regarded as edges in the recommendation system.

By integrating multiple subnets and relationships among complex systems, the MSCCN model can be expressed as a four-tuple , where is the set of nodes, is the set of edges among the nodes, is the set of relationships ( is a type of relationship; is the number of relationship types), and : is the mapping of set to by the function, reflecting the type of relationship on each edge.

The structure of a typical MSCNN model is presented in Figure 1, where , , and . As shown in Figure 1, the edges , , and only have relationship, the edges , , and only have relationship, and the edge has both and relationships.

3. MD Algorithm Based on MSCCN Model

3.1. MD Algorithm

As proposed by Li et al. [17], the MD algorithm can be implemented as follows: First, a unit of energy is placed on the items selected by the target user. Then, the energy spreads from the items to the users who select them. At this time, the energy obtained by user can be calculated bywhere is the degree of item ; is the number of items; is the energy of item ; is the selection variable of items. If item is selected by user , then .

Next, the energy that is spread to users is distributed to the items selected by users. After the distribution, the energy obtained by item can be expressed aswhere is the degree of user ; is the number of users.

Let and be the initial and final energy of items, respectively. Then, the entire diffusion process can be defined aswhere is the state transition matrix. Each element of the matrix satisfies

After diffusion, the unselected items are arranged in descending order by the resource quantity, forming a recommendation list. This recommendation list is generated solely based on user-item relationships, without considering the social relationships of the target user.

The workflow of the MD algorithm is illustrated in Figure 2. It can be inferred that is the initial energy of items and is the final energy of items after being processed by MD algorithm.

3.2. MD Algorithm Based on MSCCN Network (SMD)

By the MD algorithm, the recommendation list is generated solely based on user-item relationships, failing to consider the social relationships of the target user. To overcome the defect, this paper introduces the social relationship into the recommendation system by setting up a multirelationship network, allowing the initial energy of item nodes to spread along the edges in the network.

Let and be user-item relationship and user relationship in the multirelationship network, respectively. It is assumed that the weights of the two types of relationship are both 1. Then, the proportionality coefficient of and that of satisfy . Assuming that is , then , . After introducing one type of social relationship, the SMD algorithm can be implemented as follows:(i)Step 1. One unit of energy is distributed to the items selected by the target user, serving as the initial energy of items.(ii)Step 2. The initial energy of items spread to the users who have selected them. The energy obtained by user can be described aswhere and are selection variables of items; is the degree of item for relationship . If  = 1, user and item have relationship. If  = 1, user and item have relationship.(iii)Step 3. The energy spreads to users and items along edges in the composite network. Then, the energy obtained by user and that obtained by item can be, respectively, described as(iv)Step 4. The energy allocated to the user via relationship further spreads to the items. Then, the energy obtained by the item can be depicted asThus, the total energy obtained by item can be defined asThe element of the state transition matrix satisfiesNext, the unselected items are sorted by the final energy, producing a recommendation list.The diffusion process of the SMD algorithm with one type of relationship (SMD1) is explained by the example in Figure 3, where .

In real social networks, there are usually multiple relationships, each of which has its unique impact on recommendation results. The multiple relationships could be introduced to the MD algorithm through the loading operation of the MSCCN model [20]. Here, another type of relationship denoted as is imported to the network of Figure 3. Suppose and . Then, , , and . In this case, the element of state transition matrix satisfies

Figure 4 shows the process of the SMD algorithm with two types of relationships (SMD2), where and .

If or , that is, or , the SMD2 algorithm degenerates into the SMD1 algorithm.

4. Results and Discussion

4.1. Experimental Data

Two public online datasets, Epinions and FilmTrust, were selected to evaluate the performance of the proposed algorithm. The Epinions dataset contains 40,272 users, 139,738 items, 487,182 user relationships, and more than 664,000 item scores rated by users. The FilmTrust dataset provides 1,050 users, 2,071 items, >1,800 user relationships, and 35,497 item scores rated by users. The original data were preprocessed by removing the relationships of the users that have not selected any items.

4.2. Evaluation Indices

Fivefold cross validation was employed to test the performance of the SMD algorithm. Specifically, the preprocessed data were randomly split into five subsets. For each experiment, a random subset was chosen as the test set, and the remaining subsets were chosen as training sets. Five experiments were conducted to ensure that the SMD algorithm is tested on each subset. The results of the five experiments were averaged to obtain the final result.

Then, the mean ranking score (RS) [15] was introduced to evaluate the ranking accuracy of the SMD algorithm, and the Hamming distance [24] was adopted to measure the diversity of recommendation results.

Suppose the target user has chosen the item in the test set. Then, the ranking of item in the recommendation list can be calculated. For the mean RS of all the items chosen by the user from the test set, the higher the value of mean RS, the better the accuracy of the recommendation algorithm. The RS of user can be calculated bywhere is the number of items preferred by user in test set; is the item preferred by user in the test set.

The Hamming distance can be defined aswhere and are two users; is the number of overlapping items in the recommendation lists of the two users; is the length of the recommendation list. If , the two recommendation lists have no overlapping items; if , the two recommendation lists are identical.

4.3. Results Analysis

The numerical simulation was performed to determine the values of and . Figure 5 presents the simulation results of SMD1 on the two datasets with and changing between various values.

As shown in Figure 5, the optimal value was reached at on Epinions dataset, indicating that SMD1 has the highest accuracy at . Similarly, it can be inferred that SMD1 has the highest accuracy at on FilmTrust dataset.

Let and be the sets of items chosen by users and , respectively. The greater the number of overlapping items in the two sets is, the more likely that the two users have the same interests, and the greater they influence each other. The user similarity can be defined asIf , the two users have similar interests.

Figure 6 displays the simulation results of SMD2 with both and changing between various values.

As shown in Figure 6, the value reached the minimum at and on Epinions dataset. This means, on Epinions dataset, the SMD2 has the highest accuracy at and . Similarly, the SMD2 has the highest accuracy at and on FilmTrust dataset. Since relationship is denser than relationship, the recommendation accuracy increases as approaches 1 on FilmTrust dataset.

For comparison, traditional recommendation algorithms, namely, HeatS algorithm and Hybrid algorithm [25], were also applied on Epinions dataset and FilmTrust dataset. The values of the Hybrid algorithm were minimized on the two datasets, when the regularization parameter was 0.67 and 0.5, respectively. Table 1 compares the results of the contrastive algorithms at the optimal values of and .

As shown in Table 1, SMD2 achieved slightly higher recommendation accuracy than SMD1. The accuracy of the two algorithms was marked higher than that of the traditional algorithms on the two datasets. The results indicate that the recommendation system becomes more accurate, thanks to the introduction of multiple social relationships.

The diversity of the recommendation list is also an important indicator of recommendation quality. Figure 7 shows the diversity trends of recommendation list on Epinions dataset and FilmTrust dataset, with the list length changing between various values.

As shown in Figure 7, the length of the recommendation list is positively correlated with the similarity between recommendation lists for different users. On the two datasets, SMD2 and SMD1 had lower recommendation diversity than the MD algorithm. The relatively low diversity is the result of the following: After user relationship(s) is introduced, the recommendation list for a user is affected by his/her historical scores and also the historical scores of the relevant users. In other words, the recommendation list of a user will bear a high resemblance to that of users, who has social relationships with the user. The HeatS algorithm achieved higher recommendation diversity than the MD algorithm because it attracts the historical scores of the user to the less popular items.

In the real world, some users only select very few items. For implicitly, the users who have purchased fewer than 30 items were defined as small-degree users, and the other users as generous-degree users. The small-degree users generate relatively few data to be used for recommendation. Therefore, the recommendation to such users is usually of low accuracy. Then, the users with ( is the degree of user ) on the two selected datasets were treated as a small-degree user group. Figure 8 shows the recommendation results of SMD for this user group on the two datasets, with and changing between various values.

As shown in Figure 8, SMD1 algorithm achieved the minimum RS at and , respectively, on Epinions and FilmTrust datasets. The value decreased from the scenario in Figure 5, where SMD1 achieved the highest accuracy at and , respectively. The value determines the number of resources that need to be transmitted through the social network to reach the item during the diffusion process. The smaller the value, the greater the resource demand.

To further explore the relationship between parameters and , the RS values were grouped by user degree. Figure 9 shows the values corresponding to the optimal RS values on the two datasets.

As shown in Figure 9, the relationships between and exhibited a consistent trend on the two datasets: the smaller the degree of the target user, the smaller the value for the RS to reach the optimal value. A user with a small degree chooses only a few items and provides limited information for the recommendation algorithm. Therefore, the recommendation accuracy for small-degree users needs to be improved based on social network information.

To demonstrate the superiority of SMD in the recommendation for small-degree users, the small-degree users satisfying were grouped by , as and reached the optimal values on the two datasets. Figure 10 compares the optimal RS values under SMD and MD for small-degree users.

As shown in Figure 10, the highest RS values were achieved by MD algorithm on Epinions dataset, followed in turn by SMD1 and SMD2. That is, for small-degree users, SMD outperforms the MD in recommendation accuracy; the advantage increases with the reduction of user degree. The same was observed on FilmTrust dataset. To sum up, SMD is more effective than MD in the recommendation for small-degree users.

5. Conclusions

This paper introduces social information to the MSCCN model to create a complex network of multiple relationships. On this basis, the MD algorithm was improved into the SMD. Experimental results show that the SMD made a highly accurate recommendation to users because it considers the potentially useful items of different users in the social network. It is also learned that SMD2 outperformed SMD1 and MD, indicating that the integration of two types of relationships is better than incorporating only one type of relationship or considering no social relationship. In other words, the recommendation accuracy can be greatly improved by introducing multiple social relationships between users. In addition, the experiments also demonstrated that the SMD is highly effective in making accurate recommendations to small-degree users. The future research will discover the implicit social relationships between users and evaluate their impacts on recommendation results.

Data Availability

The basic data used in this paper were downloaded from two public datasets online: the Epinions http://www.trustlet.org/epinions.html and the FilmTrust https://www.librec.net/datasets.html.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This research was funded by Shandong Provincial Natural Science Foundation, China, Grant no. ZR2017MG011.