JWE Abstracts 

Vol.15 No.3&4 July 1, 2016

Review Article:

Enhancing Keyword Suggestion of Web Search by Leveraging Microblog Data (pp181-202)
       
Lin Li, Lu Qi, Fang Deng, Shengwu Xiong, and Jingling Yuan
Query suggestion of Web search is an effective approach to help users quickly express their information need and accurately get the information they need. Most of popular web-search engines provide possible query suggestions based on their query log data, which is a kind of implicit relevance based approach. However, it is difficult to give suggestions to search queries that have no or few historical evidences in query logs. To solve this problem, traditional pseudo relevance based approaches directly extract additional keywords from the top-listed search results of a given search query as suggestions. However, for hot topic or event related search queries, users more like to browse the latest and newly appeared contents. In this paper, we follow the direction of pseudo relevance based suggestion approaches by mining microblog data that is inherent in fast information propagation and dissemination. Our graph based rank aggregation approach combines a frequency based ranking with considering words themself and a LDA (Latent Dirichlet Allocation) based ranking by mining hidden topics behinds words. A dataset is crawled from the posts of fourteen micro-topics of Sina microblog platform. The experimental results clearly demonstrate our proposed approach is more effective than traditional pseudo relevance based methods. Moreover, the suggested keywords extracted from the posts published by \textit{authenticated users} are more effective than two traditional pseudo relevance based approaches, i.e., the posts submitted by \textit{all users} and the top returned posts returned by Sina search engine. In addition, applying LDA on microbog posts alone is far from satisfactory, but the combination of the frequency based ranking and the LDA based ranking show much better performance.

Privacy-Preserving Collaborative Web Services QoS Prediction via Yao’s Garbled Circuits and Homomorphic Encryption (pp203-225)
       
Lu Li, An Liu, Qing Li, Liusheng Huang, Wei Yang, and Guanfeng Liu
Collaborative Web services QoS prediction has become an important tool for the generation of accurate personalized QoS which is a cornerstone of most QoS-based approaches for Web services selection and composition. While a number of achievements have been attained on the study of improving the accuracy of collaborative QoS prediction, little work has been done for protecting user privacy in this process. In this paper, we propose a privacy-preserving collaborative QoS prediction framework which can protect the private data of users while retaining the ability of generating accurate QoS prediction. We combine Yao's garbled circuit and additively homomorphic encryption via additively secret sharing to address non-linear computations required in the process of QoS prediction. We implement the proposed framework based on FasterGC, an open source implementation of Yao's garbled circuit, and conduct extensive simulations to study its performance. Simulation results, together with theoretical security and complexity analysis, show that privacy-preserving QoS prediction can be efficiently achieved in our framework.

Outbreak Power Measurement for Evolution Course of Web Events (pp226-248)
       
Xingzhi Wang, Xiangfeng Luo, Hui Zhang, and Huimin Liu
Nowadays, emergencies have a great impact on people’s daily lives. Web makes it possible to study emergencies from web information due to its real-time, open, and dynamic features. Measuring temporal features in evolution course of web events can help people timely get knowledge and understand emergent events, which contribute to reducing harms to our society caused by emergencies. In this paper, we propose an outbreak power measuring algorithm for the evolution of web events, in order to provide guidance for automatic detection and prediction of emergencies. An iterative algorithm is firstly introduced to calculate outbreak power of web events through increased web pages of events, increased attributes of events, and distribution of attributes in web pages and the relationships of attributes. Secondly, definition of web events types is proposed. From studying each type of web events, we dig out feature patterns and find laws of each type events, with hot event having the highest outbreak power while general event have the lowest outbreak power, and general event fluctuating most while urgent event fluctuating least, which can be prior knowledge of web events we study. And then, a fuzzy based algorithm is presented to discriminate the type of web events. By means of prior knowledge, membership grade of web events belong to each type can be calculated, and then the type of web events can be discriminated. Experiments on real data set demonstrate the proposed algorithm is both efficient and effective, and it is capable of providing accurate results of discrimination.

Searching for Relevant Tweets Based on Topic-Related User Activities (pp249-276)
       
Tomoya Noro and Takehiro Tokuda
 
Twitter is one of the largest social media. Although it can be used to get information on a topic of interest, it is not easy for us to find tweets relevant to the topic due to a massive amount of tweets and the small size of each tweet. Some relevant tweets may not include any terms explicitly related to the topic, and general content-based keyword search techniques and query expansion techniques are not effective for finding such relevant tweets. To solve this problem, we present a method for finding tweets on a topic of interest based on the Twitter user activities related to the topic such as tweet, retweet, and reply. The method consists of two phases: the preparation phase and the main phase. In the preparation phase, we create a user-tweet reference graph representing the relation between users and tweets based on the past user activities related to the topic, calculate the influence of each user and tweet in the topic, then define two types of each user's power, called ``Voice'' and ``Impact'', indicating ``how much voice the user has on the topic'' and ``how much impact the user has on the other users' tweets on the topic''. In the main phase, we calculate the relevance of newly-arrived tweets to the topic according to the Voice and the Impact score of the users who posted, retweeted, or replied to each of the tweets, then rank the tweets by the relevance score. The two phases are processed independently. Once the preparation phase is completed, the main phase can return the final result any time. Experimental results show that ``who retweeted or replied to the tweet'' is more effective for judging the relevance of each tweet to the topic than ``who posted the tweet'', and our method can find relevant tweets which do not include any terms explicitly related to the topic. We compare our method with an indegree-based method and a PageRank-based method, and show that our method outperforms the methods compared.

A Description-Based Hybrid Composition Method of Mashup Applications for Mobile Devices (pp277-309)
       
Korawit Prutsachainimmit and Takehiro Tokuda
Mashup application composition methods have been proposed for quick development of new mobile applications from existing resources. The existing methods have succeeded in developing data-flow mashup applications. However, they have limited capability to create event-driven mashup applications. A full treatment of data-flow and event-driven mashup composition is not yet achieved. This paper presents a new methodology for developing data-flow and event-driven mashup applications for mobile devices. Our hybrid composition method allows integration of mobile applications and REST Web services in a data-flow and event-driven manner. Description-based techniques and application generator tools are applied to reduce development cost. A mashup development system is implemented in Android mobile environment as the first experimental platform. The evaluation results show that our method is expressive and efficient in composing mobile mashup applications.

A Semantic Approach for Dynamically Determining Complex Composed Service Behaviour (pp310-338)
       
Carla Vairetti, Rosa Alarcon, and Jesus Bellido
Dynamic Web services composition aims to generate a composition plan at run-time. Semantic-based techniques rely on annotating services to facilitate the discovery of the service components that satisfy a user need (matchmaking). The matchmaking process places most attention on service selection rather than on the behaviour of the composed service, and the service components are arranged considering simple control-flow patterns (mainly sequence). In real life scenarios, however, composed service behaviour follows complex control-flow patterns that satisfy the needs of business processes, which are generally defined through manual service composition. In this paper we present a technique to derive complex composed service behaviour semantics, such semantics make possible to dynamically and automatically discover complex services compositions. We have implemented and tested our technique with a known dataset with better performance when compared to simple service composition strategies.

Multimedia Data Retrieving based on SOA Architecture (pp339-360)
       
Sid Ahmed Djallal Midouni, Youssef Amghar & Azeddine Chikh
In our past research we have already defined a full service approach to compose MaaS services for multimedia data retrieving. This approach is based on a four phases process: description; filtering; clustering; and restitution. In this work, we are especially interested in the description and filtering phases of this process. Our contribution is two-fold. First, we propose to extend for the MaaS description the W3C recommendation on semantics for web services (SAWSDL). To do so, we use two types of ontologies: a Domain Ontology encompassing concepts that define semantics of the related business domain and a Multimedia Ontology encompassing concepts that define a set of annotation properties of the multimedia content. Second, we show how this extension allows addressing the problem of matching between MaaS services and user needs.

Back to JWE Online Front Page