WSP: A Network Coordinate based Web Service Positioning Framework
for Response Time Prediction

Jieming Zhu,  Yu Kang,  Zibin Zheng,  and Michael R. Lyu

Department of Computer Science and Engineering
The Chinese University of Hong Kong

With the rapid growth of Web services in recent years, the optimal service selection from functionally-equivalent service candidates has become more critical for building high quality service-oriented systems. To provide accurate QoS values for service selection, user-side QoS prediction thus becomes an important research problem. Although collaborative filtering based prediction approaches have been studied in several previous works, these methods suffer from the limitation of the sparsity of available historical QoS data, which greatly degrades the prediction accuracy. To address this problem, this paper proposes a Web service positioning (WSP) framework for response time prediction, which is one of the most important QoS properties. In our approach, a small set of landmarks are deployed to periodically monitor the response times of the Web service candidates and provide references to the numerous service users. By combining the advantages of network coordinate based approaches and collaborative filtering based approaches, the response times between users and Web services can be accurately predicted using their corresponding Euclidean distances. Extensive experiments are conducted based on our real-world QoS dataset collected on PlanetLab, comprising about 359,400 response time values from 200 users on 1,597 Web services. The experimental results show that our WSP approach outperforms the other existing approaches, especially when the historical data is sparse.

Read more from our paper:
-------------------------------
Jieming Zhu, Yu Kang, Zibin Zheng, and Michael R. Lyu, "WSP: A Network Coordinate based Web Service Positioning Framework for Response Time Prediction," in Proc. of IEEE International Conference on Web Services (ICWS), 2012. [Paper][Slides]

Dataset Release

This dataset is collected via the PlanetLab platform on Dec. 5 in 2011, comprising the RTTs between 200 distributed PlanetLab nodes and 1,597 Web services and also the RTTs between the 200 Planetlab nodes. The unit of each value is millisecond. Please refer to our paper for more information on the dataset collection.

Download dataset List of contents of the dataset
-----------------------------------
1. "P2P" (298 KB): This file is a 200-by-200 data matrix, comprising RTTs between 200 PlanetLab nodes. This matrix is asymmetric.
2. "W2P" (2.36 M): This file is a 1597-by-200 data matrix, comprising RTTs between 200 nodes and 1,597 Web service hosts.
3. "node_list.txt" (6 KB): This file decribes the ID and address of each PlanetLab node.
4. "WS_list.txt" (39 KB): This file decribes the ID and address of each Web service.
5. "readme.txt" (2 KB): This file decribes the dataset in detail.