
Web Mining: A Review


Abstract:
The World Wide Web, or simply the Web, is one of the largest sources of information in the world. Almost any subject we can think of is probably covered by some page on the Web. Information on the Web comes in different forms and types, such as text documents, images, and video clips. However, extracting useful information without the help of Web tools is not a trivial process. This is where Web mining comes in: it provides tools that help us extract useful knowledge from Web data. In this paper, we provide an overview of Web mining and discuss the well-known algorithms applied in its three types: content, structure, and usage mining. Some future directions for this area are also provided.



