JCSE, vol. 7, no. 3, pp.147-158, 2013
DOI: http://dx.doi.org/10.5626/JCSE.2013.7.3.147
Deep Web and MapReduce
Yufei Tao
Division of Web Science and Technology, Korea Advanced Institute of Science and Technology, Daejeon, Korea
Abstract: This invited paper introduces results on Web science and technology obtained during work with the Korea Advanced
Institute of Science and Technology. In the first part, we discuss algorithms for exploring the deep Web, which refers to
the collection of Web pages that cannot be reached by conventional Web crawlers. In the second part, we discuss sorting
algorithms on the MapReduce system, which has become a dominant paradigm for massive parallel computing.
Keyword:
Web; Big data; MapReduce; Parallel computing; Algorithm; Theory
Full Paper: 229 Downloads, 2535 View
|