
JCSE, vol. 9, no. 4, pp. 190-203, December 2015

DOI: http://dx.doi.org/10.5626/JCSE.2015.9.4.190

Crowdsourcing Identification of License Violations

Sanghoon Lee, Daniel M. German, Seung-won Hwang, and Sunghun Kim
Department of Computer Science and Engineering, Pohang University of Science and Technology (POSTECH), Pohang, Korea; Department of Computer Science, University of Victoria, Canada; Department of Computer Science, Yonsei University, Seoul, Korea; Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong

Abstract: Free and open source software (FOSS) has created a large pool of source code that can be easily copied to create new applications. However, a copy should preserve the copyright notice and license of the original file unless the license explicitly permits such a change. As software evolves, it is challenging to keep the original licenses or to choose proper ones, and as a result there are many potential license violations. Although violations can have a high impact on copyright protection, identifying them is highly complex and relies on manual inspection by experts. Such inspection cannot scale to the volume of open source software released daily worldwide. To make this process scalable, we propose two methods: using machine-based algorithms to narrow down the potential violations, and guiding non-experts to manually inspect violations. Using the first method, we found 219 projects (76.6%) with potential violations. Using the second method, we show that the accuracy of crowds is comparable to that of experts. Our techniques might help developers identify potential violations, understand their causes, and resolve them.

Keywords: Crowdsourcing; Software license; Violation; Clone detection
