Tools and Datasets for Mining Libre Software Repositories

Title: Tools and Datasets for Mining Libre Software Repositories
Publication Type: Book Chapter
Year of Publication: 2011
Authors: Robles, Gregorio, Jesus M. Gonzalez-Barahona, Daniel Izquierdo-Cortazar, and Israel Herraiz
Editor: Koch, Stefan
Book Title: Multi-Disciplinary Advancement in Open Source Software and Processes
Volume: 1
Chapter: 2
Pagination: 24–42
Publisher: IGI Global
City: Hershey, PA
ISBN Number: 9781609605148
Keywords: data mining, open source, tools
Abstract

Thanks to the open nature of libre (free, open source) software projects, researchers have gained access to a rich set of data related to various aspects of software development. Although these data are usually publicly available on the Internet, obtaining and analyzing them in a convenient way is not an easy task, and many considerations have to be taken into account. In this chapter we introduce the most relevant data sources that can be found in libre software projects and that are commonly studied by scholars: source code releases, source code management systems, mailing lists, and issue (bug) tracking systems. The chapter also offers advice on the problems that arise when retrieving and preparing these data sources for later analysis, as well as information about the tools and datasets that support these tasks.

URL: http://www.igi-global.com/book/multi-disciplinary-advancement-open-source/46171
DOI: 10.4018/978-1-60960-513-1
Attachment: tools-datasets-jossp.pdf (199.04 KB)