Show simple item record

dc.contributor.advisorDa Silva, Dilma
dc.creatorSmith, Cameron Maurice
dc.date.accessioned2019-01-17T23:05:14Z
dc.date.available2020-08-01T06:38:06Z
dc.date.created2018-08
dc.date.issued2018-05-23
dc.date.submittedAugust 2018
dc.identifier.urihttps://hdl.handle.net/1969.1/173656
dc.description.abstractIn this research, we investigate how the mining of student software repository data can be useful in capturing development analytics in educational software projects. Our methodology was to demonstrate the feasibility of extracting and analyzing software repository data automatically and show examples of how to analyze the obtained data. We designed an application toolset that works with GitHub, a web-based version control platform that Texas A&M University makes freely available to the students. Our toolset can be used with GitHub software repositories hosting programming assignments developed by students as part of their coursework. We consider how the analysis of information available in a software repository revision history can enable inspection of student programming assignment progression behaviors. For example, for a given programming assignment, using analytics derived from a set of corresponding student software repository changelogs, one can generate assignment progression statistics. As a result of the exploratory phase of this research, we demonstrate usage of our toolset with anonymized student GitHub repository data from two previous courses taught at Texas A&M University. We conclude that it is feasible to automate the extraction of student GitHub repository data that may lead to valuable observations about student software project development patterns. We make our tools available to the community so that other relevant questions regarding the relationship between software development analytics and student learning can be explored.en
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.subjectSoftware configuration managementen
dc.subjectDistributed version control systemen
dc.subjectMining software repositoriesen
dc.subjectComputer science educationen
dc.titleA Toolset for Mining GitHub Repositories in Educational Software Projectsen
dc.typeThesisen
thesis.degree.departmentComputer Science and Engineeringen
thesis.degree.disciplineComputer Scienceen
thesis.degree.grantorTexas A & M Universityen
thesis.degree.nameMaster of Scienceen
thesis.degree.levelMastersen
dc.contributor.committeeMemberDuffield, Nick
dc.contributor.committeeMemberShipman, Frank
dc.type.materialtexten
dc.date.updated2019-01-17T23:05:15Z
local.embargo.terms2020-08-01
local.etdauthor.orcid0000-0003-1627-4287


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record