Github application programme interface and wordnet for code reuse

dc.contributor.author: Pirapuraj
dc.contributor.author: Perera, Indika
dc.date.accessioned: 2016-12-30T09:49:02Z
dc.date.available: 2016-12-30T09:49:02Z
dc.date.issued: 12/29/2016
dc.description.abstract: Code reuse is an important task in software development and maintenance. Because a large amount of software and source code is available as libraries in version control systems such as Git, SVN, and LibreSource, and on related web sites such as GitHub.com, sourceforge.net, projectsgeek.com, and Googlecode.com, more and more companies, especially Small and Medium Enterprises (SMEs), are reusing open source code to develop their own software. The problem in code reuse is that, after downloading all the relevant code, we need to identify the most relevant code among the pool. In this paper we use keyword search with the n-gram NLP technique through the GitHub Application Program Interface (API). Before searching the source code, we retrieve the names of all GitHub repositories that belong to a particular programming language (Java, C++, etc.); when searching for Java libraries, we also retrieve all .java file names using the GitHub API. We then compare our keyword with this list. If the keyword extracted from the software architecture is a connected (compound) word, we split it using the Apache Camel Splitter. If the keyword is related to any project, we download that project. Otherwise, we obtain synonyms from WordNet and repeat the above process. For further relevancy, we will use a speech recognition technique (Dynamic Time Warping (DTW)) and an NLP technique (Part of Speech Tagging (POS)). Because this is one part of a larger research effort, this paper considers only the GitHub API. [en_US]
dc.identifier.citation: Proceedings of Fifth Annual Science Research Sessions 2016 on "Enriching the Novel Scientific Research for the Development of the Nation", pp. 7-16 [en_US]
dc.identifier.isbn: 9.78956E+12
dc.identifier.uri: http://ir.lib.seu.ac.lk/handle/123456789/2050
dc.language.iso: en_US [en_US]
dc.publisher: Faculty of Applied Sciences, South Eastern University of Sri Lanka [en_US]
dc.subject: n-gram [en_US]
dc.subject: GitHub API [en_US]
dc.subject: WordNet [en_US]
dc.subject: Dynamic Time Warping [en_US]
dc.subject: Part of Speech Tagging [en_US]
dc.subject: Apache Camel Splitter [en_US]
dc.title: Github application programme interface and wordnet for code reuse [en_US]
dc.type: Article [en_US]
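
The keyword-matching steps described in the abstract might be sketched roughly as below. This is an illustrative sketch under stated assumptions, not the authors' implementation: the function names, the 0.5 threshold, and the use of character trigrams with Jaccard overlap are all hypothetical choices. A regular expression stands in for the Apache Camel Splitter, a plain dictionary stands in for WordNet, and the list of repository names is passed in directly rather than retrieved through the GitHub API.

```python
import re


def split_compound(keyword):
    """Split a connected word such as 'FileReader' or 'file_reader' into lowercase words.

    Assumption: a regex stands in for the Apache Camel Splitter named in the abstract.
    """
    parts = re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+", keyword)
    return [p.lower() for p in parts]


def char_ngrams(text, n=3):
    """Return the set of character n-grams of text (the whole text if it is shorter than n)."""
    text = text.lower()
    if len(text) < n:
        return {text} if text else set()
    return {text[i:i + n] for i in range(len(text) - n + 1)}


def ngram_similarity(a, b, n=3):
    """Jaccard overlap between the character n-gram sets of a and b (0.0 to 1.0)."""
    ga, gb = char_ngrams(a, n), char_ngrams(b, n)
    if not ga or not gb:
        return 0.0
    return len(ga & gb) / len(ga | gb)


def find_matches(keyword, names, synonyms=None, threshold=0.5):
    """Try the keyword first, then its synonyms, against a list of names.

    names:    repository or file names (assumed to be fetched beforehand).
    synonyms: hypothetical dict mapping a keyword to alternative terms,
              standing in for a WordNet lookup.
    Returns (term_that_matched, matching_names), or (None, []) if nothing matched.
    """
    candidates = [keyword] + list((synonyms or {}).get(keyword.lower(), []))
    for term in candidates:
        words = split_compound(term)
        if not words:
            continue
        # A name is a hit when every word of the term resembles some word of the name.
        hits = [
            name for name in names
            if all(
                any(ngram_similarity(w, part) >= threshold
                    for part in split_compound(name))
                for w in words
            )
        ]
        if hits:
            return term, hits
    return None, []
```

With a hypothetical name list such as ["ImageResizer", "json-parser", "http-client"], calling find_matches("JsonParser", names) tries the keyword itself, while find_matches("photo", names, {"photo": ["image", "picture"]}) falls back to the synonym list once the original keyword finds nothing.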

Files

Original bundle

Name: ASRS 2016- Conference Proceeding - Page 7-16.pdf
Size: 593.29 KB
Format: Adobe Portable Document Format
