Duplex Output Software Effort Estimation Model with Self-guided Interpretation

Citation: UNSPECIFIED.

Full text not available from this repository. (Request a copy)

Official URL: http://www.sciencedirect.com/science/article/pii/S...

Abstract

Context: Software eﬀort estimation (SEE) plays a key role in predicting the eﬀort needed to complete software development task. However, the conclusion instability across learners has aﬀected the implementation of SEE models. This instability can be attributed to the lack of an eﬀort classiﬁcation benchmark that software researchers and practitioners can use to facilitate and interpret prediction results. Objective: To ameliorate the conclusion instability challenge by introducing a classiﬁcation and self-guided interpretation scheme for SEE. Method: We ﬁrst used the density quantile function to discretise the eﬀort recorded in 14 datasets into three classes (high, low and moderate) and built regression models for these datasets. The results of the regression models were an eﬀort estimate, termed output 1, which was then classiﬁed into an eﬀort class, termed output 2. We refertothe models generated inthis study as duplex output models as they return twooutputs. Theintroduced duplex output models trained with the leave-one-out cross validation and evaluated with MAE, BMMRE and adjusted R2, can be used to predict both the software eﬀort and the class of software eﬀort estimate. Robust statistical tests (Welch's t-test and Kruskal-Wallis H-test) were used to examine the statistical signiﬁcant differences in the models’ prediction performances. Results: Weobserved the following: (1) the duplex output models not only predicted the eﬀort estimates, they also oﬀeredaguidetointerpretingtheeﬀortexpended; (2)incorporatingthegeneticsearch algorithmintothe duplex output model allowed the sampling of relevant features for improved prediction accuracy; and (3) ElasticNet, a hybrid regression, provided superior prediction accuracy over the ATLM, the state-of-the-art baseline regression. Conclusion: The results show that the duplex output model provides a self-guided benchmark for interpreting estimated software eﬀort. ElasticNet can also serve as a baseline model for SEE.

Item Type:	Journal article
Uncontrolled Keywords:	Duplex output, Eﬀort estimation Eﬀort, classiﬁcation Multiple regression models
Subjects:	Q Science > QA Mathematics > QA76 Computer software
Divisions:	Schools > Centre for Business, Information Technology and Enterprise > School of Information Technology
Depositing User:	Michael Bosu
Date Deposited:	12 Oct 2017 02:27
Last Modified:	21 Jul 2023 04:44
URI:	http://researcharchive.wintec.ac.nz/id/eprint/5476

Actions (login required)

: View Item

Search for collections on Wintec Research Archive