TY - GEN
T1 - Feature weighting for Case-based reasoning software project effort estimation
AU - Sigweni, Boyce
PY - 2014
Y1 - 2014
N2 - Background: Software effort estimation is one of the most important activities in software development process. Un-fortunately, estimates are often substantially wrong and specifically most projects encounter effort overruns. Numerous methods have been proposed including Case based reasoning (CBR). Existing research shows that feature subset selection (FSS) is an important aspect of CBR, however, searching for the optimal feature weights is a combinatorial problem and therefore NP-hard. Objective: To develop and evaluate efficient algorithms to generalise FSS into an effective feature weighting approach that can improve accuracy further, since not all features contribute equally to solving the problem. Method: Use various search algorithms e.g., forward sequential weighting (FSW) and random mutation hill climbing (RMHC) to assign weight to features in order to improve the estimation accuracy. We will extend an existing CBR java shell ArchANGEL1. We will perform experiments based on repeated measures design on real world datasets to evaluate these algorithms. Limitations of the proposed research: Dataset quality cannot be assured therefore our findings could be influenced by noisy data. Older datasets may be misrepresenting current software development approaches and technologies. CBR could be sensitive to the choice of distance metric; however, we will only use standardised Euclidean distance.
AB - Background: Software effort estimation is one of the most important activities in software development process. Un-fortunately, estimates are often substantially wrong and specifically most projects encounter effort overruns. Numerous methods have been proposed including Case based reasoning (CBR). Existing research shows that feature subset selection (FSS) is an important aspect of CBR, however, searching for the optimal feature weights is a combinatorial problem and therefore NP-hard. Objective: To develop and evaluate efficient algorithms to generalise FSS into an effective feature weighting approach that can improve accuracy further, since not all features contribute equally to solving the problem. Method: Use various search algorithms e.g., forward sequential weighting (FSW) and random mutation hill climbing (RMHC) to assign weight to features in order to improve the estimation accuracy. We will extend an existing CBR java shell ArchANGEL1. We will perform experiments based on repeated measures design on real world datasets to evaluate these algorithms. Limitations of the proposed research: Dataset quality cannot be assured therefore our findings could be influenced by noisy data. Older datasets may be misrepresenting current software development approaches and technologies. CBR could be sensitive to the choice of distance metric; however, we will only use standardised Euclidean distance.
UR - http://www.scopus.com/inward/record.url?scp=84905484617&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84905484617&partnerID=8YFLogxK
U2 - 10.1145/2601248.2613081
DO - 10.1145/2601248.2613081
M3 - Conference contribution
AN - SCOPUS:84905484617
SN - 9781450324762
T3 - ACM International Conference Proceeding Series
BT - 18th International Conference on Evaluation and Assessment in Software Engineering, EASE 2014
PB - Association for Computing Machinery
T2 - 18th International Conference on Evaluation and Assessment in Software Engineering, EASE 2014
Y2 - 12 May 2014 through 14 May 2014
ER -