Background: Developers inevitably make human errors while coding. These errors can lead to faults in code, some of which may result in system failures. It is important to reduce the faults inserted by developers as well as fix any that slip through.Aim: To investigate the fault insertion and fault fixing activities of developers. We identify developers who insert and fix faults, ask whether code topic 'experts' insert fewer faults, and experts fix more faults and whether patterns of insertion and fixing change over time. Methods: We perform a time-based analysis of developer activity on twelve Apache projects using Latent Dirichlet Allocation (LDA), Network Analysis and Topic Modelling. We also build three models (using Petri -net, Markov Chain and Hawkes Processes) which describe and simulate developers' bug-introduction and fixing behaviour.Results: We show that: the majority of the projects we analysed have developers who dominate in the insertion and fixing of faults; Faults are less likely to be inserted by developers with code topic expertise; Different projects have different patterns of fault inserting and fixing over time.Conclusions: We recommend that projects identify the code topic expertise of developers and use expertise information to inform the assignment of project work.

Fault-insertion and fault-fixing behavioural patterns in Apache Software Foundation Projects

Ortu, M
Primo
;
2023-01-01

Abstract

Background: Developers inevitably make human errors while coding. These errors can lead to faults in code, some of which may result in system failures. It is important to reduce the faults inserted by developers as well as fix any that slip through.Aim: To investigate the fault insertion and fault fixing activities of developers. We identify developers who insert and fix faults, ask whether code topic 'experts' insert fewer faults, and experts fix more faults and whether patterns of insertion and fixing change over time. Methods: We perform a time-based analysis of developer activity on twelve Apache projects using Latent Dirichlet Allocation (LDA), Network Analysis and Topic Modelling. We also build three models (using Petri -net, Markov Chain and Hawkes Processes) which describe and simulate developers' bug-introduction and fixing behaviour.Results: We show that: the majority of the projects we analysed have developers who dominate in the insertion and fixing of faults; Faults are less likely to be inserted by developers with code topic expertise; Different projects have different patterns of fault inserting and fixing over time.Conclusions: We recommend that projects identify the code topic expertise of developers and use expertise information to inform the assignment of project work.
2023
Faults analysis; LDA; Mining software repositories
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0950584923000411-main.pdf

accesso aperto

Tipologia: versione editoriale
Dimensione 5.45 MB
Formato Adobe PDF
5.45 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11584/362963
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact