This paper discusses as its primary research question the viability of using the Mahalanobis Distance as a multivariate method for detecting outliers in an industrial setting. An algorithm is used to detect future customer returns in a printed circuit board production line situated in Sibiu, Romania. From the literature, there is a lack of methods, tools and guidelines concerning the paradigm of Zero-Defect Manufacturing. The novelty of the method presented includes separation of highly specialized, future outliers from other outliers, and further automation using Python, a Docker container, a graphical user interface, a search-engine and a reporting tool. This allows the method to be used without external assistance. The data used is extracted industrial datasets from Continentals datalake. The algorithm detects 20% of future outliers and has been implemented by Continental. This can possibly be improved by increasing domain knowledge. The generality of the algorithm in principle allows for use at any of Continental’s production lines. There are strong assumptions regarding the requirements for the method, including benefits of employing domain knowledge critical variable identification and detection rate improvements. Further improvements of detection rate are also discussed. The paper concludes that the algorithm can detect a percentage of highly specialized outliers with simple automation in Python, but also acknowledges limitations in terms of increased demands from data quality and domain knowledge.

Identifying Customer Returns in a Printed Circuit Board Production Line Using the Mahalanobis Distance

Arena S.;
2023-01-01

Abstract

This paper discusses as its primary research question the viability of using the Mahalanobis Distance as a multivariate method for detecting outliers in an industrial setting. An algorithm is used to detect future customer returns in a printed circuit board production line situated in Sibiu, Romania. From the literature, there is a lack of methods, tools and guidelines concerning the paradigm of Zero-Defect Manufacturing. The novelty of the method presented includes separation of highly specialized, future outliers from other outliers, and further automation using Python, a Docker container, a graphical user interface, a search-engine and a reporting tool. This allows the method to be used without external assistance. The data used is extracted industrial datasets from Continentals datalake. The algorithm detects 20% of future outliers and has been implemented by Continental. This can possibly be improved by increasing domain knowledge. The generality of the algorithm in principle allows for use at any of Continental’s production lines. There are strong assumptions regarding the requirements for the method, including benefits of employing domain knowledge critical variable identification and detection rate improvements. Further improvements of detection rate are also discussed. The paper concludes that the algorithm can detect a percentage of highly specialized outliers with simple automation in Python, but also acknowledges limitations in terms of increased demands from data quality and domain knowledge.
2023
978-3-031-43687-1
978-3-031-43688-8
Automation
Industry 4.0
Multivariate Analysis
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11584/392143
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact