ModSec-Learn: Boosting ModSecurity with Machine Learning

Scano, Christian; Floris, Giuseppe; Montaruli, Biagio; Demetrio, Luca; Valenza, Andrea; Compagna, Luca; Ariu, Davide; Piras, Luca; Balzarotti, Davide; Biggio, Battista

doi:10.1007/978-3-031-76459-2_3

ModSecurity is widely recognized as the standard open-source Web Application Firewall (WAF), maintained by the OWASP Foundation. It detects malicious requests by matching them against the Core Rule Set (CRS), identifying well-known attack patterns. Each rule is manually assigned a weight based on the severity of the corresponding attack, and a request is blocked if the sum of the weights of matched rules exceeds a given threshold. However, we argue that this strategy is largely ineffective against web attacks, as detection is only based on heuristics and not customized on the application to protect. In this work, we overcome this issue by proposing a machine-learning model that uses the CRS rules as input features. Through training, ModSec-Learn is able to tune the contribution of each CRS rule to predictions, thus adapting the severity level to the web applications to protect. Our experiments show that ModSec-Learn achieves a significantly better trade-off between detection and false positive rates. Finally, we analyze how sparse regularization can reduce the number of rules that are relevant at inference time, by discarding more than 30% of the CRS rules. We release our open-source code and the dataset at https://github.com/pralab/modsec-learn and https://github.com/pralab/http-traffic-dataset, respectively.

ModSec-Learn: Boosting ModSecurity with Machine Learning

Scano, Christian^Primo;Floris, Giuseppe^Secondo;Montaruli, Biagio;Demetrio, Luca;Valenza, Andrea;Compagna, Luca;Ariu, Davide;Piras, Luca;Balzarotti, Davide;Biggio, Battista

2025-01-01

Abstract

ModSecurity is widely recognized as the standard open-source Web Application Firewall (WAF), maintained by the OWASP Foundation. It detects malicious requests by matching them against the Core Rule Set (CRS), identifying well-known attack patterns. Each rule is manually assigned a weight based on the severity of the corresponding attack, and a request is blocked if the sum of the weights of matched rules exceeds a given threshold. However, we argue that this strategy is largely ineffective against web attacks, as detection is only based on heuristics and not customized on the application to protect. In this work, we overcome this issue by proposing a machine-learning model that uses the CRS rules as input features. Through training, ModSec-Learn is able to tune the contribution of each CRS rule to predictions, thus adapting the severity level to the web applications to protect. Our experiments show that ModSec-Learn achieves a significantly better trade-off between detection and false positive rates. Finally, we analyze how sparse regularization can reduce the number of rules that are relevant at inference time, by discarding more than 30% of the CRS rules. We release our open-source code and the dataset at https://github.com/pralab/modsec-learn and https://github.com/pralab/http-traffic-dataset, respectively.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Codice ISBN
	
				978-3-031-76458-5
978-3-031-76459-2
			
	Parole chiave
	
				Web Application Firewalls,  Machine Learning, Web Security, SQL injection, OWASP ModSecurity Core Rule Set
			
	Tipologia:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
ModSec-Learn.pdf Solo gestori archivio Tipologia: versione editoriale (VoR) Dimensione 1.16 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.16 MB	Adobe PDF	Visualizza/Apri Richiedi una copia
ModSec_AAM_compressed.pdf embargo fino al 11/03/2026 Tipologia: versione post-print (AAM) Dimensione 822.05 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	822.05 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11584/440487

Citazioni

ND

1

0

UNICA IRIS Institutional Research Information System

ModSec-Learn: Boosting ModSecurity with Machine Learning

Scano, Christian^Primo;Floris, Giuseppe^Secondo;Montaruli, Biagio;Demetrio, Luca;Valenza, Andrea;Compagna, Luca;Ariu, Davide;Piras, Luca;Balzarotti, Davide;Biggio, Battista

Primo

Secondo

2025-01-01

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

UNICA IRIS Institutional Research Information System

ModSec-Learn: Boosting ModSecurity with Machine Learning

Scano, ChristianPrimo;Floris, Giuseppe Secondo;Montaruli, Biagio;Demetrio, Luca;Valenza, Andrea;Compagna, Luca;Ariu, Davide;Piras, Luca;Balzarotti, Davide;Biggio, Battista

Primo

Secondo

2025-01-01

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Scano, Christian^Primo;Floris, Giuseppe^Secondo;Montaruli, Biagio;Demetrio, Luca;Valenza, Andrea;Compagna, Luca;Ariu, Davide;Piras, Luca;Balzarotti, Davide;Biggio, Battista

Scheda breve

Scheda completa

Scheda completa (DC)