Shared DB - Windows
These results should be viewed as guidelines and not performance guarantees, since there are many variables that affect performance (file set, network configurations, hardware characteristics, etc.). If throughput is important to your implementation, OPSWAT recommends site-specific benchmarking before implementing a production solution.
Environment
The tests used an AWS environment with the specifications below:
MetaDefender Core
| Server | OS | AWS instance type | vCPU | Memory (GB) | Network bandwidth (Gbps) | Disk type | Benchmark |
|---|---|---|---|---|---|---|---|
| MetaDefender Core #1 | Windows Server 2022 | c5.4xlarge | 16 | 32 | Up to 10 | SSD | Amazon EC2 c5.4xlarge - Geekbench |
| MetaDefender Core #2 | Windows Server 2022 | c5.4xlarge | 16 | 32 | Up to 10 | SSD | Amazon EC2 c5.4xlarge - Geekbench |
RDS
| OS | AWS instance type | vCPU | Memory (GB) | Network bandwidth (Gbps) | Disk type |
|---|---|---|---|---|---|
| Windows Server 2022 | db.m7i.4xlarge | 16 | 64 | Up to 10 | SSD |
Deployment Model

An AWS Load Balancer distributes files sent from the client tool to two (2) MetaDefender Core servers using a round-robin algorithm. With this algorithm, each MetaDefender Core server is expected to receive the same number of requests.
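As a minimal illustration of the round-robin policy (not the load balancer's actual implementation), each incoming request is handed to the next server in a fixed rotation; the server URLs below are placeholders.

```python
from itertools import cycle

# Placeholder backend pool; in the benchmark this role is played by the AWS
# Load Balancer in front of MetaDefender Core #1 and #2.
backends = cycle([
    "http://md-core-1.example:8008",
    "http://md-core-2.example:8008",
])

def next_backend():
    """Return the next backend in strict rotation, so requests split evenly."""
    return next(backends)
```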
Client tool
A simple tool written in Python collects files from a designated folder and submits scan requests to the Load Balancer mentioned above (simplified excerpt below).
```python
import asyncio
import time

import requests

# Simplified excerpt from the client tool: it collects files from a designated
# folder and POSTs each one to the load balancer in front of the Core servers.


class ScanClient:

    def __init__(self, load_balancer_url):
        self.load_balancer_url = load_balancer_url
        self.post_wait = 0.0   # cumulative time spent waiting on POST /file responses
        self.num_post_req = 0  # number of successful POST /file requests

    async def scan_all(self, files_to_scan):
        # Submit every collected file as a concurrent scan task.
        scan_futures = []
        for file_path in files_to_scan:
            scan_futures.append(
                asyncio.create_task(self.scan_file(file_path, self.load_balancer_url)))
        return await asyncio.gather(*scan_futures)

    async def scan_file(self, file_path, core_url):
        api_url = f'{core_url}/file'
        with open(file_path, 'rb') as file:
            file_content = file.read()
        headers = {'Content-Type': 'application/octet-stream', 'filename': file_path}
        max_retries = 20   # maximum number of retry attempts
        retry_delay = 120  # delay in seconds before retrying
        for attempt in range(max_retries + 1):
            try:
                starttime = time.time()
                response = requests.post(api_url, data=file_content, headers=headers)
                endtime = time.time()
                status_code = response.status_code
                if status_code == 200:
                    # Record how long the upload took and return the scan's data_id.
                    self.post_wait = self.post_wait + endtime - starttime
                    self.num_post_req += 1
                    response_json = response.json()
                    data_id = response_json.get('data_id')
                    return status_code, response_json, data_id
                if status_code == 503:
                    # Core is busy; wait before retrying the upload.
                    print(f"Received status code {status_code}. Retrying in {retry_delay} seconds...")
                    await asyncio.sleep(retry_delay)
                else:
                    return None, None, None
            except requests.RequestException:
                return None, None, None
        return None, None, None
```
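For completeness, a hedged usage sketch (not part of the original excerpt): the dataset folder and load-balancer URL below are placeholders, and the average POST wait is derived from the counters the client keeps.

```python
import asyncio
from pathlib import Path

# Placeholder folder and load-balancer URL; substitute your own values.
files = [str(p) for p in Path("dataset").rglob("*") if p.is_file()]
client = ScanClient("http://my-load-balancer.example:8008")
asyncio.run(client.scan_all(files))

if client.num_post_req:
    print(f"Average POST /file wait: {client.post_wait / client.num_post_req:.3f} s")
```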
The client tool ran on the following machine:

| OS | AWS instance type | vCPU | Memory (GB) | Network bandwidth (Gbps) | Disk type |
|---|---|---|---|---|---|
| CentOS 7 | c5.4xlarge | 16 | 32 | Up to 10 | SSD |
Dataset
The dataset described below was used for testing:
| File category | File type | Number of files | Total size (MB) | Average file size (MB) |
|---|---|---|---|---|
| Adobe | | 370 | 385 MB | 1.0 MB |
| Executable | EXE | 45 | 309.5 MB | 6.9 MB |
| | MSI | 15 | 45.75 MB | 3.1 MB |
| Image | BMP | 80 | 515 MB | 6.4 MB |
| | JPG | 420 | 237.5 MB | 0.6 MB |
| | PNG | 345 | 169 MB | 0.5 MB |
| Media | MP3 | 135 | 865 MB | 6.4 MB |
| | MP4 | 50 | 500 MB | 10.0 MB |
| Office | DOCX | 235 | 190 MB | 0.8 MB |
| | DOC | 225 | 486 MB | 2.2 MB |
| | PPTX | 365 | 860 MB | 2.4 MB |
| | PPT | 355 | 1950 MB | 5.5 MB |
| | XLSX | 340 | 283.5 MB | 0.8 MB |
| | XLS | 335 | 284.5 MB | 0.8 MB |
| Text | CSV | 100 | 236 MB | 2.4 MB |
| | HTML | 1075 | 76 MB | 0.1 MB |
| | TXT | 500 | 210 MB | 0.4 MB |
| Archive | ZIP | Compressed files: 10; extracted files: 270 | Compressed size: 125.5 MB; extracted size: 156.5 MB | Avg. compressed size: 12.6 MB; avg. extracted size: 0.6 MB |
| Summary (compressed) | | 5000 | 7728.5 MB | 1.55 MB |
| Summary (extracted) | | 5260 | 7759.5 MB | 1.48 MB |
Product Information
Product versions:
- MetaDefender Core 5.14.0
- Engines:
- Metascan 8: Ahnlab, Avira, ClamAV, ESET, Bitdefender, K7, Quick Heal, VirIT Explorer
- Metascan 12: the Metascan 8 engines plus Varist, Ikarus, Emsisoft, Tachyon
- Metascan 16: the Metascan 12 engines plus NANO, Comodo, VirusBlokAda, Zillya!
- Deep CDR: 7.4.0
- Proactive DLP: 2.23.0
- Archive: 7.4.0
- File type analysis: 7.4.0
- File-based vulnerability assessment: 4.2.416.0
MetaDefender Core settings
General settings
- Turn off data retention
- Turn off engine update
Archive extraction settings
- Max recursion level: 99999999
- Max number of extracted files: 99999999
- Max total size of extracted files: 99999999
- Timeout: 10 minutes
- Handle archive extraction task as Failed: true
- Extracted partially: true
Metascan AV settings
- Max file size: 99999999
- Scan timeout: 10 minutes
- Per engine scan timeout: 1 minute
Performance test results
MetaDefender Core with a single engine (technology)
Summary metrics:
| Use case | Scan duration (minutes) | Throughput (processed objects/hour) | Avg. processing time (seconds/object) |
|---|---|---|---|
| Metascan 8 | 8 | 953,812 | 0.004 |
| Metascan 12 | 14.4 | 529,895 | 0.007 |
| Metascan 16 | 15.7 | 486,019 | 0.007 |
| Deep CDR | 9.51 | 802,435 | 0.004 |
| Proactive DLP | 7.2 | 1,047,992 | 0.003 |
| Vulnerability | 6.2 | 1,230,726 | 0.003 |
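The two right-hand columns are consistent with each other if average processing time is read as 3600 seconds divided by the hourly throughput; a quick check of that assumption:

```python
# Assumption: avg. processing time (s/object) ≈ 3600 / throughput (objects/hour).
throughput = {"Metascan 8": 953_812, "Metascan 12": 529_895, "Deep CDR": 802_435}
for use_case, objects_per_hour in throughput.items():
    print(f"{use_case}: {3600 / objects_per_hour:.3f} s/object")
# ~0.004, ~0.007, ~0.004 -- matching the table above.
```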
System resource utilization:
| Use case | Avg./Max CPU usage, Core 1 (%) | Avg./Max CPU usage, Core 2 (%) | Avg./Max RAM usage, Core 1 (%) | Avg./Max RAM usage, Core 2 (%) | Avg. network speed, Core 1 (KB/s) | Avg. network speed, Core 2 (KB/s) |
|---|---|---|---|---|---|---|
| Metascan 8 | 73.5 / 92.4 | 75.5 / 92 | 49.5 / 55 | 43 / 52 | 9,570 | 9,145 |
| Metascan 12 | 87 / 98 | 89.4 / 99 | 53 / 57.4 | 44 / 49 | 5,657 | 5,840 |
| Metascan 16 | 80 / 98.6 | 80.4 / 98.2 | 56.8 / 59.5 | 50.7 / 54.2 | 4,168 | 6,031 |
| Deep CDR | 72.4 / 91.6 | 72 / 90.5 | 52.6 / 60 | 45.5 / 53 | 5,617 | 5,602 |
| Proactive DLP | 43.2 / 80 | 49.4 / 81 | 51.3 / 57.5 | 44.1 / 53.1 | 8,460 | 8,383 |
| Vulnerability | 61 / 85.5 | 61.7 / 88.3 | 48.4 / 53.5 | 41.5 / 49.3 | 11,692 | 12,076 |
MetaDefender Core with common engine packages
Summary metrics:
| Use case | Scan duration (minutes) | Throughput (processed objects/hour) | Avg. processing time (seconds/object) |
|---|---|---|---|
| Metascan 8 + Deep CDR | 13.2 | 578,068 | 0.006 |
| Metascan 8 + Deep CDR + Proactive DLP | 14.92 | 511,553 | 0.007 |
| Metascan 8 + Deep CDR + Proactive DLP + Vulnerability | 15.6 | 488,618 | 0.007 |
| Metascan 12 + Deep CDR | 18.4 | 414,701 | 0.009 |
| Metascan 12 + Deep CDR + Proactive DLP | 19.3 | 395,362 | 0.009 |
| Metascan 12 + Deep CDR + Proactive DLP + Vulnerability | 19.4 | 393,324 | 0.009 |
| Metascan 16 + Deep CDR | 21.1 | 361,635 | 0.01 |
| Metascan 16 + Deep CDR + Proactive DLP | 22.5 | 339,133 | 0.01 |
| Metascan 16 + Deep CDR + Proactive DLP + Vulnerability | 22.7 | 336,887 | 0.01 |
System resource utilization:
| Use case | Avg./Max CPU usage, Core 1 (%) | Avg./Max CPU usage, Core 2 (%) | Avg./Max RAM usage, Core 1 (%) | Avg./Max RAM usage, Core 2 (%) | Avg. network speed, Core 1 (KB/s) | Avg. network speed, Core 2 (KB/s) |
|---|---|---|---|---|---|---|
| Metascan 8 + Deep CDR | 86.6 / 99 | 84.5 / 98.5 | 53.3 / 62.5 | 45.6 / 52 | 4,589 | 5,148 |
| Metascan 8 + Deep CDR + Proactive DLP | 83 / 99 | 83.7 / 99.4 | 52.7 / 61.8 | 46 / 54 | 4,124 | 4,038 |
| Metascan 8 + Deep CDR + Proactive DLP + Vulnerability | 87 / 99.7 | 89 / 99.5 | 53 / 61.2 | 45.3 / 52 | 4,513 | 4,079 |
| Metascan 12 + Deep CDR | 87.3 / 99 | 90.4 / 99.3 | 55.1 / 62.3 | 46.8 / 53.8 | 3,438 | 3,340 |
| Metascan 12 + Deep CDR + Proactive DLP | 90.7 / 99.3 | 90 / 99.5 | 56.8 / 63 | 47.3 / 56.2 | 4,122 | 3,337 |
| Metascan 12 + Deep CDR + Proactive DLP + Vulnerability | 91.3 / 99.7 | 94.5 / 99.8 | 56.8 / 62.6 | 50.7 / 56.4 | 4,154 | 3,311 |
| Metascan 16 + Deep CDR | 87.3 / 99.5 | 87.8 / 99.3 | 62.8 / 67.5 | 54.5 / 59.5 | 3,313 | 2,934 |
| Metascan 16 + Deep CDR + Proactive DLP | 92.2 / 99.5 | 91.6 / 99.7 | 63.3 / 69.3 | 56 / 66.6 | 3,146 | 2,945 |
| Metascan 16 + Deep CDR + Proactive DLP + Vulnerability | 95.4 / 99.5 | 94.8 / 99.6 | 64 / 69.2 | 55.3 / 62.5 | 3,810 | 2,677 |
Recommendations
Controlling total processing time of each MD Core server:
In this deployment model, files should be organized and submitted so that the load on each MD Core server is balanced; it is wasteful for one Core server to sit idle while the other is busy. Distributing files evenly keeps each Core server utilized efficiently, improves overall system performance, helps prevent bottlenecks, and reduces the chance of overloading a single server.
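As a minimal sketch of that idea (not part of the benchmark tooling), files can be assigned to the server with the fewest total bytes queued so far, so both Core servers finish their share at roughly the same time; the server URLs and dataset folder are placeholders.

```python
import heapq
from pathlib import Path

# Placeholder Core endpoints; in the benchmark these sit behind the load balancer.
servers = ["http://md-core-1.example:8008", "http://md-core-2.example:8008"]

def assign_by_size(files):
    """Greedy balancing: give the next-largest file to the least-loaded server."""
    heap = [(0, url) for url in servers]          # (total bytes assigned, server URL)
    heapq.heapify(heap)
    assignment = {url: [] for url in servers}
    for path in sorted(files, key=lambda p: p.stat().st_size, reverse=True):
        total, url = heapq.heappop(heap)
        assignment[url].append(path)
        heapq.heappush(heap, (total + path.stat().st_size, url))
    return assignment

# Example: balance the benchmark dataset folder across the two servers.
plan = assign_by_size(p for p in Path("dataset").rglob("*") if p.is_file())
```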
Adding the proper number of MD Core servers to the cluster:
Each Core server added to this model increases the load on the shared database. When adding a new MD Core server, monitor the database server's performance (memory/CPU consumption, disk usage, network bandwidth, request response time, etc.) to confirm it can still handle the load. This is important for maintaining optimal performance and ensuring that the database server can continue to serve the system efficiently.
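For instance, if the shared database runs on Amazon RDS (as in this environment), a hedged sketch using boto3 can pull basic utilization metrics from CloudWatch; the region, instance identifier, and time window below are placeholders.

```python
from datetime import datetime, timedelta, timezone

import boto3  # assumes AWS credentials are configured for the account hosting RDS

cloudwatch = boto3.client("cloudwatch", region_name="us-east-1")  # placeholder region
now = datetime.now(timezone.utc)

# Average RDS CPU utilization over the last hour, in 5-minute buckets.
stats = cloudwatch.get_metric_statistics(
    Namespace="AWS/RDS",
    MetricName="CPUUtilization",
    Dimensions=[{"Name": "DBInstanceIdentifier", "Value": "md-core-shared-db"}],  # placeholder
    StartTime=now - timedelta(hours=1),
    EndTime=now,
    Period=300,
    Statistics=["Average"],
)
for point in sorted(stats["Datapoints"], key=lambda p: p["Timestamp"]):
    print(point["Timestamp"], f'{point["Average"]:.1f}%')
```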
Optimizing the database server for better performance:
Continuing to add Core servers to this model increases the strain on the shared database, so it is crucial that the database is tuned to handle the additional load effectively. If needed, consider adjusting the default PostgreSQL settings for the heavier data load. The PostgreSQL settings can be adjusted in <PostgreSQL install location\version>\data\postgresql.conf.
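The parameters most often revisited for a heavier connection and write load include max_connections, shared_buffers, work_mem, and effective_cache_size; the values below are illustrative assumptions only and should be sized against your own instance and workload.

```
# postgresql.conf (illustrative values only -- tune for your instance)
max_connections = 300          # headroom for all Core servers combined
shared_buffers = 16GB          # commonly ~25% of RAM on a dedicated DB host
work_mem = 32MB                # applies per sort/hash operation, so keep modest
effective_cache_size = 48GB    # planner hint, roughly OS cache + shared_buffers
```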
In addition, MD Core supports a parameter (db_connection) that lets users specify the maximum number of database connections MD Core can handle; see this guideline: MetaDefender Configuration.