
The real difference between an Azure VM with or without SSD

I want to talk about the real difference between an Azure VM with and without an SSD. This is not a post with charts and artificial benchmarks; it is just a real story from the field.
Context
One of my colleagues came to me complaining about a performance issue related to SQL Server. They were running a SQL Server instance on an Azure VM with Linux. The DB storage was not too complex and the DB size was acceptable.
Problem
Every few hours a job has to be executed on the database. It does a lot of data processing and usually takes around 1 hour. This duration is not acceptable: there is a clear NFR (non-functional requirement) that requires the task to finish in under 30 minutes.
An audit of the VM and the database made it pretty clear that the problem was at the level of read and write operations. A lot of activity was happening at that level, pushing memory and storage usage to high levels.
Actions
The DB specialists reviewed the database structure and the job. Unfortunately, there were not too many things that they could optimize. In addition, different things were tested, such as enabling and disabling different cache levels and changing other SQL Server configurations, but without success.
The IT team also checked the Linux configuration and tried to see if there was an issue with the VM itself, but, as in the case of the DB, nothing relevant was found except that disk and memory usage were close to their limits.
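The post does not say which tools were used for the audit. As one possible way to put a number on "the disk is at its limit" on a Linux VM, here is a minimal Python sketch that derives the average read and write latency from one line of /proc/diskstats. The function name, the DiskLatency type and the sample values are illustrative assumptions, not something taken from the original investigation.

```python
from typing import NamedTuple


class DiskLatency(NamedTuple):
    """Average time per completed I/O request for one block device."""
    device: str
    avg_read_ms: float
    avg_write_ms: float


def parse_diskstats_line(line: str) -> DiskLatency:
    """Compute average request latency from one /proc/diskstats line.

    Field layout (whitespace separated): major, minor, device name,
    reads completed, reads merged, sectors read, ms spent reading,
    writes completed, writes merged, sectors written, ms spent writing, ...
    """
    fields = line.split()
    device = fields[2]
    reads_completed = int(fields[3])
    ms_reading = int(fields[6])
    writes_completed = int(fields[7])
    ms_writing = int(fields[10])

    # Guard against division by zero on devices that saw no I/O yet.
    avg_read = ms_reading / reads_completed if reads_completed else 0.0
    avg_write = ms_writing / writes_completed if writes_completed else 0.0
    return DiskLatency(device, avg_read, avg_write)


# Made-up sample line, only to show the call; real values come from
# reading /proc/diskstats on the VM.
SAMPLE_LINE = "   8       0 sda 1000 50 80000 25000 2000 100 160000 90000 0 60000 115000"

if __name__ == "__main__":
    print(parse_diskstats_line(SAMPLE_LINE))
```

The counters in /proc/diskstats are cumulative since boot, so for a picture of what happens while the job runs you would take two readings and work with the difference between them. Averages in the tens of milliseconds per request are the kind of signal that points at slow storage rather than at CPU.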
VM Resources
When we looked at the VM type, we discovered that it was an A4 with 8 cores and 14 GB of memory. In theory, this should be more than enough, with plenty of memory and CPU.
When we looked at the storage, we noticed that the disk was backed by a standard HDD. Combined with the results from the DB and IT teams, which had reported high disk and memory consumption, this was the first sign. Usually, a system like SQL Server tries to use more memory when the disk is too slow and cannot keep up with the load.
Solution
We migrated the VM from A4 to D4s v3, which has only 4 cores and 16 GB of memory, but has a powerful SSD behind it, offered through premium storage.
Surprise: we were able to reduce the SQL job from more than an hour to 7 MINUTES. WOW! This is a big difference, and it was mostly driven by the storage type.
Less than 50% of the memory is now consumed and the boost from the SSD makes the job fly. The funny thing is that we even pay less. An A4 cost us around 255 EUR/month, compared with a D4s v3, which is less than 150 EUR/month.
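To make the gain concrete, here is a small Python sketch of the arithmetic behind these numbers. It uses the rounded figures from the story (a job of about 60 minutes down to 7, and a monthly cost of about 255 down to about 150), so the results are approximations; since the job originally took more than an hour and the new VM costs less than 150, the real gains are at least this large. The function names are illustrative.

```python
def speedup(before_minutes: float, after_minutes: float) -> float:
    """How many times faster the job runs after the change."""
    return before_minutes / after_minutes


def cost_saving_percent(before_cost: float, after_cost: float) -> float:
    """Monthly cost reduction as a percentage of the old cost."""
    return (before_cost - after_cost) / before_cost * 100


def meets_nfr(duration_minutes: float, limit_minutes: float) -> bool:
    """True when the job finishes strictly under the required limit."""
    return duration_minutes < limit_minutes


if __name__ == "__main__":
    # Rounded figures from the story: ~60 min -> 7 min, ~255 -> ~150 per month.
    print(f"Speedup: {speedup(60, 7):.1f}x")
    print(f"Cost saving: {cost_saving_percent(255, 150):.0f}%")
    print(f"Under the 30-minute NFR: {meets_nfr(7, 30)}")
```

With these inputs the job runs roughly 8.6 times faster and the VM costs about 41% less, while comfortably meeting the 30-minute requirement.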

Lesson learned
Before deciding what kind of Azure machine you want to use, try to put on paper what kind of resources you will need most. Based on this, try to choose the VM that best suits your needs, and don’t forget that Microsoft has good documentation on the different VM types (General Purpose, Compute Optimized, Memory Optimized, Storage Optimized, GPU and High Performance Compute).
And yes, play with different VM configurations to see what works best for your needs.

Comments

  1. SSD disks for SharePoint Server are a no-brainer, whether in Azure IaaS or on-prem. At the least, the SQL DB server should have SSD.
