Skip to main content

Ordering logs with different timestamp

Time is relative? Yes, it is. When you need to write Audit Data and Logs, you don’t want to have a relative time on different systems.
There are different protocalls for time synchronization like NTP that can help us to synchronize the time on machines. Unfortunately, based on firewall configuration or the type of access we have on that machines we might not be able to use NTP protocol.
Even if we are using NTP, depending on server configuration we can have a deviation of 1-2s between machines. A deviation of a few seconds might not be too much until you write logs and you try them to order them based on time. In this situation, reading the logs is not so simple – the logs order will not match with the execution order of the actions that were logged. Trying to understand what happen inside the system wil be hard.

If this is not enough, we might have cases when our application is running in different environments that are using different time servers. For example, we can have a group of systems that are running on Azure machines, another group of systems that are running on-premises and another set of machines from AWS. The time will not match perfectly. Each server  will have a small time difference.

Usualy all NTP servers are sync between each others, but still, there can be a small time difference. The biggest problem is with on-premises systems, where the deviation can be 1s or more.
What we could do in this situation? Our mission is to be able to order logs from a timeline perspective, in the correct order.
From the time perspective, logs that were generated by the same system can be ordered correctly using the time information.
Another important information that we have is that we know the flow of data and logic inside the system. This means that we know that a request is first processed by system A, after it will be processed by system B and so on.
Even if we have cycles in the flow between different system, we can consider that when the current requests arrives in a system where it was already, we can view it as a new system.
Using this approach, we define ‘gates’ inside our system that can be tracked and numbered. If each gate has a different number or id, we are able to order them based on this information. In this way we can put all our logs in only one repository, order them based on the system (gate) number and each group of logs can be ordered based on the time.
Different tracking ID of the request will help us to identify from the logs repository the request that we want to analyze.
In the end we will have the right order of our logs and we’ll be able to read them like a story. For complex scenarios, we might need to work a little more, but the main idea will remain the same.

Comments

Popular posts from this blog

Windows Docker Containers can make WIN32 API calls, use COM and ASP.NET WebForms

After the last post , I received two interesting questions related to Docker and Windows. People were interested if we do Win32 API calls from a Docker container and if there is support for COM. WIN32 Support To test calls to WIN32 API, let’s try to populate SYSTEM_INFO class. [StructLayout(LayoutKind.Sequential)] public struct SYSTEM_INFO { public uint dwOemId; public uint dwPageSize; public uint lpMinimumApplicationAddress; public uint lpMaximumApplicationAddress; public uint dwActiveProcessorMask; public uint dwNumberOfProcessors; public uint dwProcessorType; public uint dwAllocationGranularity; public uint dwProcessorLevel; public uint dwProcessorRevision; } ... [DllImport("kernel32")] static extern void GetSystemInfo(ref SYSTEM_INFO pSI); ... SYSTEM_INFO pSI = new SYSTEM_INFO(...

How to audit an Azure Cosmos DB

In this post, we will talk about how we can audit an Azure Cosmos DB database. Before jumping into the problem let us define the business requirement: As an Administrator I want to be able to audit all changes that were done to specific collection inside my Azure Cosmos DB. The requirement is simple, but can be a little tricky to implement fully. First of all when you are using Azure Cosmos DB or any other storage solution there are 99% odds that you’ll have more than one system that writes data to it. This means that you have or not have control on the systems that are doing any create/update/delete operations. Solution 1: Diagnostic Logs Cosmos DB allows us activate diagnostics logs and stream the output a storage account for achieving to other systems like Event Hub or Log Analytics. This would allow us to have information related to who, when, what, response code and how the access operation to our Cosmos DB was done. Beside this there is a field that specifies what was th...

Cloud Myths: Cloud is Cheaper (Pill 1 of 5 / Cloud Pills)

Cloud Myths: Cloud is Cheaper (Pill 1 of 5 / Cloud Pills) The idea that moving to the cloud reduces the costs is a common misconception. The cloud infrastructure provides flexibility, scalability, and better CAPEX, but it does not guarantee lower costs without proper optimisation and management of the cloud services and infrastructure. Idle and unused resources, overprovisioning, oversize databases, and unnecessary data transfer can increase running costs. The regional pricing mode, multi-cloud complexity, and cost variety add extra complexity to the cost function. Cloud adoption without a cost governance strategy can result in unexpected expenses. Improper usage, combined with a pay-as-you-go model, can result in a nightmare for business stakeholders who cannot track and manage the monthly costs. Cloud-native services such as AI services, managed databases, and analytics platforms are powerful, provide out-of-the-shelve capabilities, and increase business agility and innovation. H...