Azure Storage
Azure Storage is Microsoft's cloud storage solution for modern data storage scenarios offering highly available, massively scalable, durable, and secure storage for a variety of data objects in the cloud. This integration helps in monitoring the transaction volume and read/write activity of all your storage accounts.
In Azure Storage, storage accounts allow you to create and manage the following storage services:
- Blob storage stores any type of text or binary data, such as a document, media file, or application installer. You can set Blob storage for private access or share contents publicly to the Internet. Blob storage serves the same purpose as both AWS S3 and EBS.
- Table storage. Stores structured datasets. Table storage is a NoSQL key-attribute data store that allows for rapid development and fast access to large quantities of data. Similar to AWS SimpleDB and DynamoDB services.
- Queue storage. Provides messaging for workflow processing and for communication between components of cloud services.
- File storage. Offers shared storage for legacy applications using the standard Server Message Block (SMB) protocol. File storage is used in a similar manner to EFS in the AWS platform.
Log and metric types
For Azure Storage, you can collect the following logs and metrics:
-
Resource logs, which provide an insight into operations that were performed within an Azure resource. For a complete schema for resource logs refer to the below documentation:
- Azure Blob Storage schema
- Azure File Storage schema
- Azure Queue Storage schema
- Azure File Storage schema
Requests made by the Blob storage service itself, such as log creation or deletion, aren't logged. For a full list of the logged data, see Storage logged operations and status messages.
-
Activity logs, provides insight into any subscription-level or management group level events that have occurred in the Azure. To learn more, refer to Azure documentation.
-
Metrics for Azure Storage are in below namespaces:
- Microsoft.Storage/storageAccounts
- Microsoft.Storage/storageAccounts/blobServices
- Microsoft.Storage/storageAccounts/fileServices
- Microsoft.Storage/storageAccounts/queueServices
- Microsoft.Storage/storageAccounts/tableServices
noteOnly metrics with
category=Transaction
can be exported from diagnostic settings export feature.Click on the above namespaces to learn more about the supported metrics. For a complete list of the dimensions that Azure Storage supports, refer to the below documentation.
- Azure Blob Storage Metrics dimensions
- Azure File Storage Metrics dimensions
- Azure Queue Storage Metrics dimensions
- Azure Table Storage Metrics dimensions
infoCapacity metrics are currently not supported via Diagnostic Settings.
Setup
Azure service sends monitoring data to Azure Monitor, which can then stream data to Eventhub. Sumo Logic supports:
- Logs collection from Azure Monitor using our Azure Event Hubs source.
- Activity Logs collection from Azure Monitor using our Azure Event Hubs source. It is recommended to create a separate source for activity logs. If you are already collecting these logs, you can skip this step.
- Metrics collection using our HTTP Logs and Metrics source via Azure Functions deployed using the ARM template.
You must explicitly enable diagnostic settings for each storage service (blob,queue,table and file) and each storage account that you want to monitor. You can forward logs to the same event hub provided they satisfy the limitations and permissions as described here.
When you configure the event hubs source or HTTP source, plan your source category to ease the querying process. A hierarchical approach allows you to make use of wildcards. For example: Azure/Storage/Logs
, Azure/Storage/Metrics
.
Configure field in field schema
-
In the main Sumo Logic menu, select Manage Data > Logs > Fields.
-
Search for following fields:
tenant_name
. This field is tagged at the collector level and users can get the tenant name using the instructions here https://learn.microsoft.com/en-us/azure/active-directory-b2c/tenant-management-read-tenant-name#get-your-tenant-namelocation
. The region to which the resource name belongs to.subscription_id
. Id associated with a subscription where resource is present.resource_group
. The resource group name where the Azure resource is present.provider_name
. Azure resource provider name (for ex Microsoft.Storage).resource_type
. Azure resource type (for ex storageaccounts).resource_name
. The name of the resource (for ex storage account name).service_type
. Type of the service that can be accessed from with a azure resource (for example, blobservices).service_name
. Services that can be accessed from within an Azure resource (for example, Azure SQL databases in Azure SQL Server).
-
Create the fields if it is not present. Refer to create and manage fields.
Configure Field Extraction Rules
Create a Field Extraction Rule (FER) for Azure Storage by following the instructions here.
-
Azure Location Extraction FER
Rule Name: AzureLocationExtractionFER
Applied at: Ingest Time
Scope (Specific Data): tenant_name=*Parse Expressionjson "location", "properties.resourceLocation", "properties.region" as location, resourceLocation, service_region nodrop
| replace(toLowerCase(resourceLocation), " ", "") as resourceLocation
| if (!isBlank(resourceLocation), resourceLocation, location) as location
| if (!isBlank(service_region), service_region, location) as location
| if (isBlank(location), "global", location) as location
| fields location -
Resource ID Extraction FER
Rule Name: AzureResourceIdExtractionFER
Applied at: Ingest Time
Scope (Specific Data): tenant_name=*Parse Expressionjson "resourceId"
| toUpperCase(resourceId) as resourceId
| parse regex field=resourceId "/SUBSCRIPTIONS/(?<subscription_id>[^/]+)" nodrop
| parse field=resourceId "/RESOURCEGROUPS/*/" as resource_group nodrop
| parse regex field=resourceId "/PROVIDERS/(?<provider_name>[^/]+)" nodrop
| parse regex field=resourceId "/PROVIDERS/[^/]+(?:/LOCATIONS/[^/]+)?/(?<resource_type>[^/]+)/(?<resource_name>.+)" nodrop
| parse regex field=resource_name "(?<parent_resource_name>[^/]+)(?:/PROVIDERS/[^/]+)?/(?<service_type>[^/]+)/?(?<service_name>.+)" nodrop
| if (isBlank(parent_resource_name), resource_name, parent_resource_name) as resource_name
| fields subscription_id, location, provider_name, resource_group, resource_type, resource_name, service_type, service_name
Configure metric rules
-
Azure Observability Metadata Extraction Service Level
In case this rule is already exists then no need to create again.
Rule Name: AzureObservabilityMetadataExtractionServiceLevel
resourceId=/SUBSCRIPTIONS/*/RESOURCEGROUPS/*/PROVIDERS/*/*/*/*/* tenant_name=*
Fields extracted | Metric rule |
---|---|
subscription_id | $resourceId._1 |
resource_group | $resourceId._2 |
provider_name | $resourceId._3 |
resource_type | $resourceId._4 |
resource_name | $resourceId._5 |
service_type | $resourceId._6 |
service_name | $resourceId._7 |
- Azure Observability Metadata Extraction Storage Account Level
Rule Name: AzureObservabilityMetadataExtractionStorageAccountLevel
resourceId=*/SUBSCRIPTIONS/*/RESOURCEGROUPS/*/PROVIDERS/*/*/* tenant_name=* accountresourceid=*
Fields extracted | Metric rule |
---|---|
subscription_id | $resourceId._1 |
resource_group | $resourceId._2 |
provider_name | $resourceId._3 |
resource_type | $resourceId._4 |
resource_name | $resourceId._5 |
Configure metrics collection
In this section, you will configure a pipeline for shipping metrics from Azure Monitor to an Event Hub, on to an Azure Function, and finally to an HTTP Source on a hosted collector in Sumo Logic.
- Create hosted collector and tag tenant_name field
- Configure an HTTP Source.
- Configure and deploy the ARM Template.
- Export metrics to Event Hub. Perform below steps for each storage service (blob,queue,table and file) and each storage account that you want to monitor.
- Choose
Stream to an event hub
as destination. - Select
Transaction
. - Use the Event hub namespace created by the ARM template in Step 2 above. You can create a new Event hub or use the one created by ARM template. You can use the default policy
RootManageSharedAccessKey
as the policy name.
- Choose
Configure logs collection
Diagnostic logs
In this section, you will configure a pipeline for shipping diagnostic logs from Azure Monitor to an Event Hub.
- To set up the Azure Event Hubs source in Sumo Logic, refer to Azure Event Hubs Source for Logs.
- To create the Diagnostic settings in Azure portal, refer to the Azure documentation. Perform below steps for each storage service (blob,queue,table and file) and each storage account that you want to monitor.
- Choose
Stream to an event hub
as the destination. - Select
allLogs
. - Use the Event hub namespace and Event hub name configured in previous step in destination details section. You can use the default policy
RootManageSharedAccessKey
as the policy name.
- Choose
- Tag the location field in the source with right location value.
Activity Logs
To collect activity logs, follow the instructions here. Do not perform this step in case you are already collecting activity logs for a subscription.
Since this source contains logs from multiple regions make sure that you do not tag this source with the location tag.
Enabling Microsoft Defender for Cloud
For Security events, make sure you enable Microsoft Defender for Cloud. If you have an existing settings, click Edit Settings.
In Defender Plans settings page turn on the Azure Storage status under Cloud Workload Protection section.
Installing the Azure Storage app
Now that you have set up data collection, install the Azure Storage Sumo Logic app to use the pre-configured dashboards that provide visibility into your environment for real-time analysis of overall usage.
To install the app:
- Select App Catalog.
- In the Search Apps field, search for and then select your app.
- Optionally, you can scroll down to preview the dashboards included with the app. Then, click Install App (sometimes this button says Add Integration).
- Click Next.
- Look for the dialog confirming that your app was installed successfully.
Once an app is installed, it will appear in your Personal folder or the folder that you specified. From here, you can share it with other users in your organization. Dashboard panels will automatically start to fill with data matching the time range query received since you created the panel. Results won't be available immediately, but within about 20 minutes, you'll see completed graphs and maps.
Viewing the Azure Storage app dashboards
All dashboards have a set of filters that you can apply to the entire dashboard. Use these filters to drill down and examine the data to a granular level.
- You can change the time range for a dashboard or panel by selecting a predefined interval from a drop-down list, choosing a recently used time range, or specifying custom dates and times. Learn more.
- You can use template variables to drill down and examine the data on a granular level. For more information, see Filtering Dashboards with Template Variables.
- Most Next-Gen apps allow you to provide the scope at the installation time and are comprised of a key (
_sourceCategory
by default) and a default value for this key. Based on your input, the app dashboards will be parameterized with a dashboard variable, allowing you to change the dataset queried by all panels. This eliminates the need to create multiple copies of the same dashboard with different queries.
Overview
The Azure Storage - Overview dashboard provides insights into transactions by storage account, storage type, and API name.
Use this dashboard to:
- Monitor Availability by storage account.
- Monitor Errors by storage account.
- View Total Ingress by storage account.
- View Total Egress by storage account.
- View Max Success E2E Latency.
- View Max Success Server Latency.
- View transactions by storage account.
Access
The Azure Storage - Access dashboard provides insights into transactions by location, TLS version also it shows distribution by user agent and identity type.
Use this dashboard to:
- View transactions by location.
- View transactions by TLS version.
- View distribution by user agent.
- View distribution by identity type.
Operations
The Azure Storage - Operations dashboard provides details like total write in bytes, total read, total ingress by storage account, and total egress by storage account. It also provides storage account statistic insights such as total read count, read bytes, max/avg read latency, total write count, write bytes, and max/avg write latency. In addition, it also displays the status code trend.
Use this dashboard to:
- View amount of write data in MB.
- View amount of read data in MB.
- View Non 200 status code by Service Type.
- View Transactions by Service Type.
- View Storage Account Write Statistics.
- View Storage Account Read Statistics.
- Monitor Total Ingress by Service Type.
- Monitor Total Egress by Service Type.
- Monitor Total Ingress by API name.
- Monitor Total Egress by API name.
Blob Service
The Azure Storage - Blob Service dashboard provides details like read bytes, write bytes, last 10 operations, read/write trend by service type, and top 10 read/write by resource name.
Use this dashboard to:
- View amount of read data in MB.
- View amount of write data in MB.
- View Read vs Write.
- View Reads by Container Name.
- View Writes by Container Name.
- View Container Name by Failed Operations.
- View Top 10 Resources by Failures.
- View Top 10 Resources by Latency.
- View Top 10 Resources by Reads(MB).
- View Top 10 Resources by Writes(MB).
File Service
The Azure Storage - File Service dashboard provides details like read bytes, write bytes, last 10 operations, read/write trend by service type, and top 10 read/write by resource name.
Use this dashboard to:
- View amount of read data in MB.
- View amount of write data in MB.
- View Read vs Write.
- View Reads by File Share.
- View Writes by File Share.
- View File Share by Failed Operations.
- View Top 10 Resources by Failures.
- View Top 10 Resources by Latency.
- View Top 10 Resources by Reads(MB).
- View Top 10 Resources by Writes(MB).
Queue Service
The Azure Storage - Queue Service dashboard provides details like read bytes, write bytes, last 10 operations, read/write trend by service type, and top 10 read/write by resource name.
Use this dashboard to:
- View amount of read data in MB.
- View amount of write data in MB.
- View Read vs Write.
- View Reads by Queue Service.
- View Writes by Queue Service.
- View Queue Service by Failed Operations.
- View Top 10 Resources by Failures.
- View Top 10 Resources by Latency.
- View Top 10 Resources by Reads(MB).
- View Top 10 Resources by Writes(MB).
Table Service
The Azure Storage - Table Service dashboard provides details like read bytes, write bytes, last 10 operations, read/write trend by service type, and top 10 read/write by resource name.
Use this dashboard to:
- View amount of read data in MB.
- View amount of write data in MB.
- View Read vs Write.
- View Reads by Table Service.
- View Writes by Table Service.
- View Table Service by Failed Operations.
- View Top 10 Resources by Failures.
- View Top 10 Resources by Latency.
- View Top 10 Resources by Reads(MB).
- View Top 10 Resources by Writes(MB).
Audit control plane operations
The Azure Storage - Audit control plane operations dashboard provides details like changes, read/write/delete specific changes, different operations used, top 10 operations that caused most errors, and most common errors.
Use this dashboard to:
- View last 24 hours changes.
- View changes - read, write, and delete.
- View operations used.
- View Top 10 operations that caused the most errors.
- View Top 10 most common errors.
- View requests with anonymous access.
- view Operations that caused server-side throttling errors.