Azure OpenAI

Puorpose of quotas

To regulate the number of requests, the volume of texts tokens, or the amount of computational power utilized by OpenAI API user

Quota Allocation

Subscription receive a quota allocation, determined by geographic region and regions quantified by token-per-minute (TPM)

Rate limit

Conversion metric

Azure API Management (APIM)

Azure Monitor and Log Analytics

Azure OpenAI Monitoring Metrics

References

Learn Live: Monitoring Azure OpenAI Monitor OpenAI Monitor OpenAI