Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MLFlow metrics are not available immediately for consumption #39198

Open
bastbu opened this issue Jan 15, 2025 · 1 comment
Open

MLFlow metrics are not available immediately for consumption #39198

bastbu opened this issue Jan 15, 2025 · 1 comment
Labels
customer-reported Issues that are reported by GitHub users external to the Azure organization. Machine Learning needs-team-attention Workflow: This issue needs attention from Azure service team or SDK team question The issue doesn't require a change to the product in order to be resolved. Most issues start as that Service Attention Workflow: This issue is responsible by Azure service team.

Comments

@bastbu
Copy link

bastbu commented Jan 15, 2025

  • Package Name: azure-ai-ml
  • Package Version: 1.23.0
  • Operating System: Ubuntu
  • Python Version: 3.11

Describe the bug

The documentation on Azure ML states that when logging MLFlow metrics (which is now the recommended way to log metrics in Azure ML) synchronously, the metrics are immediately available for consumption:

Metrics, as opposed to parameters, are always numeric, and they can be logged either synchronously or asynchronously. When metrics are logged, they are immediately available for consumption upon call return.

Also, for asynchronous metric logging, the doc seems to imply that the metrics should be queryable when the job finishes:

Azure Machine Learning automatically waits when the job is about to finish, to see if there is any pending metric to be persisted. By the time a job is completed in Azure Machine Learning, all metrics are guaranteed to be persisted.

However, our tests have shown several times that the metrics are not immediately available to be queried with MLFlow when the job finishes. Is this expected, and if yes, is there a deterministic way to wait for the metrics to be available for querying?

To Reproduce

Steps to reproduce the behavior:

  1. Log a metric in an Azure ML job using the MLFlow SDK
  2. Query all metrics for an Azure ML job after it finished using the MLFlow SDK

Expected behavior

The reported metrics should immediately appear when querying.

@github-actions github-actions bot added customer-reported Issues that are reported by GitHub users external to the Azure organization. needs-triage Workflow: This is a new issue that needs to be triaged to the appropriate team. question The issue doesn't require a change to the product in order to be resolved. Most issues start as that labels Jan 15, 2025
@xiangyan99 xiangyan99 added Machine Learning Service Attention Workflow: This issue is responsible by Azure service team. and removed needs-triage Workflow: This is a new issue that needs to be triaged to the appropriate team. labels Jan 15, 2025
Copy link

Thanks for the feedback! We are routing this to the appropriate team for follow-up. cc @Azure/azure-ml-sdk @azureml-github.

@github-actions github-actions bot added the needs-team-attention Workflow: This issue needs attention from Azure service team or SDK team label Jan 15, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
customer-reported Issues that are reported by GitHub users external to the Azure organization. Machine Learning needs-team-attention Workflow: This issue needs attention from Azure service team or SDK team question The issue doesn't require a change to the product in order to be resolved. Most issues start as that Service Attention Workflow: This issue is responsible by Azure service team.
Projects
None yet
Development

No branches or pull requests

2 participants