Expose Vertex Dynamic Workload Scheduler on Vertex Run #15417

Open
dwyatte opened this issue Sep 18, 2024 · 2 comments
Labels
enhancement An improvement of an existing feature integrations Related to integrations with other services

Comments

@dwyatte

dwyatte commented Sep 18, 2024

Describe the current behavior

Prefect's GCP Vertex integration exposes a subset of the arguments required to run a flow as a Vertex job. GCP recently integrated Vertex jobs with its Dynamic Workload Scheduler, which lets users pass an additional request parameter, scheduling, for more control over when their jobs start. This is especially useful for flows that require high-demand resources such as GPUs (e.g., by waiting up to 30 minutes for a GPU to become available from the flex start pool).

https://cloud.google.com/vertex-ai/docs/training/schedule-jobs-dws

workerPoolSpecs:
  machineSpec:
    machineType: n1-highmem-2
  replicaCount: 1
  containerSpec:
    imageUri: gcr.io/ucaip-test/ucaip-training-test
    args:
    - port=8500
    command:
    - start
scheduling:
  strategy: FLEX_START
  maxWaitDuration: 1800s
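
For reference, the same payload can be sketched in Python as a plain dictionary, with the wait given in seconds and serialized to the `1800s` duration string. The helper name `build_scheduling` is hypothetical and is not part of prefect-gcp or the Vertex SDK; the field names and values are taken from the YAML above, and the layout mirrors that YAML rather than any particular client library.

```python
from typing import Any, Dict


def build_scheduling(strategy: str, max_wait_seconds: int) -> Dict[str, str]:
    """Hypothetical helper: build the `scheduling` block shown in the YAML above."""
    if max_wait_seconds < 0:
        raise ValueError("max_wait_seconds must be non-negative")
    # Durations are written as a number of seconds followed by "s", e.g. "1800s".
    return {"strategy": strategy, "maxWaitDuration": f"{max_wait_seconds}s"}


# Mirrors the YAML above; the structure is copied from it, not from an SDK.
job_spec: Dict[str, Any] = {
    "workerPoolSpecs": {
        "machineSpec": {"machineType": "n1-highmem-2"},
        "replicaCount": 1,
        "containerSpec": {
            "imageUri": "gcr.io/ucaip-test/ucaip-training-test",
            "args": ["port=8500"],
            "command": ["start"],
        },
    },
    # Wait up to 30 minutes for capacity from the flex start pool.
    "scheduling": build_scheduling("FLEX_START", 30 * 60),
}
```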

Describe the proposed behavior

https://github.com/PrefectHQ/prefect/blob/main/src/integrations/prefect-gcp/prefect_gcp/workers/vertex.py should expose the ability to specify the scheduling parameter.

Example Use

No response

Additional context

Partial duplicate of #5495; we might consider addressing that at the same time.

@dwyatte dwyatte added the enhancement An improvement of an existing feature label Sep 18, 2024
@zzstoatzz
Collaborator

hi @dwyatte - thank you for the issue!

increasing the capability of the vertex worker like this sounds useful and reasonable

do you have any interest / capacity to contribute this?

here are docs which might be a useful reference, but it would essentially just be adding a field to the config model that specifies the current work pool variables
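
As a rough illustration of "adding a field to the config model", here is a minimal, self-contained sketch. It uses a plain dataclass as a stand-in for the real configuration class in `vertex.py`, which is not reproduced here; the class name `VertexJobVariablesSketch`, its fields, and `to_job_spec` are all hypothetical. The only point it makes is that `scheduling` can be optional and passed through to the job spec only when set.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class VertexJobVariablesSketch:
    """Hypothetical stand-in for the worker's config model (not the real class)."""

    image: str
    machine_type: str = "n1-highmem-2"
    # New optional field: omitted from the job spec unless the user sets it.
    scheduling: Optional[Dict[str, Any]] = None

    def to_job_spec(self) -> Dict[str, Any]:
        """Build a job spec shaped like the YAML in the issue description."""
        spec: Dict[str, Any] = {
            "workerPoolSpecs": {
                "machineSpec": {"machineType": self.machine_type},
                "replicaCount": 1,
                "containerSpec": {"imageUri": self.image},
            }
        }
        if self.scheduling is not None:
            # Copy so later mutation of the spec does not affect the config.
            spec["scheduling"] = dict(self.scheduling)
        return spec


# Existing behavior is unchanged when the field is left unset.
default_spec = VertexJobVariablesSketch(image="gcr.io/ucaip-test/ucaip-training-test").to_job_spec()

# Opting in to flex start with a 30 minute maximum wait.
flex_spec = VertexJobVariablesSketch(
    image="gcr.io/ucaip-test/ucaip-training-test",
    scheduling={"strategy": "FLEX_START", "maxWaitDuration": "1800s"},
).to_job_spec()
```

Keeping the field optional with a default of `None` means existing work pools would not need to change.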

@zzstoatzz zzstoatzz added the integrations Related to integrations with other services label Sep 18, 2024
@dwyatte
Author

dwyatte commented Sep 19, 2024

Thanks @zzstoatzz

I or one of my colleagues plan to contribute soon!
