
CFE-1162: Updates enhancement to reflect hcp usecase and with latest information on aws tags support #1700

Open
wants to merge 4 commits into base: master

Conversation

TrilokGeer
Contributor

This PR updates the enhancement to reflect the HCP use case and the latest information on AWS tags support.

@openshift-ci openshift-ci bot requested review from hasbro17 and jerpeter1 October 17, 2024 16:05
Contributor

openshift-ci bot commented Oct 17, 2024

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please ask for approval from trilokgeer. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@TrilokGeer TrilokGeer changed the title Updates enhancement to reflect hcp usecase and with latest information on aws tags support [CFE-1162] Updates enhancement to reflect hcp usecase and with latest information on aws tags support Oct 18, 2024
@TrilokGeer TrilokGeer changed the title [CFE-1162] Updates enhancement to reflect hcp usecase and with latest information on aws tags support CFE-1162: Updates enhancement to reflect hcp usecase and with latest information on aws tags support Oct 18, 2024
@openshift-ci-robot openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Oct 18, 2024
@openshift-ci-robot

openshift-ci-robot commented Oct 18, 2024

@TrilokGeer: This pull request references CFE-1162 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.18.0" version, but no target version was set.

In response to this:

This PR updates the enhancement to reflect the HCP use case and the latest information on AWS tags support.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.


A new field `propagateUserTags` is added in the GA release version. The `experimentalPropagateUserTags` field will be deprecated and removed in a future release version.
When both fields are set, `experimentalPropagateUserTags` takes precedence.
If the `userTags` field is changed post-install, there is no guarantee about how an in-cluster operator will respond to the change. Some operators may reconcile the change and update tags on the AWS resource; some operators may ignore the change. However, if tags are removed from `userTags`, the tags will not be removed from the AWS resource.
Contributor

You moved this paragraph, but did you mean to remove it entirely (or perhaps keep it for historical reference, but mark it as outdated)? We've run into an obstacle with cluster-ingress-operator and cloud-provider-aws (see openshift/cluster-ingress-operator#1148 (comment)), so I want to understand whether (1) handling updates and (2) handling removals in particular are hard requirements or not.

Member

In contrast to cluster-ingress-operator and cloud-provider-aws, I checked the behaviour with aws-load-balancer-operator#137, and it can update and remove the tags.

oc get infrastructures.config.openshift.io cluster -oyaml | yq .status.platformStatus

aws:
  region: us-east-2
  resourceTags:
    - key: conflict-key1
      value: plat-value2
    - key: conflict-key2
      value: plat-value3
type: AWS

aws elbv2 describe-tags --resource-arns arn:aws:elasticloadbalancing:us-east-*******

{
    "TagDescriptions": [
        {
            "ResourceArn": "arn:aws:elasticloadbalancing:us-east-*******",
            "Tags": [
                {
                    "Key": "service.k8s.aws/resource",
                    "Value": "LoadBalancer"
                },
                {
                    "Key": "service.k8s.aws/stack",
                    "Value": "aws-load-balancer-test-default-lb-class/echoserver"
                },
                {
                    "Key": "conflict-key2",
                    "Value": "plat-value3"
                },
                {
                    "Key": "conflict-key1",
                    "Value": "plat-value2"
                },
                {
                    "Key": "elbv2.k8s.aws/cluster",
                    "Value": "ckyal-20241021-2e71f8-pmfc8"
                }
            ]
        }
    ]
}

Updated infrastructure status: removed conflict-key1 and updated conflict-key2

oc get infrastructures.config.openshift.io cluster -oyaml | yq .status.platformStatus
aws:
  region: us-east-2
  resourceTags:
    - key: conflict-key2
      value: new-plat-value3
type: AWS

aws elbv2 describe-tags --resource-arns arn:aws:elasticloadbalancing:us-east-*******

{
    "TagDescriptions": [
        {
            "ResourceArn": "arn:aws:elasticloadbalancing:us-east-*******",
            "Tags": [
                {
                    "Key": "service.k8s.aws/resource",
                    "Value": "LoadBalancer"
                },
                {
                    "Key": "service.k8s.aws/stack",
                    "Value": "aws-load-balancer-test-default-lb-class/echoserver"
                },
                {
                    "Key": "conflict-key2",
                    "Value": "new-plat-value3"
                },
                {
                    "Key": "elbv2.k8s.aws/cluster",
                    "Value": "ckyal-20241021-2e71f8-pmfc8"
                }
            ]
        }
    ]
}

So, there is a bit of inconsistency between upstream cloud-provider-aws and aws-load-balancer-controller in the way they handle tags.

From the OpenShift side (cluster-ingress-operator and aws-load-balancer-operator), I think we can only control

  • the aws-load-balancer-additional-resource-tags annotation for cluster-ingress-operator, and
  • the --default-tags container arg for aws-load-balancer-operator

to propagate relevant tags to their consumers, which in this case are cloud-provider-aws and aws-load-balancer-controller, respectively (see the sketch below). If we need to standardise the behaviour, then we might need to change the upstream logic.
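
For illustration, a minimal sketch of the two propagation points mentioned above. The annotation key and the controller flag are real; the Service shown and the tag values are only examples borrowed from the outputs above, not taken from the enhancement.

# cluster-ingress-operator -> cloud-provider-aws:
# tags are passed via an annotation on the LoadBalancer-type Service it manages.
apiVersion: v1
kind: Service
metadata:
  name: router-default
  namespace: openshift-ingress
  annotations:
    service.beta.kubernetes.io/aws-load-balancer-additional-resource-tags: "conflict-key1=plat-value2,conflict-key2=plat-value3"
spec:
  type: LoadBalancer

# aws-load-balancer-operator -> aws-load-balancer-controller:
# tags are passed via a container argument on the controller deployment.
args:
- --default-tags=conflict-key1=plat-value2,conflict-key2=plat-value3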

Contributor Author
@TrilokGeer TrilokGeer Oct 21, 2024

You moved this paragraph, but did you mean to remove it entirely (or perhaps keep it for historical reference, but mark it as outdated)? We've run into an obstacle with cluster-ingress-operator and cloud-provider-aws (see openshift/cluster-ingress-operator#1148 (comment)), so I want to understand whether (1) handling updates and (2) handling removals in particular are hard requirements or not.

Ah, I think it is a residue of the initial version, thanks @Miciah. In summary, all Red Hat supported operators (in-cluster or day-2) will reconcile the tags on resources created and managed by them. Removal of tags is not supported, as it could unintentionally remove tags that were added externally by the user, if any.

Member
@chiragkyal chiragkyal Oct 22, 2024

Removal of tags is not supported, as it could unintentionally remove tags that were added externally by the user, if any.

aws-load-balancer-controller provides a command-line flag called --external-managed-tags through which a list of tag keys can be excluded from reconciliation by the controller.

xRef :

external-managed-tags  stringList  AWS Tag keys that will be managed externally. Specified Tags are ignored during reconciliation

This means that if a user wants to manage a tag externally, it has to be added to the --external-managed-tags list. If not, aws-load-balancer-controller will remove unsolicited tags and apply the tags defined in --default-tags. A sketch of how the two flags interact follows below.
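
As a hedged sketch (values are illustrative, not from this PR), the interaction of the two flags on the controller would look roughly like this:

# Tags listed in --default-tags are applied and kept in sync by the controller;
# tag keys listed in --external-managed-tags are skipped during reconciliation,
# so externally added tags with those keys are preserved instead of removed.
containers:
- name: controller
  args:
  - --default-tags=conflict-key2=plat-value3
  - --external-managed-tags=owner,cost-center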

@alebedev87 As far as I can see, aws-load-balancer-operator doesn't provide a field to specify ExternalManagedTags. Was there any discussion around this, or is it an existing gap?

Comment on lines +97 to +98
For the resources created and managed by the hosted control plane, the cluster API provider for AWS reconciles the user tags on AWS resources. The hosted control plane updates the `infrastructure.config.openshift.io` resource to reflect new tags in `resourceTags`. The OpenShift operators, both core and non-core (managed by Red Hat), reconcile the respective AWS resources created and managed by them.
Given that there is no universal controller to update all resources created by OpenShift, day-2 updates of tags are not supported for standalone OpenShift deployments.
Contributor

What does "not supported for standalone OpenShift deployments" mean? What will happen if a cluster admin changes resourceTags in the infrastructure resource? Line 95 ("If the userTags field is changed post-install,") says it's going to be applied to the cloud objects.

Contributor Author

At present, in standalone OpenShift deployments, resourceTags in the infrastructure resource is treated as immutable, and updating it is not a supported operation. Updates to userTags are allowed for HCP use cases, where the HCP control plane updates the infrastructure status based on edits to the HostedCluster and NodePool objects (see the sketch below).
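
For context, a minimal HostedCluster sketch of where such an edit would originate in the HCP flow; the field layout follows my reading of the HyperShift AWS platform API, and all values are invented:

apiVersion: hypershift.openshift.io/v1beta1
kind: HostedCluster
metadata:
  name: example
  namespace: clusters
spec:
  platform:
    type: AWS
    aws:
      region: us-east-2
      # Tags edited here are expected to be rolled down by the hosted
      # control plane into the guest cluster's infrastructure status
      # as resourceTags.
      resourceTags:
      - key: conflict-key2
        value: new-plat-value3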

Contributor
@jsafrane jsafrane Nov 19, 2024

resourceTags in the infrastructure resource is treated as immutable, and updating it is not a supported operation

This is not true; status.platformStatus.aws.resourceTags is editable on a standalone cluster.
Something will happen on the standalone cluster. I think with the current PR, the volumes just get re-tagged, just as in HCP.

Contributor Author

Ah yes, it is definitely a bug. The feature was implemented before kubectl supported editing subresources.
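
For reference, with an oc/kubectl version that supports --subresource, the status edit discussed here can be reproduced roughly like this (a sketch; the key and value are taken from the example output above):

oc patch infrastructures.config.openshift.io cluster \
  --subresource=status --type=merge \
  -p '{"status":{"platformStatus":{"aws":{"resourceTags":[{"key":"conflict-key2","value":"new-plat-value3"}]}}}}'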

Contributor Author

The controllers should not be enabled to reconcile tags for standalone clusters. Even in the event of an update in a standalone cluster, the change must not be reflected on the cloud provider resources.

Contributor

I do not agree here. I think there should be as little difference as possible between a standalone cluster and a HyperShift-managed one, especially when it comes to consuming OpenShift's own APIs such as Infrastructure. Having different behavior in HyperShift-managed clusters will create an inconsistent experience for users.

In addition, for example, the cluster registry operator already syncs tags on existing S3 buckets when a user changes infrastructure status.platformStatus.aws.resourceTags in a standalone cluster, and I think all observers of this field should have the same behavior.

@JoelSpeed @openshift/openshift-staff-engineers

Contributor Author

Well, standalone OCP cannot be considered because the solution would be incomplete there. The completeness comes with the HyperShift cluster spec, which reconciles tags for VPCs, subnets, etc. The day-2 tags workflow being considered here is for SD+ROSA+HCP deployments. I am happy to consider better alternatives other than adding differentiation in operators along with an additional annotation (which exists on infrastructure) that identifies the clusters as HyperShift-deployed.

Contributor

Based on discussion on Slack, let's consider a user updating the infrastructure status as unsupported on standalone OCP, so the statement in the enhancement is OK.
Still, I'd prefer all implementations to do the same: if the registry operator syncs tags on a status update, storage should too. It will greatly simplify our testing.

Contributor

In addition, for example, the cluster registry operator already syncs tags on existing S3 buckets when a user changes infrastructure status.platformStatus.aws.resourceTags in a standalone cluster, and I think all observers of this field should have the same behavior.

The Machine API providers behave in the same way.

We previously didn't want tag updating because of the inconsistency between some resources being updated and some not being updated.

In the case of HyperShift, we have an opportunity to make the tagging consistent across all resources. However, it was my understanding that a lot of resources for a HyperShift cluster were pre-provisioned and handed to the cluster, and it's not actually the Cluster API controllers that provision these resources. Will CAPI update tags on resources it didn't create?

@chiragkyal
Member

/cc @alebedev87

Contributor

openshift-ci bot commented Nov 6, 2024

@TrilokGeer: all tests passed!

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

### Existing design details

New `experimentalPropagateUserTags` field added to `.platform.aws` of install config to indicate that the user tags should be applied to AWS
New `propagateUserTags` field added to `.platform.aws` of install config to indicate that the user tags should be applied to AWS
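
For illustration, a hedged install-config sketch of how the new field sits next to userTags; the values are invented and the exact shape should be checked against the installer documentation:

apiVersion: v1
metadata:
  name: example
platform:
  aws:
    region: us-east-2
    # GA field; experimentalPropagateUserTags is the deprecated equivalent
    propagateUserTags: true
    userTags:
      conflict-key1: plat-value2
      conflict-key2: plat-value3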
Contributor
@jsafrane jsafrane Dec 10, 2024

Where is the field stored after installation? In other words, how does a CSI driver operator know whether it should sync tags or not? The operator does not read the install-config.

Contributor Author

The field is applied from the install-config while generating the infrastructure resource. No explicit control fields are proposed. Hence, it is enabled by default.

Contributor

Ack.
I understood that the propagateUserTags flag enables the propagation as a whole, but it just copies the tags from the install-config to the initial infrastructure resource. I thought that the installer already does that, but I may be wrong here.

Contributor Author

Correct, the change here is about deprecating the experimentalPropagateUserTags field. The field applies to the installer config.

If the `userTags` field is changed post-install, there is no guarantee about how an in-cluster operator will respond to the change. Some operators may reconcile the change and update tags on the AWS resource; some operators may ignore the change. However, if tags are removed from `userTags`, the tags will not be removed from the AWS resource.

For the resources created and managed by the hosted control plane, the cluster API provider for AWS reconciles the user tags on AWS resources. The hosted control plane updates the `infrastructure.config.openshift.io` resource to reflect new tags in `resourceTags`. The OpenShift operators, both core and non-core (managed by Red Hat), reconcile the respective AWS resources created and managed by them.
Contributor

Where do the CAPI controllers actually source the tags from? The hosted control plane resource will copy them into the AWSCluster, which is within the control plane, not the guest cluster. So why do we need to adjust the infrastructure resource at all? Are there things beyond what the AWS CAPI provider manages that also support updating the tags?

