
Run db:migrate in pre- and post-install/upgrade (#18, #26) #37

Open · angdraug wants to merge 3 commits into main
Conversation

angdraug

No description provided.

angdraug and others added 3 commits January 15, 2023 15:30
As recommended in Mastodon release notes, run db:migrate with
SKIP_POST_DEPLOYMENT_MIGRATIONS=true in pre-install and pre-upgrade
hooks, and again without the flag in post-install and post-upgrade.

Co-authored-by: Sheogorath <[email protected]>
Pre-install and pre-upgrade hooks run before the persistent ConfigMap
resources are installed. As suggested in helm/helm#8694, create a hook
ConfigMap with a lower hook-weight and resource-policy=keep to make the
same ConfigMap available in pre- hooks.
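For illustration only, a minimal sketch of the pre-install/pre-upgrade migration Job described in the first commit; the names, image tag, weights, and delete policy here are assumptions, not the chart's actual template. The post- job would look the same, minus the flag.

```yaml
# Sketch of a pre-install/pre-upgrade migration Job (hypothetical names/values).
apiVersion: batch/v1
kind: Job
metadata:
  name: mastodon-db-migrate-pre            # hypothetical name
  annotations:
    "helm.sh/hook": pre-install,pre-upgrade
    "helm.sh/hook-weight": "0"             # runs after the hook ConfigMap (which has a lower weight)
    "helm.sh/hook-delete-policy": before-hook-creation,hook-succeeded
spec:
  template:
    spec:
      restartPolicy: Never
      containers:
        - name: db-migrate
          image: ghcr.io/mastodon/mastodon:v4.2.0   # placeholder tag
          command: ["bundle", "exec", "rake", "db:migrate"]
          env:
            - name: SKIP_POST_DEPLOYMENT_MIGRATIONS  # pre- hooks skip post-deployment migrations
              value: "true"
          envFrom:
            - configMapRef:
                name: mastodon-env                   # Mastodon env settings (see the ConfigMap discussion below)
```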
apiVersion: v1
kind: ConfigMap
metadata:
name: {{ include "mastodon.fullname" . }}-env
Contributor

Why do we need a second ConfigMap with the same name and data?

Author

Because Helm doesn't create the other ConfigMap until after the pre-install hooks, and it deletes this one after the pre-install hooks have run. It's a catch-22: for the mastodon-env ConfigMap to be available when the db-migrate job runs, it has to be created as a hook with a lower weight, but since it's created as a hook it gets cleaned up after db-migrate is done, so we also have to keep the non-hook version of the same ConfigMap. See also the commit message in a749654.
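For illustration, a minimal sketch of the hook-created copy with a lower weight and resource-policy=keep; the annotation values and data keys are assumptions, not the chart's actual template.

```yaml
# Sketch of the hook-created ConfigMap copy (hypothetical values).
apiVersion: v1
kind: ConfigMap
metadata:
  name: mastodon-env                       # same name and data as the regular ConfigMap
  annotations:
    "helm.sh/hook": pre-install,pre-upgrade
    "helm.sh/hook-weight": "-10"           # lower than the migration Job, so it exists first
    "helm.sh/resource-policy": keep        # don't delete it once the hooks are done
data:
  DB_HOST: mastodon-postgresql             # placeholder values
  REDIS_HOST: mastodon-redis
```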

Contributor

Oh, I didn't realize that! Thanks for the clarification.

Contributor

For some additional clarification: this only matters the first time the chart gets installed; after that, the ConfigMap will already be there.

@angdraug
Author

Not sure if it shows up for you, but GitHub tells me:

> First-time contributors need a maintainer to approve running workflows.

I don't think this PR is going to land until someone does the needful.

@paolomainardi
Contributor

paolomainardi commented Jan 27, 2023

Can you please merge and release this PR? It looks OK. Otherwise this chart cannot be used in a highly automated environment with Terraform + Helm: the Helm installation never ends, the migration job is never triggered by Helm, and a fresh install becomes impossible.

cc @dunn

@dunn
Contributor

dunn commented Jan 27, 2023

I'm not actually a maintainer of this repo, so I can't merge.

@paolomainardi
Contributor

> I'm not actually a maintainer of this repo, so I can't merge.

Oops, so sorry. I saw your comments and just assumed you were a maintainer too.

@paolomainardi
Contributor

paolomainardi commented Jan 29, 2023

Just tried this PR, and it doesn't work: the migrate job requires the PVC to already exist, otherwise the job cannot run.

The question is: does the migration job require the PVC?

@paolomainardi
Contributor

I tried again using a bucket instead of a PVC, and now the problem is the required Redis instance, which has to be up and running for the job to finish.

@paolomainardi
Contributor

I tried running the migration job along with the other deployments, and it worked fine; it is the same approach the GitLab chart takes. The concept is to let the scheduler keep restarting the services until the migration job has finished initializing things; once it has, the pods come up and work fine.

GitLab migration job used as a reference: https://docs.gitlab.com/charts/charts/gitlab/migrations/

@renchap
Member

renchap commented Feb 17, 2023

Thanks for your work on this @paolomainardi!

> I tried running the migration job along with the other deployments, and it worked fine; it is the same approach the GitLab chart takes. The concept is to let the scheduler keep restarting the services until the migration job has finished initializing things; once it has, the pods come up and work fine.

How would this work for version upgrades? The pre-upgrade migrations need to run before any of the new-version application pods start, otherwise those pods can generate errors on some requests (when trying to access a table that has not been migrated yet) while their /health endpoint returns OK (it does not check the schema version).

I worry this will create user-facing errors during the migration, or even make the server unavailable if the migration does not happen and all pods are upgraded to the new version.

@paolomainardi
Contributor

@renchap yes, you're right; the issue with this approach is that users can face problems while migrations are running.

This issue can't really be overcome with Helm alone; the best choice is to avoid running migrations the way this chart currently does and to move most of the complexity to the application side.

Looking again at how GitLab does it, they have open-sourced the database migration types they support: https://docs.gitlab.com/ee/development/migration_style_guide.html

The relevant case for Mastodon is "Regular migrations", which according to their documentation must always take under 3 minutes; anything longer must be moved to post-deployment or background migrations.

It's not entirely clear what actually happens during that 3-minute window; maybe the migrations are always written so that the previous and next releases stay compatible.

This is the migrations Helm chart documentation: https://docs.gitlab.com/charts/charts/gitlab/migrations and, from my direct experience, they run alongside the other deployments.

@jessebot
Contributor

jessebot commented Jul 4, 2023

Is there any chance this could be moved forward?

@timetinytim
Contributor

Apologies for the long delay in getting to this...

Generally I like how this is done, including the generalization of the deploy job into pre & post runs. Though I do want to clarify something first.

> I tried running the migration job along with the other deployments, and it worked fine; it is the same approach the GitLab chart takes. The concept is to let the scheduler keep restarting the services until the migration job has finished initializing things; once it has, the pods come up and work fine.

@paolomainardi This job is using pre/post-upgrade/install helm hooks, which if I'm not mistaken, means that the pre- job is run first before anything is installed/updated, THEN resources are added/changed, THEN the post- job is run. The scheduler shouldn't be restarting anything while the pre-migration is running. Is this something you saw while trying this yourself?

@paolomainardi
Contributor

paolomainardi commented Feb 10, 2025

@timetinytim, thanks for your reply; no worries about the long delay.

It should work like that: while the migrations are running, a pod created by the Helm hooks runs them; once it finishes, the rest gets applied, such as updating the Kubernetes Deployment with the new Mastodon container image, which is what triggers the scheduler to act.

I have not been managing a Mastodon K8S instance lately, but I can give you some tips.

@timetinytim
Contributor

@paolomainardi Alright cool, I just wanted to confirm. That's the behavior we expect. As long as pre-migrations run before the new pods come up, that will hopefully prevent any user-facing errors during an update.

Let me do some testing on my own side on this, and if everything looks good, I'll get this approved/merged in the next day or two.

@timetinytim
Contributor

I've been doing some testing, and I've come to a conclusion:

A pre/post-hook ConfigMap isn't a bad idea, but at the end of the day the job only needs a couple of pieces of information: the DB connection info and the Redis connection info. There are also situations where the former would be different for the migration job (e.g. when using a connection pooler, which wouldn't work with migrations).

What I'm suggesting is that, rather than creating the ConfigMap twice, we just add the env vars the job needs directly. That solves the case mentioned above and simplifies things a little bit.
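As an illustration of the suggestion above, a rough sketch of the job container env; the variable names follow Mastodon's usual settings, but the values, image tag, and secret name are placeholders, not the chart's actual template.

```yaml
# Sketch: give the migration Job its env directly instead of a second ConfigMap.
containers:
  - name: db-migrate
    image: ghcr.io/mastodon/mastodon:v4.2.0   # placeholder tag
    command: ["bundle", "exec", "rake", "db:migrate"]
    env:
      - name: DB_HOST
        value: mastodon-postgresql            # could point at the DB directly, bypassing a pooler
      - name: DB_PORT
        value: "5432"
      - name: DB_NAME
        value: mastodon_production
      - name: DB_USER
        value: mastodon
      - name: DB_PASS
        valueFrom:
          secretKeyRef:
            name: mastodon-postgresql         # placeholder secret name
            key: password
      - name: REDIS_HOST
        value: mastodon-redis
      - name: REDIS_PORT
        value: "6379"
```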

@angdraug, let me know what you think. Given how long it's taken me to look at this I totally understand if you've moved on from this, so no worries if that's the case. I'll just pick this up myself.

@WyriHaximus
Contributor

> What I'm suggesting is that, rather than creating the ConfigMap twice, we just add the env vars the job needs directly. That solves the case mentioned above and simplifies things a little bit.

Speaking from experience, this will work perfectly fine, and it's the simplest solution, tbh.

@angdraug
Author

I have indeed moved on from this, please feel free to take over, thanks!
