1. What is the issue? Please be detailed.
For the past couple of days, our server has been experiencing persistent 100% CPU utilization, primarily caused by PostgreSQL checkpointer processes. This high CPU usage continues for several hours, after which I have to restart the containers to bring the system back to normal. The server then remains stable for a few hours, but eventually, the CPU usage spikes again to 100%, and RAM usage also increases significantly.
I recently upgraded ODK Central from v2024.1.0 to v2024.2.1 to enable S3 feature support. However, the issue has only started appearing over the past few days, even though the upgrade was done a couple of weeks ago.
System Details
Server Specs: 2 CPU cores and 4 GB RAM (3917 MB total, with 824 MB available)
Disk: 60 GB total, 33 GB available
Swap: 1.1 GB of 4 GB in use
Current PostgreSQL Configuration (default)
shared_buffers: 128MB (16384 Ă— 8kB)
checkpoint_timeout: 300s (5 minutes)
max_wal_size: 1024MB
min_wal_size: 80MB
checkpoint_completion_target: 0.9
I have not modified any of these settings.
3. What have you tried to fix the issue?
Observed Symptoms:
PostgreSQL checkpointer processes consistently consume ~100% CPU each
Multiple checkpointer processes run simultaneously
Swap usage increases (1.1 GB of 4 GB used)
The issue persists even though the application appears to function normally
I haven’t made any changes or optimizations yet to resolve this.
4. Upload any forms or screenshots you can share publicly below.
I have attached a couple of screenshots below.
