Database backup failure discovered, emergency backup completed, infrastructure fixes applied. This page documents the incident and current state.
Database Backup Failure
CRITICAL: The daily pg_dump has been producing empty files since the backup was first set up. The database has been running without valid SQL backups.
Root Cause
The database is 46GB (80GB+ with indexes). pg_dump hangs because the Bingo chemical cartridge creates circular dependencies on the rxn_bingo_idx_shadow tables. The dump process stalls without writing any data, so the output file stays at 0 bytes. Individual table dumps work fine (tested: users 5KB, files 7KB).
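Because the failure showed up only as a 0-byte file, a size check after each dump would have surfaced it on the first run. The following is a minimal sketch, not the actual backup script: the function name dump_looks_valid and the 1024-byte threshold are assumptions chosen for illustration.

```python
import os


def dump_looks_valid(path: str, min_bytes: int = 1024) -> bool:
    """Return True if the dump file exists and is at least min_bytes long.

    min_bytes is a hypothetical threshold; a real deployment would pick a
    value based on the expected size of a healthy dump.
    """
    try:
        return os.path.getsize(path) >= min_bytes
    except OSError:
        # Missing or unreadable file counts as an invalid backup.
        return False
```

A wrapper could call this after pg_dump exits and raise an alert when it returns False, rather than silently keeping the empty file.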
Mitigating Factor
The rsync file backup has worked correctly throughout; only the SQL dump component was broken. This means file uploads, SDF/MOL files, and exported data were always backed up. The risk was limited to database records (users, shares, activity, audit logs, molecule metadata).
Emergency Backup
RESOLVED: Emergency manual backup completed successfully — 8.7GB compressed (6.5GB gzipped) saved to USB storage.
The backup script needs to exclude the tables that cause pg_dump to hang (the Bingo shadow tables) and the large reference tables that can be reloaded from public sources. This will reduce the dump to the irreplaceable data: user data, project data, shares, audit logs, and configuration.
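One way the exclusion could be expressed is by passing pg_dump's --exclude-table option once per pattern. The sketch below only builds the command line and does not run it. The pattern list, the function name build_pg_dump_cmd, and the database and file names in the usage line are hypothetical; the actual table patterns must be confirmed against the schema, and whether excluding them avoids the hang needs to be tested.

```python
from typing import List, Sequence

# Hypothetical patterns. "*." matches any schema; the real shadow-table and
# reference-table names must be checked against the database.
EXCLUDE_PATTERNS = [
    "*.rxn_bingo_idx_shadow*",  # Bingo shadow tables that make pg_dump hang
]


def build_pg_dump_cmd(dbname: str, outfile: str,
                      exclude_patterns: Sequence[str]) -> List[str]:
    """Build a pg_dump argument list that skips the given table patterns."""
    cmd = ["pg_dump", "--format=custom", f"--file={outfile}"]
    for pattern in exclude_patterns:
        # --exclude-table may be repeated; each value is a pg_dump pattern.
        cmd.append(f"--exclude-table={pattern}")
    cmd.append(dbname)
    return cmd
```

The resulting list can be handed to subprocess.run with a timeout, for example subprocess.run(build_pg_dump_cmd("appdb", "/backups/appdb.dump", EXCLUDE_PATTERNS), check=True, timeout=3600), so that a hung dump fails loudly instead of waiting forever.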