THEAP corruption after DB restart

13 Aug 2015

      Hi,

My DB contains around 1000 tables consisting of 100K-15M records each.
They all have the same schema, but I’m not using a single table because locality is required for the individual sets.

After restarting MonetDB, I’m experiencing data corruption in some of the tables (around 10-20):
- when selecting a single row from those tables (LIMIT 1), the mserver process crashes
- when calling the storage() function, the mserver process crashes
- the storage() function can only be used again after dropping the tables in question (by trial and error)

More information:
- The restart was performed as part of an EC2 instance stop/start, This should send a SIGTERM signal to monetdbd.
- The DB farm is stored on EBS.  Snapshots were made well before the restart, but the same tables are affected.
- The corrupted tables were used before successfully and the content has been unchanged since.
- No client connections were active at restart time.

mserver info:
...
2015-08-13 15:57:42 MSG DB[10633]: arguments: /usr/bin/mserver5 --dbpath=/opt/db-farm/DB --set merovingian_uri=mapi:monetdb://ip-HOST:50000/DB --set mapi_open=false --set mapi_port=0 --set mapi_usock=/opt/db-farm/DB/.mapi.sock --set monet_vault_key=/opt/db-farm/DB/.vaultkey --set gdk_nr_threads=4 --set max_clients=64 --set sql_optimizer=default_pipe --set monet_daemon=yes
2015-08-13 15:57:42 MSG DB[10633]: # MonetDB 5 server v11.19.15 "Oct2014-SP4"
2015-08-13 15:57:42 MSG DB[10633]: # Serving database ‘DB', using 4 threads
2015-08-13 15:57:42 MSG DB[10633]: # Compiled for x86_64-pc-linux-gnu/64bit with 64bit OIDs dynamically linked
2015-08-13 15:57:42 MSG DB[10633]: # Found 15.672 GiB available main-memory.
This error occurs a couple of times in the server logs, but only when first starting monetdbd:
...
2015-08-13 14:33:15 MSG: !ERROR: GDKload: cannot read: name=15/31/153162, ext=theap, 57144 bytes missing.
2015-08-13 14:33:15 MSG: !OS: No such file or directory
2015-08-13 14:33:15 MSG: !ERROR: GDKload: cannot read: name=15/25/152564, ext=theap, 58536 bytes missing.
...
2015-08-13 14:33:15 MSG: !OS: No such file or directory
And then each time an affected table is accessed:
...
2015-08-13 16:02:19 MSG merovingian[1041]: database ‘DB' (10640) was killed by signal SIGSEGV
I’m trying to reproduce the issue, but it would be helpful if anyone has any pointers.

My hunch would be that the table catalog and the theap files are not persisted in the same way, so when snapshotting or restarting, one of the two is out of sync.  Could it be that some dynamic column index is built, but then only persisted in mapped memory?
The comments at the top of the gdk/gdk_storage.c file seem to imply that this is not the case, but I’m not sure.

If so, how would one ensure that all changes are properly flushed to disk?  Would fsfreeze help?
What if an unexpected loss of power occurs?

Thanks,
- Dennis

THEAP corruption after DB restart

Dennis Lorson