Maintaining a node
Once your Neurobagel node is running and configured correctly, there are some recurring tasks you may have to do to keep it operating correctly.
Updating the Neurobagel services
Updating the Neurobagel Docker images
We are continuously improving Neurobagel tools and services, so you may want to update your Neurobagel node to the latest version to benefit from new features and bug fixes. We always publish our tools as Docker images on DockerHub.
Each Docker image has a semantic version tag (vX.Y.Z), and also two rolling tags:
latest
(the latest stable release). This is the default tag used in the Neurobageldocker-compose.yml
file.nightly
(the latest build from the main branch). This tag is only used for compatibility testing and should not be used in production.
You can pull the most recent docker images for Neurobagel tools by running:
docker compose --profile full_stack pull
Not sure what version you have?
Since latest
is a rolling tag, each latest
Docker image for a Neurobagel tool includes its corresponding semver number (vX.Y.X) as part of its Docker image labels.
You can find the labels for an image you have pulled in the image metadata, e.g.:
docker image inspect neurobagel/api:latest
docker image inspect --format='{{json .Config.Labels}}' neurobagel/api:latest
"Labels": {
"org.opencontainers.image.created": "https://github.com/neurobagel/api",
"org.opencontainers.image.revision": "01530f467e163f3dff595d3327bc60ba453de47d",
"org.opencontainers.image.version": "v0.3.1"
}
"org.opencontainers.image.version"
refers to the version number.
docker compose
will only pull the images used by the current deployment profile
If you don't specify a deployment profile, the default profile (full_stack
) will be used,
which pulls the images for all Neurobagel services including the node API, federation API, graph store, and graphical query tool.
See the deployment profiles section for more information on the available profiles.
Restarting services after an update
Whether you have updated the Docker images, the configuration, or the data of your Neurobagel node, you will need to restart the services to apply the changes.
To shut down a running Neurobagel node,
navigate to the path on your file system where
you have stored the docker-compose.yml
file from the initial setup and run:
docker compose --profile full_stack down
Then, to start the services again:
docker compose --profile full_stack up -d
Explicitly specify the deployment profile
To avoid unexpected behaviour when running docker compose
commands, we recommend always explicitly specifying the deployment profile you want to use with the -p
or --profile
flag.
Otherwise, docker compose
will only manage (start, stop, or update) the services in the default profile
(for more info, see Launching a profile).
Updating the data in your graph
The Neurobagel deployment recipe launches a dedicated graph database that stores the datasets for a single node.
The data in this graph database is loaded from the location specified in the
LOCAL_GRAPH_DATA
environment variable,
and can be changed at any time.
By default, the graph database will only contain an example dataset called BIDS synthetic
.
If you have followed the initial setup for deploying a Neurobagel node from our Docker Compose recipe, replacing the existing data in your graph database with your own data (or updated data) is a straightforward process.
Once you have generated or updated the JSONLD files you want to upload, to update the data in your graph:
-
Shut down the Neurobagel node, if it is already running
docker compose --profile full_stack down
(or, replace full_stack
with the profile you are using)
- Update the data files in the directory specified by the
LOCAL_GRAPH_DATA
variable in.env
, or simply change the path to a directory containing your JSONLD files. -
(Re)start the Neurobagel node
docker compose --profile full_stack up -d
Here are some other common scenarios where you might need to update the data in your graph:
Following a change in my dataset
When using Neurobagel tools on a dataset that is still undergoing data collection, you may need to update the Neurobagel annotations and/or graph-ready data for the dataset when you want to add new subjects or measurements or to correct mistakes in prior data versions.
For any of the below types of changes, you will need to regenerate a graph-ready .jsonld
file for the dataset which reflects the change.
If the phenotypic (tabular) data have changed
If new variables have been added to the dataset such that there are new columns in the phenotypic TSV you previously annotated using Neurobagel's annotation tool, you will need to:
-
Generate an updated data dictionary by annotating the new variables in your TSV following the annotation workflow
-
Generate a new graph-ready data file for the dataset by re-running the CLI on your updated TSV and data dictionary
If only the imaging data have changed
If the BIDS data for a dataset have changed without changes in the corresponding phenotypic TSV (e.g., if new modalities or scans have been acquired for a subject), you have two options:
- If you still have access to the dataset's phenotypic JSONLD generated from the
pheno
command of thebagel-cli
(step 1), you may choose to rerun only thebids
CLI command on the updated BIDS directory. This will generate a new graph-ready data file with updated imaging metadata of subjects.
OR
- Rerun the CLI entirely (
pheno
andbids
steps) to generate a new graph-ready data file for the dataset.
When in doubt, rerun both CLI commands.
If only the subjects have changed
If subjects have been added to or removed from the dataset but the phenotypic TSV is otherwise unchanged (i.e., only new or removed rows, without changes to the available variables), you will need to:
- Generate a new graph-ready data file for the dataset by re-running the CLI (
pheno
andbids
steps) on your updated TSV and existing data dictionary
Following a change in the Neurobagel data model
As Neurobagel continues developing the data model, new tool releases may introduce breaking changes to the data model for subject-level information in a .jsonld
graph data file.
Breaking changes will be highlighted in the release notes.
If you have already created .jsonld
files for a Neurobagel graph database but want to update your graph data to the latest Neurobagel data model following such a change, you can easily do so by rerunning the CLI on the existing data dictionaries and phenotypic TSVs for the dataset(s) in the graph.
This will ensure that if you use the latest version of the Neurobagel CLI to process new datasets (i.e., generate new .jsonld
files) for your database, the resulting data will not have conflicts with existing data in the graph.
Note that if upgrading to a newer version of the data model, you should regenerate the .jsonld
files for all datasets in your existing graph.
Re-uploading a modified dataset
To allow easy (re-)uploading of the updated .jsonld
for your dataset(s) to a graph database, we recommend making a copy of it in a central directory on your research data fileserver for storing local Neurobagel jsonld
datasets.
Then, simply follow the steps for uploading/updating a dataset in the graph database.
Updating your graph backend configuration
Updating existing database user permissions
If you want to change database access permissions (e.g., adding or removing access to a database) for an existing user in your GraphDB instance, you must do so manually.
Of note, in GraphDB, there is no straightforward REST API call to update a user's database access permissions without replacing the list of their existing database permissions ("grantedAuthorities"
) entirely.
Tip
You can verify a user's settings at any time with the following:
curl -u "admin:NewAdminPassword" http://localhost:7200/rest/security/users/DBUSER
Example: if user DBUSER
was granted read/write access to database my_db1
with the following command
(this command is run by default as part of graphdb_setup.sh
):
curl -X PUT --header 'Content-Type: application/json' -d '
{"grantedAuthorities": ["WRITE_REPO_my_db","READ_REPO_my_db"]}' http://localhost:7200/rest/security/users/DBUSER -u "admin:NewAdminPassword"
To grant DBUSER
read/write access to a second database my_db2
(while keeping the existing access to my_db1
),
you would rerun the above curl
command with all permissions (existing and new) specified since the existing permissions list will be overwritten:
curl -X PUT --header 'Content-Type: application/json' -d '
{"grantedAuthorities": ["WRITE_REPO_my_db1","READ_REPO_my_db1", "WRITE_REPO_my_db2","READ_REPO_my_db2"]}' http://localhost:7200/rest/security/users/DBUSER -u "admin:NewAdminPassword"
Similarly, to revoke my_db1
access so DBUSER
only has access to my_db2
:
curl -X PUT --header 'Content-Type: application/json' -d '
{"grantedAuthorities": ["WRITE_REPO_my_db2","READ_REPO_my_db2"]}' http://localhost:7200/rest/security/users/DBUSER -u "admin:NewAdminPassword"
Managing user permissions using the GraphDB Workbench
If you are managing multiple GraphDB databases, the web-based administration interface for a GraphDB instance, the Workbench, might be an easier way to manage user permissions than the REST API. More information on using the GraphDB Workbench can be found here.
Resetting your GraphDB instance
Each Neurobagel node has its own GraphDB instance, which is used to store the graph data for the node. If you want to reset your graph database and start again from scratch, follow these steps:
-
Ensure that your Neurobagel node is not running (i.e., shut down the Docker containers for the node).
docker compose --profile full_stack down
If you are not using the
full_stack
profile, replacefull_stack
with the name of the profile you are using. -
Delete the docker volume that contains the GraphDB data for your node.
docker volume rm neurobagel_node_graphdb_home
replace
neurobagel_node_graphdb_home
with the name of the volume that was created for your node. It is usually named<project_name>_graphdb_home
where<project_name>
is the name of your docker compose stack as defined inCOMPOSE_PROJECT_NAME
in the.env
file. -
Start your Neurobagel node again.
docker compose --profile full_stack up -d
If you are not using the
full_stack
profile, replacefull_stack
with the name of the profile you are using.
Use docker volume ls
to list all volumes on your system
You can use the docker volume ls
command to list all volumes on your system.
This will help you identify the name of the volume that was created for your Neurobagel node.
Some examples of when you might want to do this:
- You started but did not complete Neurobagel node setup previously and want to ensure you are using up-to-date instructions and recommended configuration options
- Your local node has stopped working after a configuration change to your graph database (e.g., your Neurobagel node API no longer starts or responds with an error, but you have confirmed all environment variables you have set should be correct)
Warning
This action will wipe any graph databases and users you previously created!