Databases

Overview

The primary type of database we use to store results in our group is  MongoDB . When running high-throughput calculations, using a database like MongoDB makes storing, querying, and analyzing data much easier and more reproducible.

Resources

I strongly recommend using Studio3T to interact with MongoDB, as it has a  free pro license for academics  and is easy to use. The Studio3T  Getting Started guide  has most of the things you need to learn how to use this program. When in doubt, ChatGPT is probably pretty good at suggesting how to make MongoDB queries. The "Visual Query Builder" in Studio3T is especially helpful.

MongoDB

If you're using Studio3T to connect to the MongoDB collection, you must be on the Global Protect VPN.
We have a MongoDB instance that runs on tiger-arrk. To connect to the MongoDB instance on tiger-arrk with Studio3T, use the following configuration:
  • Authentication
  • Authentication Mode: Legacy (SCRAM-SHA-1)
  • User name: Your MongoDB username (typically your NetID)
  • Password: Your MongoDB password (not your NetID password!). If you don't know this, it probably means you need to ask for an account to be made.
  • Authentication DB: Your database name (typically your NetID)
  • SSH
  • Enable "Use SSH tunnel to connect"
  • SSH Address:  tiger-arrk.princeton.edu 
  • SSH Username: Your NetID
  • SSH Auth Mode: Password (or your private key, if set up)
  • SSH Password (if not using a private key): Your NetID password