mongodb: mongod & logging notes

This commit is contained in:
Marcello 2022-09-08 00:16:32 +02:00
parent 197703eb04
commit 8f08bd96a4


@@ -84,9 +84,9 @@ db.<collection>.insertOne({document}) # implicit collection creation
{ "$expr": { "$<comparison_operator>": [ "$<key>", "$<key>" ] } } // compare field values (operators use aggregation syntax)
```
## Mongo Query Language (MQL)
### Insertion
It's possible to insert a single document with the command `insertOne()` or multiple documents with `insertMany()`.
@@ -126,7 +126,7 @@ db.<collection>.insertMany([ { document }, { document } ] , { "ordered": false }
> **Note**: If `insertMany()` fails, the documents already inserted are not rolled back, but none of the remaining ones (even valid documents) will be inserted.
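A minimal sketch of that behavior, assuming a hypothetical `people` collection and a duplicate `_id` in the middle of the batch:

```sh
// "people" is a hypothetical collection; the second document fails on the duplicate _id
db.people.insertMany(
  [ { "_id": 1 }, { "_id": 1 }, { "_id": 2 } ],
  { "ordered": false }  // keep going after the error, so { "_id": 2 } is still inserted
)
```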
### Querying
```sh
db.<collection>.findOne() # find only one document
@@ -168,7 +168,7 @@ db.<collection>.find().hint( { $natural : -1 } ) # force the query to perform a
> **Note**: if `<key>` holds an array, `{ <key>: <value> }` matches the documents whose array _contains_ the value
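For example, with a hypothetical `people` collection holding an array field `hobbies`:

```sh
db.people.insertOne({ "name": "ann", "hobbies": ["chess", "hiking"] })  # hypothetical data
db.people.find({ "hobbies": "chess" })  # matches: the hobbies array contains "chess"
```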
### Updating
[Update Operators](https://docs.mongodb.com/manual/reference/operator/update/ "Update Operators Documentation")
@@ -179,7 +179,7 @@ db.<collection>.updateOne(filter, update, {upsert: true}) # modify document if
db.<collection>.updateOne(filter, { "$push": { ... }, "$set": { ... }, "$inc": { ... }, ... })
```
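A hedged, concrete version of the line above (the `people` collection and its fields are made up):

```sh
db.people.updateOne(
  { "name": "ann" },                    // filter
  {
    "$push": { "hobbies": "running" },  // append a value to an array field
    "$set": { "city": "Rome" },         // set or overwrite a field
    "$inc": { "age": 1 }                // increment a numeric field
  }
)
```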
### Deletion
```sh
db.<collection>.deleteOne(filter, options)
@@ -189,7 +189,11 @@ db.<collection>.drop() # delete whole collection
db.dropDatabase() # delete entire database
```
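A short sketch (hypothetical `people` collection):

```sh
db.people.deleteOne({ "name": "ann" })          # delete the first matching document
db.people.deleteMany({ "age": { "$lt": 18 } })  # delete every matching document
```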
---
## MongoDB Database Tools
### [Mongoimport](https://docs.mongodb.com/database-tools/mongoimport/)
Utility to import documents into a specified collection.
If the collection already exists, `--drop` deletes it before importing the data.
@@ -210,7 +214,7 @@ mongoimport <options> <connection-string> <file>
--jsonArray # Accepts the import of data expressed with multiple MongoDB documents within a single JSON array. MAX 16 MB
```
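A hedged usage example (the connection string, database, collection and file names are made up):

```sh
# re-create the people collection of mydb from a JSON array file
mongoimport --uri="mongodb://localhost:27017/mydb" --collection=people --drop --jsonArray people.json
```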
### [Mongoexport](https://docs.mongodb.com/database-tools/mongoexport/)
Utility to export documents into a specified file.
@@ -232,7 +236,7 @@ mongoexport --collection=<collection> <options> <connection-string>
--sort=<JSON> # Specifies an ordering for exported results
```
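A hedged usage example (names are illustrative):

```sh
# export the people collection of mydb, sorted by name, into people.json
mongoexport --uri="mongodb://localhost:27017/mydb" --collection=people --sort='{ "name": 1 }' --out=people.json
```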
### [Mongodump][mongodump_docs] & [Mongorestore][mongorestore_docs]
`mongodump` exports the content of a running server into `.bson` files.
@@ -241,47 +245,7 @@ mongoexport --collection=<collection> <options> <connection-string>
[mongodump_docs]: https://docs.mongodb.com/database-tools/mongodump/
[mongorestore_docs]: https://docs.mongodb.com/database-tools/mongorestore/
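`mongorestore` loads such a dump back into a running instance. A hedged sketch (URIs and paths are illustrative):

```sh
mongodump --uri="mongodb://localhost:27017/mydb" --out=./dump  # one .bson + metadata file per collection
mongorestore --uri="mongodb://localhost:27017" --drop ./dump   # drop existing collections, then restore the dump
```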
## Relations
**Nested / Embedded Documents**:
- Group data logically
- Optimal for data belonging together that do not overlap
- Should avoid nesting too deeply or making overly long arrays (max document size is 16 MB)
```json
{
"_id": "ObjectId()",
"<key>": "value",
"<key>": "value",
"innerDocument": {
"<key>": "value",
"<key>": "value"
}
}
```
**References**:
- Divide data between collections
- Optimal for related data that is shared or also used on its own
- Allows working around the nesting and size limits
NoSQL databases do not enforce relations and references; it's the application that has to handle them.
```json
{
"<key>": "value",
"references": ["id1", "id2"]
}
// referenced
{
"_id": "id1",
"<key>": "value"
}
```
## [Indexes](https://docs.mongodb.com/manual/indexes/ "Index Documentation")
@@ -351,24 +315,7 @@ db.<collection>.dropIndexes() # drop all indexes
db.<collection>.dropIndex( { "index-name": 1 } ) # drop a specific index
```
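A hedged sketch of creating, checking, and dropping an index (the `people` collection and its fields are made up):

```sh
db.people.createIndex({ "name": 1, "age": -1 })              # compound index: ascending name, descending age
db.people.find({ "name": "ann" }).explain("executionStats")  # verify the query uses the index (IXSCAN stage)
db.people.dropIndex({ "name": 1, "age": -1 })                # drop it by key pattern
```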
---
## Roles and permissions
@@ -406,7 +353,109 @@ db.createUser(
)
```
---
## Cluster Administration
### `mongod`
`mongod` is the main daemon process for MongoDB. It's the core process of the database,
handling connections and requests, and persisting the data.
`mongod` default configuration:
- port: `27017`
- dbpath: `/data/db`
- bind_ip: `localhost`
- auth: disabled
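A minimal sketch making those defaults explicit on the command line (the config file path is illustrative):

```sh
mongod --port 27017 --dbpath /data/db --bind_ip localhost  # same as the defaults above
mongod --config /etc/mongod.conf                           # or read the options from a config file
```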
[`mongod` config file][mongod_config_file]
[`mongod` command line options][mongod_cli_options]
[mongod_config_file]: https://www.mongodb.com/docs/manual/reference/configuration-options "`mongod` config file docs"
[mongod_cli_options]: https://www.mongodb.com/docs/manual/reference/program/mongod/#options "`mongod` command line options docs"
### Basic Shell Helpers
```sh
db.<method>() # database interaction
db.<collection>.<method>() # collection interaction
rs.<method>(); # replica set deployment and management
sh.<method>(); # sharded cluster deployment and management
# user management
db.createUser()
db.dropUser()
# collection management
db.renameCollection()
db.<collection>.createIndex()
db.<collection>.drop()
# database management
db.dropDatabase()
db.createCollection()
# database status
db.serverStatus()
# database commands (the layer underlying shell helpers and drivers)
db.runCommand({ "<command>": <value> })
# help
db.commandHelp("<command>")
```
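For example, using the real `dbStats` command (any other command name works the same way):

```sh
db.runCommand({ "dbStats": 1 })  # same information returned by db.stats()
db.commandHelp("dbStats")        # help text for the command
```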
### Logging
The **process log** records activity on the MongoDB instance, collecting messages from its various components.
Log Verbosity Level:
- `-1`: Inherit from parent
- `0`: Default Verbosity (Information)
- `1 - 5`: Increases the verbosity up to Debug messages
```sh
db.getLogComponents() # get components and their verbosity
db.adminCommand({"getLog": "<scope>"}) # retrieve logs (getLog must be run on admin db -> adminCommand)
db.setLogLevel(<level>, "<component>") # set log level (the output shows the previous verbosity levels)
tail -f /path/to/mongod.log # read end of log file
```
> **Note**: Log Message Structure: `<timestamp> <severity-level> <component> <connection> <event> ...`
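A short example; `"index"` is one of the real log component names:

```sh
db.setLogLevel(2, "index")               # raise verbosity for the index component only
db.adminCommand({ "getLog": "global" })  # "global" retrieves the most recent log entries
db.setLogLevel(-1, "index")              # back to inheriting the parent's level
```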
### Database Profiling
Profiling Levels:
- `0`: no profiling
- `1`: data on operations slower than `slowms` (default 100ms)
- `2`: data on all operations
Events captured by the profiler:
- CRUD operations
- Administrative operations
- Configuration operations
> **Note**: Logs are saved in the `system.profile` _capped_ collection.
```sh
db.setProfilingLevel(n) # set profiler level
db.setProfilingLevel(1, { slowms: <ms> })
db.getProfilingStatus() # check profiler status
db.system.profile.find().limit(n).sort( {} ).pretty() # see logs
db.system.profile.find().limit(n).sort( { ts : -1 } ).pretty() # sort by decreasing timestamp
```
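For instance, once level 1 is active, slow operations can be inspected through the fields the profiler records (e.g. `op`, `millis`, `ts`):

```sh
db.setProfilingLevel(1, { slowms: 50 })
db.system.profile.find({ "millis": { "$gt": 50 } }).sort({ "ts": -1 }).limit(5).pretty()
```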
### [Replica set](https://docs.mongodb.com/manual/replication/)
A **replica set** in MongoDB is a group of `mongod` processes that maintain the _same dataset_. Replica sets provide redundancy and high availability, and are the basis for all production deployments.
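A hedged sketch of a three-member replica set on one machine (the `rs0` name, ports and paths are made up):

```sh
mongod --replSet rs0 --port 27017 --dbpath /data/db1  # start each member with the same --replSet name
rs.initiate({
  "_id": "rs0",
  "members": [
    { "_id": 0, "host": "localhost:27017" },
    { "_id": 1, "host": "localhost:27018" },
    { "_id": 2, "host": "localhost:27019" }
  ]
})
rs.status()  # shows which member is primary and the health of the others
```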
### Sharding
**Sharding** is a MongoDB concept through which big datasets are subdivided into smaller sets and distributed across multiple MongoDB instances.
It's a technique used to improve the performance of queries over large amounts of data that would otherwise require a lot of resources from the server.
@@ -422,15 +471,23 @@ Shard components are:
![Sharded Cluster](../img/mongodb_shared-cluster.png "Components of a sharded cluster")
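A hedged sketch of the shell helpers involved (database, collection and shard key are hypothetical):

```sh
sh.enableSharding("mydb")                            # allow collections of mydb to be sharded
sh.shardCollection("mydb.people", { "country": 1 })  # distribute documents by the "country" shard key
sh.status()                                          # overview of shards and chunk distribution
```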
---
## [Aggregation Framework](https://docs.mongodb.com/manual/reference/operator/aggregation-pipeline/)
Sequence of operations applied to a collection as a _pipeline_ to get a result: `db.collection.aggregate(pipeline, options)`.
Each step of the pipeline acts on its inputs and not on the original data in the collection.
### Variables
Variable syntax in aggregations:
- `$key`: field path
- `$$UPPERCASE`: system variable (e.g.: `$$CURRENT`)
- `$$foo`: user defined variable
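A small sketch of the three forms (hypothetical `orders` collection; `$$ROOT` is a system variable, `$$item` a user-defined one):

```sh
db.orders.aggregate([
  { "$project": {
      "original": "$$ROOT",      // system variable: the whole input document
      "prices": "$items.price",  // field path into an array of sub-documents
      "discounted": {
        "$map": {
          "input": "$items",
          "as": "item",          // defines the user variable $$item
          "in": { "$multiply": ["$$item.price", 0.9] }
        }
      }
  } }
])
```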
### Aggregation Syntax
```sh
db.<collection>.aggregate([