diff --git a/docs/performance_test/100_monitors.png b/docs/performance_test/100_monitors.png new file mode 100644 index 0000000..58abd6b Binary files /dev/null and b/docs/performance_test/100_monitors.png differ diff --git a/docs/performance_test/10_monitors.png b/docs/performance_test/10_monitors.png new file mode 100644 index 0000000..6f41391 Binary files /dev/null and b/docs/performance_test/10_monitors.png differ diff --git a/docs/performance_test/150_monitors.png b/docs/performance_test/150_monitors.png new file mode 100644 index 0000000..f03117a Binary files /dev/null and b/docs/performance_test/150_monitors.png differ diff --git a/docs/performance_test/README.md b/docs/performance_test/README.md index bad5653..60524f6 100644 --- a/docs/performance_test/README.md +++ b/docs/performance_test/README.md @@ -106,19 +106,19 @@ In order to monitor the ressource consumption during our tests we need to create We create 4 different queries using Prometheus as the data source. For each query we can use the `code` mode to create them from a PromQL query. -## OC stack consumption +### OC stack consumption ``` sum(container_memory_usage_bytes{name=~"oc-auth|oc-datacenter|oc-scheduler|oc-front|oc-schedulerd|oc-workflow|oc-catalog|oc-peer|oc-workspace|loki|mongo|traefik|nats"}) ``` -## Monitord consumption +### Monitord consumption ``` sum(container_memory_usage_bytes{image="oc-monitord"}) ``` -## Total RAM consumption +### Total RAM consumption ``` sum( @@ -128,8 +128,24 @@ sum( ) ``` -## Number of monitord containers +### Number of monitord containers ``` count(container_memory_usage_bytes{image="oc-monitord"} > 0) -``` \ No newline at end of file +``` + +# Launch executions + +We will use a script to insert in the DB the executions that will create the monitord containers. + +We need to retrieve two informations to execute the scripted insertion : + +- The **workflow id** for the workflow we want to instantiate, this is can be located in the DB +- A **token** to authentify against the API, connect to oc-front and retrieve the token in your browser network analyzer tool. + +Add these to the `insert_exex.sh` script. + +The script takes two arguments : +- **$1** : the number of executions, which are created by chunks of 10 using a CRON expression to create 10 execution**S** for each execution/namespace + +- **$2** : the number of minutes between now and the execution time for the executions. \ No newline at end of file diff --git a/docs/performance_test/insert_exec.sh b/docs/performance_test/insert_exec.sh index 2a07c14..6d54705 100755 --- a/docs/performance_test/insert_exec.sh +++ b/docs/performance_test/insert_exec.sh @@ -1,6 +1,7 @@ #!/bin/bash -TOKEN="LCSg2svSFdIBPucRCCmz7UrDmZKmsvODXU5PM0yWdYc.-LPNAoycUm5Kwm1IcyyP-wJvc8Zy7er36ozpjTWICKc.{"session":{"access_token":{"DELETE__AUTH_GROUP_USER_ID_GROUP_ID":"/auth/group/:user_id/:group_id","DELETE__AUTH_PERMISSION_PERMISSION_ID_ROLE_ID_RELATION":"/auth/permission/:permission_id/:role_id/:relation","DELETE__AUTH_ROLE_USER_ID_ROLE_ID":"/auth/role/:user_id/:role_id","DELETE__CATALOG_COMPUTE_ID":"/catalog/compute/:id","DELETE__CATALOG_DATA_ID":"/catalog/data/:id","DELETE__CATALOG_PROCESSING_ID":"/catalog/processing/:id","DELETE__CATALOG_RESOURCE_ID":"/catalog/resource/:id","DELETE__CATALOG_STORAGE_ID":"/catalog/storage/:id","DELETE__CATALOG_WORKFLOW_ID":"/catalog/workflow/:id","DELETE__SCHEDULER_ID":"/scheduler/:id","DELETE__WORKFLOW_ID":"/workflow/:id","DELETE__WORKSPACE_ID":"/workspace/:id","GET__CATALOG_COMPUTE_":"/catalog/compute/","GET__CATALOG_COMPUTE_ID":"/catalog/compute/:id","GET__CATALOG_COMPUTE_SEARCH_SEARCH":"/catalog/compute/search/:search","GET__CATALOG_DATA_":"/catalog/data/","GET__CATALOG_DATA_ID":"/catalog/data/:id","GET__CATALOG_DATA_SEARCH_SEARCH":"/catalog/data/search/:search","GET__CATALOG_ENUM_BOOKING_STATUS":"/catalog/enum/booking/status","GET__CATALOG_ENUM_INFRASTRUCTURE":"/catalog/enum/infrastructure","GET__CATALOG_ENUM_PRICING_REFUND_TYPE":"/catalog/enum/pricing/refund/type","GET__CATALOG_ENUM_PRICING_STRATEGY_BUY":"/catalog/enum/pricing/strategy/buy","GET__CATALOG_ENUM_PRICING_STRATEGY_DATA":"/catalog/enum/pricing/strategy/data","GET__CATALOG_ENUM_PRICING_STRATEGY_PRIVILEGE":"/catalog/enum/pricing/strategy/privilege","GET__CATALOG_ENUM_PRICING_STRATEGY_PRIVILEGE_STORAGE":"/catalog/enum/pricing/strategy/privilege/storage","GET__CATALOG_ENUM_PRICING_STRATEGY_STORAGE":"/catalog/enum/pricing/strategy/storage","GET__CATALOG_ENUM_PRICING_STRATEGY_TIME":"/catalog/enum/pricing/strategy/time","GET__CATALOG_ENUM_RESOURCE_TYPE":"/catalog/enum/resource/type","GET__CATALOG_ENUM_STATUS":"/catalog/enum/status","GET__CATALOG_ENUM_STORAGE_SIZE":"/catalog/enum/storage/size","GET__CATALOG_ENUM_STORAGE_TYPE":"/catalog/enum/storage/type","GET__CATALOG_PROCESSING_":"/catalog/processing/","GET__CATALOG_PROCESSING_ID":"/catalog/processing/:id","GET__CATALOG_PROCESSING_SEARCH_SEARCH":"/catalog/processing/search/:search","GET__CATALOG_RESOURCE_":"/catalog/resource/","GET__CATALOG_RESOURCE_ID":"/catalog/resource/:id","GET__CATALOG_RESOURCE_SEARCH_SEARCH":"/catalog/resource/search/:search","GET__CATALOG_STORAGE_":"/catalog/storage/","GET__CATALOG_STORAGE_ID":"/catalog/storage/:id","GET__CATALOG_STORAGE_SEARCH_SEARCH":"/catalog/storage/search/:search","GET__CATALOG_VERSION_":"/catalog/version/","GET__CATALOG_VERSION_STATUS":"/catalog/version/status","GET__CATALOG_WORKFLOW_":"/catalog/workflow/","GET__CATALOG_WORKFLOW_ID":"/catalog/workflow/:id","GET__CATALOG_WORKFLOW_SEARCH_SEARCH":"/catalog/workflow/search/:search","GET__DATACENTER_":"/datacenter/","GET__DATACENTER_ADMIRALTY_KUBECONFIG_EXECUTION":"/datacenter/admiralty/kubeconfig/:execution","GET__DATACENTER_ADMIRALTY_NODE_EXECUTION_PEER":"/datacenter/admiralty/node/:execution/:peer","GET__DATACENTER_ADMIRALTY_SECRET_EXECUTION_PEER":"/datacenter/admiralty/secret/:execution/:peer","GET__DATACENTER_ADMIRALTY_TARGETS":"/datacenter/admiralty/targets","GET__DATACENTER_ADMIRALTY_TARGETS_EXECUTION":"/datacenter/admiralty/targets/:execution","GET__DATACENTER_BOOKING_":"/datacenter/booking/","GET__DATACENTER_BOOKING_CHECK_ID_START_DATE_END_DATE":"/datacenter/booking/check/:id/:start_date/:end_date","GET__DATACENTER_BOOKING_ID":"/datacenter/booking/:id","GET__DATACENTER_BOOKING_SEARCH_EXECUTION_ID":"/datacenter/booking/search/execution/:id","GET__DATACENTER_BOOKING_SEARCH_START_DATE_END_DATE":"/datacenter/booking/search/:start_date/:end_date","GET__DATACENTER_ID":"/datacenter/:id","GET__DATACENTER_SESSION_TOKEN_ID_DURATION":"/datacenter/session/token/:id/:duration","GET__DATACENTER_VERSION_":"/datacenter/version/","GET__DATACENTER_VERSION_STATUS":"/datacenter/version/status","GET__PEER_":"/peer/","GET__PEER_ID":"/peer/:id","GET__PEER_SEARCH_SEARCH":"/peer/search/:search","GET__PEER_VERSION_":"/peer/version/","GET__SCHEDULER_EXECUTION_":"/scheduler/execution/","GET__SCHEDULER_EXECUTION_ID":"/scheduler/execution/:id","GET__SCHEDULER_EXECUTION_SEARCH_SEARCH":"/scheduler/execution/search/:search","GET__SCHEDULER_EXECUTION_SEARCH_START_DATE_END_DATE":"/scheduler/execution/search/:start_date/:end_date","GET__SCHEDULER_ID_ORDER":"/scheduler/:id/order","GET__SCHEDULER_VERSION_":"/scheduler/version/","GET__SCHEDULER_VERSION_STATUS":"/scheduler/version/status","GET__WORKFLOW_":"/workflow/","GET__WORKFLOW_CHECK_ID_START_DATE_END_DATE":"/workflow/check/:id/:start_date/:end_date","GET__WORKFLOW_ID":"/workflow/:id","GET__WORKFLOW_SEARCH_SEARCH":"/workflow/search/:search","GET__WORKFLOW_VERSION_":"/workflow/version/","GET__WORKFLOW_VERSION_STATUS":"/workflow/version/status","GET__WORKSPACE_":"/workspace/","GET__WORKSPACE_ID":"/workspace/:id","GET__WORKSPACE_SEARCH_SEARCH":"/workspace/search/:search","GET__WORKSPACE_VERSION_":"/workspace/version/","GET__WORKSPACE_VERSION_STATUS":"/workspace/version/status","POST__AUTH_GROUP_ID":"/auth/group/:id","POST__AUTH_ROLE_ID":"/auth/role/:id","POST__CATALOG_COMPUTE_":"/catalog/compute/","POST__CATALOG_DATA_":"/catalog/data/","POST__CATALOG_PROCESSING_":"/catalog/processing/","POST__CATALOG_STORAGE_":"/catalog/storage/","POST__CATALOG_WORKFLOW_":"/catalog/workflow/","POST__DATACENTER_ADMIRALTY_SECRET_EXECUTION_PEER":"/datacenter/admiralty/secret/:execution/:peer","POST__DATACENTER_ADMIRALTY_SOURCE_EXECUTION":"/datacenter/admiralty/source/:execution","POST__DATACENTER_ADMIRALTY_TARGET_EXECUTION_PEER":"/datacenter/admiralty/target/:execution/:peer","POST__DATACENTER_BOOKING_":"/datacenter/booking/","POST__PEER_ID_BLACKLIST":"/peer/:id/blacklist","POST__PEER_ID_PARTNER":"/peer/:id/partner","POST__PEER_ID_UNDO_STATE":"/peer/:id/undo_state","POST__PEER_STATUS_":"/peer/status/","POST__SCHEDULER_ID":"/scheduler/:id","POST__SCHEDULER_LOKI_":"/scheduler/loki/","POST__WORKFLOW_":"/workflow/","POST__WORKFLOW_PUBLISH_ID":"/workflow/publish/:id","POST__WORKSPACE_":"/workspace/","PUT__CATALOG_COMPUTE_ID":"/catalog/compute/:id","PUT__CATALOG_DATA_ID":"/catalog/data/:id","PUT__CATALOG_PROCESSING_ID":"/catalog/processing/:id","PUT__CATALOG_STORAGE_ID":"/catalog/storage/:id","PUT__CATALOG_WORKFLOW_ID":"/catalog/workflow/:id","PUT__DATACENTER_BOOKING_ID":"/datacenter/booking/:id","PUT__PEER_ID":"/peer/:id","PUT__WORKFLOW_ID":"/workflow/:id","PUT__WORKSPACE_ID":"/workspace/:id","exp":1748268804},"id_token":{"client_id":"22487339-e0c5-4d87-9cf1-29ddaaed117e","groups":[],"peer_id":"c0cece97-7730-4c2a-8c20-a30944564106","signature":"FxGjKZdOns2UW36UwCKiy1xSGxLNZjP6tid2wCg6hGtsVc1gZ/o8tMpTZ7uR/9Km1c+cXLWkb6VjWrv6ShLAXCux/of4oJoR+IAQ/7BOxCmm2GXMR/SRvJmJvehgQpJbFq0lYPCBdFPc/6FBHCpcJjDFHinep6cHaNMkJKqrrhtQk837R+vrRSOLun7ySHNnUHmlqNfWGtgxOyTWfP9UgAOl/01BasVYxL9uM/Tl3LtGQc/q/i1X+B31JZDE9BDO9LEtRFV3MD2al6a4436LNJESHURIYuJzs4CgwRCSOMkd20lvo8e3n92Ja8/X8PcbIzwkeAsHySuN67mKF7Sxv+zN0foo/MQlJ4+0AoUXJnHwXYD5TILKsCMYaQSJulBUm362hN6F175dVpxvh6cPE7QMtdRfRoGiaus0x1JK7+WgWEK1T8nt+RPqVCxlNERKEUTkbktzFiOyr3QpI2VRhm1uCMTKPFHeA9W1L6zJ1ZqJ76NdrHEt5k8T2vlxHImfUmXQzTVeyz+q4wkMgeuLg39h8UlOmjJgwCZRLgyC+d2KiJRzSh57oj0h1nYpSOPWPDx1xqVCgedVYg5OoXlNgptMouFbbnbr3q0p+NVNajqwalfv3x19w0UlAokgjWz9OmMFBvOjHe8POy6bXNK1+/+Kp1Q7VP9FeFfJzeXN49w=","username":"admin"}}}" +TOKEN="" +WORFLOW="" NB_EXEC=$1 TIME=$2 @@ -48,7 +49,7 @@ for ((i = 1; i <= $EXECS; i++)); do # echo $PAYLOAD - curl -X 'POST' 'http://localhost:8000/scheduler/62c55a70-e295-45e6-9925-a98137d59abc' \ + curl -X 'POST' "http://localhost:8000/scheduler/$WORKFLOW" \ -H 'accept: application/json' \ -H 'Content-Type: application/json' \ -d "$PAYLOAD" \ diff --git a/docs/performance_test/performance_report.md b/docs/performance_test/performance_report.md index e69de29..eaa4d6a 100644 --- a/docs/performance_test/performance_report.md +++ b/docs/performance_test/performance_report.md @@ -0,0 +1,43 @@ +We used a very simple mono node workflow which execute a simple sleep command within an alpine container + +![](wf_test_ram_1node.png) + +# 10 monitors + +![alt text](10_monitors.png) + +# 100 monitors + +![alt text](100_monitors.png) + +# 150 monitors + +![alt text](150_monitors.png) + +# Observations + +We see an increase in the memory usage by the OC stack which initially is around 600/700 MiB : + +``` +CONTAINER ID NAME CPU % MEM USAGE / LIMIT MEM % NET I/O BLOCK I/O PIDS +7ce889dd97cc oc-auth 0.00% 21.82MiB / 11.41GiB 0.19% 125MB / 61.9MB 23.3MB / 5.18MB 9 +93be30148a12 oc-catalog 0.14% 17.52MiB / 11.41GiB 0.15% 300MB / 110MB 35.1MB / 242kB 9 +611de96ee37e oc-datacenter 0.32% 21.85MiB / 11.41GiB 0.19% 38.7MB / 18.8MB 14.8MB / 0B 9 +dafb3027cfc6 oc-front 0.00% 5.887MiB / 11.41GiB 0.05% 162kB / 3.48MB 1.65MB / 12.3kB 7 +d7601fd64205 oc-peer 0.23% 16.46MiB / 11.41GiB 0.14% 201MB / 74.2MB 27.6MB / 606kB 9 +a78eb053f0c8 oc-scheduler 0.00% 17.24MiB / 11.41GiB 0.15% 125MB / 61.1MB 17.3MB / 1.13MB 10 +bfbc3c7c2c14 oc-schedulerd 0.07% 15.05MiB / 11.41GiB 0.13% 303MB / 293MB 7.58MB / 176kB 9 +304bb6a65897 oc-workflow 0.44% 107.6MiB / 11.41GiB 0.92% 2.54GB / 2.65GB 50.9MB / 11.2MB 10 +62e243c1c28f oc-workspace 0.13% 17.1MiB / 11.41GiB 0.15% 193MB / 95.6MB 34.4MB / 2.14MB 10 +3c9311c8b963 loki 1.57% 147.4MiB / 11.41GiB 1.26% 37.4MB / 16.4MB 148MB / 459MB 13 +01284abc3c8e mongo 1.48% 86.78MiB / 11.41GiB 0.74% 564MB / 1.48GB 35.6MB / 5.35GB 94 +14fc9ac33688 traefik 2.61% 49.53MiB / 11.41GiB 0.42% 72.1MB / 72.1MB 127MB / 2.2MB 13 +4f1b7890c622 nats 0.70% 78.14MiB / 11.41GiB 0.67% 2.64GB / 2.36GB 17.3MB / 2.2MB 14 + +Total 631.2 Mb +``` + +However over time with the repetition of a large number of scheduling that the stacks uses a larger amount of RAM. + +Espacially it seems that **loki**, **nats**, **mongo**, **oc-datacenter** and **oc-workflow** grow overs 150 MiB. This can be explained by the cache growing in these containers, which seems to be reduced every time the containers are restarted. + diff --git a/docs/performance_test/wf_test_ram_1node.png b/docs/performance_test/wf_test_ram_1node.png new file mode 100644 index 0000000..b1cc69a Binary files /dev/null and b/docs/performance_test/wf_test_ram_1node.png differ diff --git a/performance_test b/performance_test new file mode 100644 index 0000000..b1cc69a Binary files /dev/null and b/performance_test differ