You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: xdmod/README.md
+77-5
Original file line number
Diff line number
Diff line change
@@ -1,8 +1,82 @@
1
-
## Overview
1
+
## Pre-seeding XDMoD with data for tutorial
2
+
Due to time constraints for the half day tutorial we will skip the manual setup steps and have provided a database populated with this information.
3
+
These manual steps involve completing `xdmod-setup` and shredding, ingesting and aggregating HPC job accounting and performance data.
4
+
If you'd like to walk through these steps yourself, you can delete the XDMoD databases and start from scratch. [Follow the detailed instructions below](#getting-started).
2
5
3
-
**NOTE:**
4
-
Due to this tutorial being virtual and much shorter than anticipated; this part of the tutorial is going to be a bit more of an interactive demo. Some parts are going to be skipped over quicker than usual, however, our team is available in SLACK and Zoom chat to answer any questions that you may have.
6
+
## Tutorial: Using XDMoD
5
7
8
+
### Center Staff / Center Director View
9
+
- Login as `sfoster` password `ilovelinux`
10
+
11
+
Directly after logging in users assigned the Center Staff or Center Director role will be greeted with a dashboard that
12
+
helps manage and get a sense of the health of a center as a whole. In addition to the default charts, the dashboard can be
13
+
further customized on a user by user basis to meet the needs of each individual user / center.
14
+
15
+
- Click the "CPU Hours and Number of Jobs - Top 20 Users" chart
16
+
17
+
Clicking on any of the charts located in the upper two rows of the dashboard will zoom in to give the user a more detailed look at the
18
+
data contained within. You can close the zoomed in view by pressing the `esc` button or clicking the `x` located in the upper right hand corner
19
+
of the frame. An important and useful feature to note is that while the chart is zoomed in a "Open in Metric Explorer"
20
+
button is visible at the bottom of the frame. Clicking this button will open this chart in our "Metric Explorer".
21
+
22
+
- Click the "Open in Metric Explorer" button.
23
+
24
+
The Metric Explorer is meant to be the main method of interacting with existing and new charts. It provides the
25
+
user with the most control over what and how data is displayed. While working with a chart in the Metric Explorer you
26
+
can also choose to make it available in the Report Generator. The Report Generator provides the ability to have any
27
+
number of charts automatically generated and sent to you on a regular basis.
28
+
29
+
- Click the "Available for Report" checkbox
30
+
- Click the "Report Generator" tab
31
+
32
+
You should see the chart from the Metric Explorer in the panel on the right hand side of the screen.
33
+
34
+
- Click the "New" button in the "My Reports" toolbar
35
+
- Drag and Drop the chart from the "Available Charts" to the "Included Charts" section of the newly created report.
36
+
- Click "Save"
37
+
- Click "Download"
38
+
- Click "As PDF"
39
+
- Click "View Report" when the report is done generating.
40
+
41
+
- Click the "Efficiency" Tab
42
+
43
+
The Efficiency tab provides Center Staff with a powerful tool to quickly identify users who may be in need of help with
44
+
their jobs. By default four analytics are provided for categorizing a users jobs, CPU Usage, GPU Usage, Memory Use, and Homogeniety.
45
+
In this tutorial we will focus on CPU Usage.
46
+
47
+
- Click the "CPU Usage" chart.
48
+
49
+
This chart is organized such that the x axis is Average CPU %: Idle while the y axis is the total number of CPU Hours.
50
+
This orientation ensures that the points of greatest interest are always located in the upper right hand side. To further
51
+
ease identification, the points in the upper right hand quandrant are colored red.
52
+
53
+
- Click a red point in the upper hand quadrant of the chart.
54
+
55
+
You should now be presented with a histogram of percentage time that the CPU cores were idle compared to the overall usage.
56
+
57
+
- Click one of the bars in the histogram
58
+
59
+
You should now presented with a list of the jobs that make up this bar.
60
+
61
+
- Click a job.
62
+
63
+
Clicking a job has brought us to the "Job Viewer", this tab provides the ability to view the job level accounting and
64
+
performance statistics, including timeseries data for this particular job.
65
+
66
+
- Expand the 'Timeseries' tree item
67
+
- Click the "CPU User" item
68
+
69
+
While viewing timeseries data you are able to drill down by further clicking on a nodes data point, in this case the first click will drill down to that points node
70
+
clicking a point in this chart will then take you to that points CPU.
71
+
72
+
### Principal Investigator and User View
73
+
Due to time constraints the Principal Investigator and User Views willl not be explored in any appreciable depth, but
74
+
each role does have a Dashboard that has been customized to their needs and has access to the same basic features of XDMoD
75
+
as a Center Staff / Director user. The major difference being that Principal Investigators can see the jobs for all of
76
+
their associated users while a normal user can only see their own jobs.
77
+
78
+
79
+
## Getting Started
6
80
In this part of the tutorial we are going to go over the installation and configuration of Open XDMoD.
7
81
The base component of Open XDMoD uses the job accounting logs from your HPC
8
82
resource manager as the data source. We have also installed the optional Job Performance Module. This allows Open XDMoD to also display performance data for HPC jobs.
@@ -13,8 +87,6 @@ Command Line Demos in a Light color, are meant to be watched. Dark theme are int
13
87
14
88
## Submit some jobs to the cluster
15
89
16
-
**NOTE:** For the PEARC2021 tutorial the Presenter has already done this on their machine. If you are interested in running this on your own please do so.
17
-
18
90
Before we install and configure XDMoD we are going to submit
19
91
some HPC jobs to the cluster. This will ensure that we'll have something to view when we're done setting up XDMoD.
0 commit comments