I want to get a breakdown of the total read/write/other IOPS over a specific period, for example 24 hours or a week. OCUM only seems to provide avg/min/max. Is there another way to get the totals I want?
The screenshot @pedro_rocha provided was captured from a NAbox deployment, which @robverhoeven suggested earlier in the thread. NAbox uses NetApp Harvest, Graphite, and Grafana to provide a customizable, holistic view of performance. Harvest is the data collector, Graphite is the metrics database, and Grafana provides the dashboards.
There is a short intro video, screenshots, and everything you need to set up NAbox.
OCUM does not present the total IOPS for a period of time, such as 24 hours. However, the zoom view shown above gives you the average IOPS over the selected period. You can use that to calculate the total IOPS (that is, the total number of I/O operations) as (average IOPS * number of seconds in the period of interest). This number grows quickly as the period gets longer, which is why OCUM shows only IOPS averages: it avoids very large numbers.
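As a minimal sketch of that calculation, the Python below multiplies an average IOPS figure by the number of seconds in the period. The function name `total_operations` and the read/write/other averages are hypothetical, made up for illustration; substitute the averages you read from the OCUM zoom view.

```python
from datetime import timedelta


def total_operations(avg_iops: float, period: timedelta) -> int:
    """Estimate total I/O operations as average IOPS * seconds in the period."""
    return round(avg_iops * period.total_seconds())


# Hypothetical averages for illustration only, not taken from a real system.
avg_iops_by_type = {"read": 900.0, "write": 500.0, "other": 100.0}
day = timedelta(hours=24)

totals = {kind: total_operations(avg, day) for kind, avg in avg_iops_by_type.items()}
for kind, total in totals.items():
    print(f"{kind}: {total:,} operations in 24 hours")
print(f"all: {sum(totals.values()):,} operations in 24 hours")
```

For a week, pass `timedelta(weeks=1)` instead, together with the average taken over that same week.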
For a 72-hour period, there is an inventory page that lists all the volumes and displays the average IOPS for each. Again, from there you can calculate the total IOPS for the 72 hours.
The average you see in the zoomed graph, together with the min, max, and 95th percentile, is the average of the IOPS values at every measurement point in the period. The variations in IOPS during that period are therefore already captured by that average number.
average = (sum of IOPS every point in time) / (total points in time)
Since you are looking for the sum of IOPS across all points in time, you can multiply the average by the number of seconds in the period of interest, which spreads the average across every second of that period.
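To see why the average is enough, here is a small sketch that compares the two routes to a total. It assumes evenly spaced measurement points and that each sample represents the IOPS rate for its whole interval; the sample values and the 300-second interval are invented for illustration and are not OCUM's actual collection interval.

```python
def total_from_average(samples, interval_seconds):
    """Total operations computed as average IOPS * seconds in the period."""
    average = sum(samples) / len(samples)
    period_seconds = len(samples) * interval_seconds
    return average * period_seconds


def total_from_samples(samples, interval_seconds):
    """Total operations computed by summing each sample over its own interval."""
    return sum(rate * interval_seconds for rate in samples)


# Hypothetical IOPS readings, one per measurement point.
samples = [1000, 2000, 1500, 500]
interval = 300  # assumed seconds between measurement points

print(total_from_average(samples, interval))
print(total_from_samples(samples, interval))
```

Both functions return the same total, so nothing is lost by starting from the average as long as it covers exactly the period you care about.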
I normally prefer to investigate performance-related events through Grafana (NAbox) graphs. AIQUM is very nice, though. I think it's mostly a matter of personal preference, because once you get used to AIQUM, you do almost the same tasks when analyzing graphs.
If you have never used NAbox, you should give it a try. It feels faster and more fluid for pinpointing IOPS/latency/throughput events using graphs and different time frames.