data.coverage

data.coverage analyzes available data for a given time range and generates a report of the estimated coverage. The estimated coverage number is the number of defined times vs the expected number based on the mode of the time intervals seen as a percentage. For example, suppose you get output for the variable BaB_A12, which reads 49.56/60. The first number represents the percentage of time that the instrument is getting data for this variable. The second number represents periodicity of the instrument retrieving data in seconds.

Command Line Usage

data.coverage [--interval=seconds|monthly|quarter|day] 
                [--records=R11a,R21a,...] 
                [--stdin] [--source=raw] [--csv]
                [station] [start end] [var1 ... varN]

Arguments

start and end

The time specifiers for the data to be retrieved. Start is inclusive while end is exclusive, so all data contained within the half open interval [start,end) will be returned. Any convertible time format is accepted. Absent when reading from standard input.

station

The station identifier code. For example 'brw'. Case insensitive.

var1 ... varN

List of variables to inspect, one specifier per argument (space separated). Optional if records are set (below), if absent in that case then defaulting to all variables contained in those records. Accepts regular expression matches, wrapped in “^variable$”. NOTE: Regular expression matched variables tend to be very slow to query if records are not set.

--interval=seconds|month|week|quarter|day

Interval to report on, by default one month. If set to zero report over the whole time range. The special values “month”, “week”, “quarter” and “day” are also recognized. If defined as a number it is in seconds.

--records=R11a,R21a,...

List of records to query. When present only these records are queried even if the variables list above would match more or would not match them at all.

--stdin

Read from standard input, records and variable specifications are optional if present. If records or variables are set then they are used to filter the output.

--source=raw

Set the archive to query from when not reading from standard input, defaulting to raw.

--csv

Output data in CSV format instead of space separated.

Example Usage

All missing neph data

data.coverage --records=S11a sgp 2008.0 2009.0

Only neph scatterings on a weekly basis

data.coverage --records=S11a --interval=week sgp 2008.0 2009.0 'Bs.*'

All scatterings quarterly

data.coverage --interval=quarter sgp 2008.0 2009.0 'Bs.*'

Missing averaged data

data.avg sgp S11a 2008.0 2009.0 | data.coverage --interval=week

Quarter coverage for all stations

for stn in $STATIONS; do data.coverage --interval=quarter --source=clean --csv $stn 1976.0 2009.5 'BsG.*' 'Ba[OLK].*' 'N_.*'; done

Note that is is very time consuming and will report coverage for stations that may not be verified in the DB system.

Information

Data