Databases (db)#

class rubin_sim.maf.db.DisplayRow(**kwargs)[source]#

Bases: Base

Define contents and format of the displays table.

(Table to list the display properties for each metric.)

class rubin_sim.maf.db.MetricRow(**kwargs)[source]#

Bases: Base

Define contents and format of metric list table.

(Table to list all metrics, their info_label, and their output data files).

class rubin_sim.maf.db.PlotRow(**kwargs)[source]#

Bases: Base

Define contents and format of plot list table.

(Table to list all plots, link them to relevant metrics in MetricList, and provide info on filename).

class rubin_sim.maf.db.ResultsDb(out_dir=None, database=None, verbose=False, timeout=180)[source]#

Bases: object

ResultsDb is a sqlite database containing information on the metrics run via MAF, the plots created, the display information (such as captions), and any summary statistics output.

static build_summary_name(metric_name, metric_info_label, slicer_name, summary_stat_name=None)[source]#: Standardize a complete summary metric name, combining the metric + slicer + summary + info_label.

close()[source]#: Close connection to database.

downgrade_database()[source]#: Downgrade resultsDb to work with v0.10<MAF< v1.0 There is also a ‘upgradeDatabase’ to update to the newer style with ‘metric_info_label.

get_all_metric_ids()[source]#: Return a list of all metric_ids.

get_metric_data_files(metric_id=None)[source]#: Get the metric data filenames for all or a single metric. Returns a list.

get_metric_display_info(metric_id=None)[source]#

Get the contents of the metrics and displays table, together with the ‘basemetric_name’ (optionally, for metric_id list). Returns a numpy array of the metric information + display information.

An underlying assumption here is that all metrics have some display info. This may not always be the case.

get_metric_id(metric_name, slicer_name=None, metric_info_label=None, run_name=None)[source]#

Find metric bundle Ids from the metric table.

Parameters:

metric_name (str) – Name of the Metric
slicer_name (str, opt) – Name of the Slicer to match
metric_info_label (str, opt) – Metadata value to match
run_name (str, opt) – Name of the simulation (run_name) to match

Returns:

metric_id – List of matching metric_ids

Return type:

list of int

get_metric_id_like(metric_name_like=None, slicer_name_like=None, metric_info_label_like=None, run_name=None)[source]#

Find metric bundle Ids from the metric table, but search for names ‘like’ the values. (instead of a strict match from get_metric_id).

Parameters:

metric_name (str) – Partial name of the Metric
slicer_name (str, opt) – Partial name of the Slicer to match
metric_info_label (str, opt) – Partial info_label value to match
run_name (str, opt) – Name of the simulation (run_name) to match (exact)

Returns:

metric_id – List of matching metric_ids

Return type:

list of int

get_metric_info(metric_id=None, with_sim_name=False)[source]#

Get the simple metric info, without display information.

Parameters:

metric_id (int list, or None) – If None, data for all metrics are returned. Otherwise, only data corresponding to the supplied metric ID or IDs are returned
with_sim_name (bool) – If True, include the run name in the fields returned

Returns:

plotFiles –

metric_id: The metric ID
metric_name: The metric name
basemetric_names: The base metric names
slicer_name: The name of the slicer used in the bundleGroup
sql_constraint: The full sql constraint used in the bundleGroup
metric_info_label: Metadata extracted from the sql_constraint (usually the filter)
metric_datafile: The file name of the file with the metric data itself.
run_name: The name of the run plotted (if with_sim_name was True)

Return type:

numpy.recarray

get_plot_files(metric_id=None, with_sim_name=False)[source]#

Find the metric_id, name, info_label, and all plot info (optionally for metric_id list).

Parameters:

metric_id (int list, or None) – If None, plots for all metrics are returned. Otherwise, only plots corresponding to the supplied metric ID or IDs are returned
with_sim_name (bool) – If True, include the run name in the fields returned

Returns:

plotFiles –

metric_id: The metric ID
metric_name: The metric name
metric_info_label: info_label extracted from the sql constraint (usually the filter)
plot_type: The plot type
plot_file: The full plot file (pdf by default)
thumb_file: A plot thumbnail file name (png)
run_name: The name of the run plotted (if with_sim_name was True)

Return type:

numpy.recarray

get_run_name()[source]#: Return a list of the run_names for the metric bundles in the database.

get_summary_stats(metric_id=None, summary_name=None, summary_name_like=None, summary_name_notlike=None, with_sim_name=False)[source]#

Get the summary stats (optionally for metric_id list). Optionally, also specify the summary metric name. Returns a numpy array of the metric information + summary statistic information.

Parameters:

metric_id (int or list of int) – Metric bundle Ids to match from the metric table
summary_name (str, opt) – Match this summary statistic name exactly.
summary_name_like (str, opt) – Partial match to this summary statistic name.
summary_name_notlike (str, opt) – Exclude summary statistics with summary names like this.
with_sim_name (bool, opt) – If True, add the run_name to the returned numpy recarray.

Returns:

summarystats – Numpy recarray containing the selected summary stat information.

Return type:

np.recarray

open()[source]#: Open connection to database

update_database()[source]#

Update the results_db from ‘metricMetaData’ to ‘metric_info_label’ and now also changing the camel case to snake case (metricId to metric_id, etc.).

This updates results_db to work with the current version of MAF, including RunComparison and showMaf. There is also a ‘downgrade_database’ to revert to the older style with ‘metricMetadata.

update_display(metric_id, display_dict, overwrite=True)[source]#

Add a row to or update a row in the displays table.

Parameters:

metric_id (int) – The metricID for this metric bundle in the metrics table
display_dict (dict) – Dictionary containing the display info (group/subgroup/order/caption)
overwrite (bool, opt) – Replaces existing row with same metric_id if overwrite is True.

update_metric(metric_name, slicer_name, run_name, sql_constraint, metric_info_label, metric_datafile)[source]#

Add a row to or update a row in the metrics table.

Parameters:

metric_name (str) – Name of the Metric
slicer_name (str) – Name of the Slicer
run_name (str) – Name of the simulation (run_name, simName, run_name..)
sql_constraint (str) – sql_constraint relevant for the metric bundle
metric_info_label (str) – Information associated with the metric. Could be derived from the sql_constraint or could be a more descriptive version, specified by the user.
metric_datafile (str) – The data file the metric bundle output is stored in.

Returns:

metric_id (int) – The Id number of this metric in the metrics table.
If same metric (same metric_name, slicer_name, run_name,
sql_constraint, infoLabel) already exists, it does nothing.

update_plot(metric_id, plot_type, plot_file, overwrite=False)[source]#

Add a row to or update a row in the plot table.

Parameters:

metric_id (int) – The metric Id of this metric bundle in the metrics table
plot_type (str) – The type of this plot (oneDbinned data, healpix map, etc.)
plot_file (str) – The filename for this plot
overwrite (bool) – If True, replaces existing row. If False, an additional plot is added to the output (e.g. with a different range of color values, etc).

update_summary_stat(metric_id, summary_name, summary_value, ntry=3, pause_time=100)[source]#

Add a row to or update a row in the summary statistic table.

Most summary statistics will be a simple name (string) + value (float) pair. For special summary statistics which must return multiple values, the base name can be provided as ‘name’, together with a np.ndarray as ‘value’, where the array also has ‘name’ and ‘value’ columns (and each name/value pair is then saved as a summary statistic associated with this same metric_id).

Parameters:

metric_id (int) – The metric Id of this metric bundle
summary_name (str) – The name of this summary statistic
summary_value (: float or np.ndarray) – The value for this summary statistic. If this is a np.ndarray, then it should also have ‘name’ and ‘value’ columns to save each value to rows in the summary stats.
ntry (int, opt) – The number of times to retry if database is locked.
pause_time (int, opt) – Time to wait until trying again.

class rubin_sim.maf.db.SummaryStatRow(**kwargs)[source]#

Bases: Base

Define contents and format of the summary statistics table.

(Table to list and link summary stats to relevant metrics in MetricList, and provide summary stat name, value and potentially a comment).

class rubin_sim.maf.db.TrackingDb(database=None, trackingDbverbose=False)[source]#

Bases: object

Sqlite database to track MAF output runs and their locations, for show_maf

add_run(run_group=None, run_name=None, run_comment=None, run_version=None, run_date=None, maf_comment=None, maf_version=None, maf_date=None, maf_dir=None, db_file=None, maf_run_id=None)[source]#

Add a run to the tracking database.

Parameters:

run_group (str, optional) – Set a name to group this run with (eg. “Tier 1, 2016”).
run_name (str, optional) – Set a name for the opsim run.
run_comment (str, optional) – Set a comment describing the opsim run.
run_version (str, optional) – Set the version of opsim.
run_date (str, optional) – Set the date the opsim run was created.
maf_comment (str, optional) – Set a comment to describe the MAF analysis.
maf_version (str, optional) – Set the version of MAF used for analysis.
maf_date (str, optional) – Set the date the MAF analysis was run.
maf_dir (str, optional) – The relative path to the MAF directory. Will be converted to a relative path if absolute.
db_file (str, optional) – The relative path to the Opsim SQLite database file.
maf_run_id (int, optional) – The maf_run_id to assign to this record in the database (note this is a primary key!). If this run (ie the maf_dir) exists in the database already, this will be ignored.

Returns:

maf_run_id – The maf_run_id stored in the database.

Return type:

int

delRun(runId)[source]#: Remove a run from the tracking database.

rubin_sim.maf.db.add_run_to_database(maf_dir, tracking_db_file, run_group=None, run_name=None, run_comment=None, maf_comment=None, db_file=None, maf_version=None, maf_date=None, sched_version=None, sched_date=None, skip_extras=False)[source]#

Adds information about a MAF analysis run to a MAF tracking database.

Parameters:

maf_dir (str) – Path to the directory where the MAF results are located.
tracking_db_file (str) – Full filename (+path) to the tracking database to use.
run_group (str, optional) – Name to use to group this run with other opsim runs.
run_name (str, optional) – Name of the opsim run.
run_comment (str, optional) – Comment about the opsim run.
run_version (str, optional) – Value to use for the opsim version information.
maf_comment (str, optional) – Comment about the MAF analysis.
db_file (str, optional) – Relative path + name of the opsim database file.