Public Member Functions
def	__init__ (self, quantity_name, unit=None, outlier_z_score=None, absolute=False, contact='', plot_name=None, plot_title=None, plot_name_prefix='', plot_name_postfix='', plot_title_postfix='', referenceFileName=None # if set binnings of plots will be read from corresponding histograms)

def	analyse (self, truths, estimates, variances=None, auxiliaries={}, which_plots=None, is_expert=None)

def	contact (self)

def	contact (self, contact)

def	write (self, tDirectory=None)

Public Attributes
	quantity_name
	name of the quantity the analysis is performed on

	unit
	unit the quantity is given in

	outlier_z_score
	the outlier score defines in terms of how many std deviations a data point is considered as an outlier

	absolute
	if true only the absolute value is compared

	plot_name
	name of the plot

	plot_title
	title of the plot

	plot_name_prefix
	prefix to be prepended to the plot name

	plot_name_postfix
	post fix to be append after the plot name

	plot_title_postfix
	postfix to be appended after the title

	plots
	dictionary to store the plots

	referenceFileName
	name of the reference file, if set the binnings of plots will be read from the corresponding object in that file

	contact
	Forward the contract to all plots by reassigning the contact.

Static Public Attributes
float	default_outlier_z_score = 5.0
	default outlier z score

str	default_plot_name = "{plot_name_prefix}_{subplot_name}{plot_name_postfix}"
	default plot name

str	default_plot_title = "{subplot_title} of {quantity_name}{plot_title_postfix}"
	default plot title

list	default_which_plots
	default list of plots to be created in this analysis

bool	default_is_expert = True
	if true the plots created here are declared as expert plots in the validation

Protected Attributes
	_contact
	contact information

Detailed Description

Performs a comparison of an estimated quantity to their truths by generating standardized validation plots.

Definition at line 26 of file pull.py.

Constructor & Destructor Documentation

◆ init()

def __init__	(	self,
		quantity_name,
		unit = `None`,
		outlier_z_score = `None`,
		absolute = `False`,
		contact = `''`,
		plot_name = `None`,
		plot_title = `None`,
		plot_name_prefix = `''`,
		plot_name_postfix = `''`,
		plot_title_postfix = `''`,
		referenceFileName = `None # if set binnings of plots will be read from corresponding histograms`
	)

Performs a comparison of an estimated quantity to their truths by generating standardized validation plots.

Definition at line 53 of file pull.py.

    ):
        """Performs a comparison of an estimated quantity to their truths by generating standardized validation plots."""
 
        self.quantity_name = quantity_name
        
        self.unit = unit or get_unit(quantity_name)
 
        if outlier_z_score is None:
            self.outlier_z_score = self.default_outlier_z_score
        else:
            self.outlier_z_score = outlier_z_score
 
        self.absolute = absolute
 
        self.plot_name = plot_name
        
        self.plot_title = plot_title
 
        self.plot_name_prefix = plot_name_prefix or root_save_name(quantity_name)
        
        self.plot_name_postfix = plot_name_postfix
        
        self.plot_title_postfix = plot_title_postfix
 
        self._contact = contact
        
        self.plots = collections.OrderedDict()
 
        self.referenceFileName = referenceFileName

Member Function Documentation

◆ analyse()

def analyse	(	self,
		truths,
		estimates,
		variances = `None`,
		auxiliaries = `{}`,
		which_plots = `None`,
		is_expert = `None`
	)

Compares the concrete estimate to the truth and generates plots of the estimates, residuals, pulls and p-values.
Close indicates if the figure shall be closed after they are saved.

Parameters
----------
truths : array_like(float)
    Sample of the true values
estimates : array_like(float)
    Corresponding estimations
variances : array_like(float), optional
    Corresponding variance estimations
auxiliaries : Dict[name, array_like(float)], optional
    Auxiliary variable to show distribution of residuals and pull as function
selected_plots : list(str), optional
    List of analysis plots to be generated. All if not given.
    Currently valid names are
    truths, estimates, diag_profile, diag_scatter, residuals,
    sigmas, pulls, p_values

Definition at line 103 of file pull.py.

    ):
        """Compares the concrete estimate to the truth and generates plots of the estimates, residuals, pulls and p-values.
        Close indicates if the figure shall be closed after they are saved.
 
        Parameters
        ----------
        truths : array_like(float)
            Sample of the true values
        estimates : array_like(float)
            Corresponding estimations
        variances : array_like(float), optional
            Corresponding variance estimations
        auxiliaries : Dict[name, array_like(float)], optional
            Auxiliary variable to show distribution of residuals and pull as function
        selected_plots : list(str), optional
            List of analysis plots to be generated. All if not given.
            Currently valid names are
            truths, estimates, diag_profile, diag_scatter, residuals,
            sigmas, pulls, p_values
        """
 
        if is_expert is None:
            is_expert = self.default_is_expert
 
        if which_plots is None:
            which_plots = self.default_which_plots
 
        quantity_name = self.quantity_name
 
        axis_label = compose_axis_label(quantity_name, self.unit)
 
        plot_name_prefix = self.plot_name_prefix
        outlier_z_score = self.outlier_z_score
 
        absolute = self.absolute
        # Compare only the absolute value by taking the absolute of the curvature truth
        # and flip the sign of the estimate
        if absolute:
            absolute_truths = truths.copy()
            absolute_estimates = estimates.copy()
 
            flip_sign_for = truths < 0
            absolute_truths[flip_sign_for] = -truths[flip_sign_for]
            absolute_estimates[flip_sign_for] = -estimates[flip_sign_for]
 
            truths = absolute_truths
            estimates = absolute_estimates
 
            quantity_name = 'absolute ' + quantity_name
 
        residuals = estimates - truths
 
        if variances is not None:
            sigmas = np.sqrt(variances)
            pulls = np.divide(residuals, sigmas)
            p_values = 1.0 - erf(np.abs(pulls))
 
        plot_name = self.plot_name
        if plot_name is None:
            plot_name = self.default_plot_name
 
        plot_name = formatter.format(plot_name,
                                     quantity_name=quantity_name,
                                     plot_name_prefix=plot_name_prefix,
                                     plot_name_postfix=self.plot_name_postfix)
 
        plot_title = self.plot_title
        if plot_title is None:
            plot_title = self.default_plot_title
 
        plot_title = formatter.format(plot_title,
                                      quantity_name=quantity_name,
                                      plot_title_postfix=self.plot_title_postfix)
 
        # Truths #
        
 
        if "truths" in which_plots:
            # Distribution of truths
            truths_hist_name = formatter.format(plot_name, subplot_name="truths")
            truths_hist = ValidationPlot(truths_hist_name, self.referenceFileName)
            truths_hist.hist(truths,
                             outlier_z_score=outlier_z_score,
                             is_expert=is_expert)
            truths_hist.xlabel = axis_label
            truths_hist.title = formatter.format(plot_title, subplot_title='True distribution')
 
            self.plots['truths'] = truths_hist
 
        # Estimates #
        
 
        if "estimates" in which_plots:
            # Distribution of estimates
            estimates_hist_name = formatter.format(plot_name, subplot_name="estimates")
            estimates_hist = ValidationPlot(estimates_hist_name, self.referenceFileName)
            estimates_hist.hist(estimates,
                                outlier_z_score=outlier_z_score,
                                is_expert=is_expert)
            estimates_hist.xlabel = axis_label
            estimates_hist.title = formatter.format(plot_title, subplot_title='Estimates distribution')
 
            self.plots['estimates'] = estimates_hist
 
        # Diagonal plots #
        
        if "diag_scatter" in which_plots:
            # Estimates versus truths scatter plot
            estimates_by_truths_scatter_name = formatter.format(plot_name, subplot_name="diag_scatter")
            estimates_by_truths_scatter = ValidationPlot(estimates_by_truths_scatter_name, self.referenceFileName)
            estimates_by_truths_scatter.scatter(truths,
                                                estimates,
                                                outlier_z_score=outlier_z_score,
                                                is_expert=is_expert)
            estimates_by_truths_scatter.xlabel = 'True ' + axis_label
            estimates_by_truths_scatter.ylabel = 'Estimated ' + axis_label
            estimates_by_truths_scatter.title = formatter.format(plot_title, subplot_title='Diagonal scatter plot')
 
            self.plots['diag_scatter'] = estimates_by_truths_scatter
 
        if "diag_profile" in which_plots:
            # Estimates versus truths profile plot
            estimates_by_truths_profile_name = formatter.format(plot_name, subplot_name="diag_profile")
            estimates_by_truths_profile = ValidationPlot(estimates_by_truths_profile_name, self.referenceFileName)
 
            # Fill residuals and correct afterwards
            estimates_by_truths_profile.profile(truths,
                                                estimates - truths,
                                                outlier_z_score=outlier_z_score,
                                                gaus_z_score=4,
                                                is_expert=is_expert)
 
            # Correct with TF1 - only works because the Gaussian fit is active.
            hist = estimates_by_truths_profile.histograms[0]
            GetBinContent = hist.GetBinContent
            GetBinCenter = hist.GetBinCenter
            SetBinContent = hist.SetBinContent
            for i_bin in range(hist.GetNbinsX() + 2):
                residual = GetBinContent(i_bin)
                truth = GetBinCenter(i_bin)
                if residual != 0:
                    SetBinContent(i_bin, residual + truth)
 
            # Reset maximum and minimum
            estimates_by_truths_profile.histograms[0].SetMaximum()
            estimates_by_truths_profile.histograms[0].SetMinimum()
 
            estimates_by_truths_profile.xlabel = 'True ' + axis_label
            estimates_by_truths_profile.ylabel = 'Estimated ' + axis_label
 
            estimates_by_truths_profile.title = formatter.format(plot_title, subplot_title='Diagonal profile')
            estimates_by_truths_profile.fit_diag()
 
            self.plots['diag_profile'] = estimates_by_truths_profile
 
        # Residuals #
        
        if "residuals" in which_plots:
            # Distribution of the residuals
            residuals_hist_name = formatter.format(plot_name, subplot_name="residuals")
            residuals_hist = ValidationPlot(residuals_hist_name, self.referenceFileName)
            residuals_hist.hist(residuals,
                                outlier_z_score=outlier_z_score,
                                is_expert=is_expert)
            residuals_hist.xlabel = compose_axis_label("#Delta " + quantity_name + " (estimate - truth)", self.unit)
            residuals_hist.title = formatter.format(plot_title, subplot_title='Residual distribution')
 
            self.plots['residuals'] = residuals_hist
 
        # Variances #
        
        if variances is not None and "sigmas" in which_plots:
 
            # Distribution of sigmas
            sigmas_hist_name = formatter.format(plot_name, subplot_name="sigmas")
            sigmas_hist = ValidationPlot(sigmas_hist_name, self.referenceFileName)
            sigmas_hist.hist(sigmas,
                             lower_bound=0,
                             outlier_z_score=outlier_z_score,
                             is_expert=is_expert)
            sigmas_hist.xlabel = compose_axis_label("#sigma (" + quantity_name + ')', self.unit)
            sigmas_hist.title = formatter.format(plot_title, subplot_title='Estimated variance distribution')
 
            self.plots['sigmas'] = sigmas_hist
 
        # Pulls #
        
        if variances is not None and "pulls" in which_plots:
 
            # Distribution of pulls
            pulls_hist_name = formatter.format(plot_name, subplot_name="pulls")
            pulls_hist = ValidationPlot(pulls_hist_name, self.referenceFileName)
            pulls_hist.hist(pulls, outlier_z_score=outlier_z_score, is_expert=is_expert)
            pulls_hist.xlabel = "pull (" + quantity_name + ")"
            pulls_hist.title = formatter.format(plot_title, subplot_title='Pull distribution')
            pulls_hist.fit_gaus(z_score=1)
 
            self.plots['pulls'] = pulls_hist
 
        # P-Values #
        
        if variances is not None and "p_values" in which_plots:
 
            # Distribution of p_values
            p_values_hist_name = formatter.format(plot_name, subplot_name="p-values")
            p_values_hist = ValidationPlot(p_values_hist_name, self.referenceFileName)
            p_values_hist.hist(p_values, lower_bound=0, upper_bound=1, is_expert=is_expert)
            p_values_hist.xlabel = "p-value (" + quantity_name + ")"
            p_values_hist.title = formatter.format(plot_title, subplot_title='P-value distribution')
            p_values_hist.fit_const()
 
            self.plots['p_values'] = p_values_hist
 
        # Auxialliary variables #
        # ##################### #
        for aux_name, aux_values in auxiliaries.items():
            if "aux_residual_hist2d" in which_plots or "aux" in which_plots:
                # Distribution of the residuals over auxiliary variable
                aux_residuals_hist2d_name = formatter.format(plot_name,
                                                             subplot_name=f"residuals over {aux_name}")
                aux_residuals_hist2d = ValidationPlot(aux_residuals_hist2d_name, self.referenceFileName)
                aux_residuals_hist2d.hist2d(aux_values,
                                            residuals,
                                            outlier_z_score=outlier_z_score,
                                            allow_discrete=True,
                                            is_expert=is_expert)
                aux_residuals_hist2d.xlabel = compose_axis_label(aux_name)
                aux_residuals_hist2d.ylabel = compose_axis_label("#Delta " + quantity_name + " (estimate - truth)", self.unit)
                aux_residuals_hist2d.title = formatter.format(plot_title,
                                                              subplot_title=f'Residual distribution over {aux_name}')
 
                self.plots['aux_residuals_hist2d_' + aux_name] = aux_residuals_hist2d
 
            if "aux_residual_profile" in which_plots or "aux" in which_plots:
                # Distribution of the residuals over auxiliary variable
                aux_residuals_profile_name = formatter.format(plot_name,
                                                              subplot_name=f"residuals profile over {aux_name}")
                aux_residuals_profile = ValidationPlot(aux_residuals_profile_name, self.referenceFileName)
                aux_residuals_profile.profile(aux_values,
                                              residuals,
                                              outlier_z_score=outlier_z_score,
                                              gaus_z_score=1.5,
                                              allow_discrete=True,
                                              is_expert=is_expert,
                                              )
                aux_residuals_profile.xlabel = compose_axis_label(aux_name)
                aux_residuals_profile.ylabel = compose_axis_label("#Delta " + quantity_name + " (estimate - truth)", self.unit)
                aux_residuals_profile.title = formatter.format(plot_title,
                                                               subplot_title=f'Residual profile over {aux_name}')
 
                self.plots['aux_residuals_profile_' + aux_name] = aux_residuals_profile
 
            if variances is not None and ("aux_pull_hist2d" in which_plots or "aux" in which_plots):
                # Distribution of the pulls over auxiliary variable
                aux_pulls_hist2d_name = formatter.format(plot_name,
                                                         subplot_name=f"pulls over {aux_name}")
                aux_pulls_hist2d = ValidationPlot(aux_pulls_hist2d_name, self.referenceFileName)
                aux_pulls_hist2d.hist2d(aux_values,
                                        pulls,
                                        outlier_z_score=outlier_z_score,
                                        allow_discrete=True,
                                        is_expert=is_expert)
                aux_pulls_hist2d.xlabel = compose_axis_label(aux_name)
                aux_pulls_hist2d.ylabel = "pull (" + quantity_name + ")"
                aux_pulls_hist2d.title = formatter.format(plot_title,
                                                          subplot_title=f'Pull scatter over {aux_name}')
 
                self.plots['aux_pulls_hist2d_' + aux_name] = aux_pulls_hist2d
 
            if variances is not None and ("aux_pull_profile" in which_plots or "aux" in which_plots):
                # Distribution of the pulls over auxiliary variable
                aux_pulls_profile_name = formatter.format(plot_name,
                                                          subplot_name=f"pull profile over {aux_name}")
                aux_pulls_profile = ValidationPlot(aux_pulls_profile_name, self.referenceFileName)
                aux_pulls_profile.profile(aux_values,
                                          pulls,
                                          outlier_z_score=outlier_z_score,
                                          gaus_z_score=1.5,
                                          allow_discrete=True,
                                          is_expert=is_expert)
                aux_pulls_profile.xlabel = compose_axis_label(aux_name)
                aux_pulls_profile.ylabel = "pull (" + quantity_name + ")"
                aux_pulls_profile.title = formatter.format(plot_title,
                                                           subplot_title=f'Pull profile over {aux_name}')
 
                self.plots['aux_pulls_profile_' + aux_name] = aux_pulls_profile
 
        
        self.contact = self.contact
 

◆ contact() [1/2]

def contact ( self )

 returns the contact

Definition at line 402 of file pull.py.

    def contact(self):
        """ returns the contact """
        return self._contact
 

◆ contact() [2/2]

def contact	(	self,
		contact
	)

  sets the contact

  parameters:
  contact: new contact information

Definition at line 407 of file pull.py.

    def contact(self, contact):
        """
          sets the contact
 
          parameters:
          contact: new contact information
        """
        self._contact = contact
        for validation_plot in list(self.plots.values()):
            validation_plot.contact = contact
 

◆ write()

def write	(	self,
		tDirectory = `None`
	)

 Write all validation plot to the given Root directory
    parameters:
    tDirectory - the root directory were to write to

Definition at line 418 of file pull.py.

    def write(self, tDirectory=None):
        """ Write all validation plot to the given Root directory
            parameters:
            tDirectory - the root directory were to write to
        """
        for validation_plot in list(self.plots.values()):
            validation_plot.write(tDirectory)

Member Data Documentation

◆ _contact

_contact

protected

contact information

Definition at line 96 of file pull.py.

◆ absolute

absolute

if true only the absolute value is compared

Definition at line 81 of file pull.py.

◆ contact

contact

Forward the contract to all plots by reassigning the contact.

Definition at line 399 of file pull.py.

◆ default_is_expert

bool default_is_expert = True

static

if true the plots created here are declared as expert plots in the validation

Definition at line 51 of file pull.py.

◆ default_outlier_z_score

float default_outlier_z_score = 5.0

static

default outlier z score

Definition at line 29 of file pull.py.

◆ default_plot_name

str default_plot_name = "{plot_name_prefix}_{subplot_name}{plot_name_postfix}"

static

default plot name

Definition at line 31 of file pull.py.

◆ default_plot_title

str default_plot_title = "{subplot_title} of {quantity_name}{plot_title_postfix}"

static

default plot title

Definition at line 33 of file pull.py.

◆ default_which_plots

list default_which_plots

static

Initial value:

=  [
        "truths",
        "estimates",
        "diag_profile",
        "diag_scatter",
        "residuals",
        "sigmas",
        "pulls",
        "p_values",
        "aux_residual_hist2d",
        "aux_residual_profile",
        "aux_pull_hist2d",
        "aux_pull_profile",
    ]

default list of plots to be created in this analysis

Definition at line 35 of file pull.py.

◆ outlier_z_score

outlier_z_score

the outlier score defines in terms of how many std deviations a data point is considered as an outlier

Definition at line 76 of file pull.py.

◆ plot_name

plot_name

name of the plot

Definition at line 84 of file pull.py.

◆ plot_name_postfix

plot_name_postfix

post fix to be append after the plot name

Definition at line 91 of file pull.py.

◆ plot_name_prefix

plot_name_prefix

prefix to be prepended to the plot name

Definition at line 89 of file pull.py.

◆ plot_title

plot_title

title of the plot

Definition at line 86 of file pull.py.

◆ plot_title_postfix

plot_title_postfix

postfix to be appended after the title

Definition at line 93 of file pull.py.

◆ plots

plots

dictionary to store the plots

Definition at line 98 of file pull.py.

◆ quantity_name

quantity_name

name of the quantity the analysis is performed on

Definition at line 70 of file pull.py.

◆ referenceFileName

referenceFileName

name of the reference file, if set the binnings of plots will be read from the corresponding object in that file

Definition at line 101 of file pull.py.

◆ unit

unit

unit the quantity is given in

Definition at line 72 of file pull.py.

The documentation for this class was generated from the following file:

tracking/scripts/tracking/validation/pull.py

Public Member Functions

Public Attributes

Static Public Attributes

Protected Attributes

Detailed Description

Constructor & Destructor Documentation

◆ __init__()

Member Function Documentation

◆ analyse()

◆ contact() [1/2]

◆ contact() [2/2]

◆ write()

Member Data Documentation

◆ _contact

◆ absolute

◆ contact

◆ default_is_expert

◆ default_outlier_z_score

◆ default_plot_name

◆ default_plot_title

◆ default_which_plots

◆ outlier_z_score

◆ plot_name

◆ plot_name_postfix

◆ plot_name_prefix

◆ plot_title

◆ plot_title_postfix

◆ plots

◆ quantity_name

◆ referenceFileName

◆ unit

◆ init()