Public Member Functions
def	__init__ (self, root, n_files=None, samples=None, features=[], edge_features=[], global_features=[], normalize=None, **kwargs)

def	processed_file_names (self)

def	process (self)

Public Attributes
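	root
	Root path of the dataset.

	normalize
	Whether to normalize input features.

	n_files
	Number of files to load.

	node_features
	List of node feature names.

	edge_features
	List of edge feature names.

	global_features
	List of global feature names.

	samples
	Number of events to load.

	slices
	Data and slices loaded from the processed file.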

Detailed Description

Dataset handler that converts Belle II data into a PyTorch Geometric ``InMemoryDataset``.

Every ROOT file must contain a tree named ``Tree``,
and every node feature must be named ``feat_FEATNAME``.
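
As a minimal sketch of the naming convention above, the helper below picks out the names that follow the ``feat_FEATNAME`` pattern from a list of names. The helper ``node_feature_names`` and the example names are hypothetical and only illustrate the prefix rule; this page does not state how the class itself maps the ``features`` argument onto the tree.

```python
NODE_PREFIX = "feat_"  # node feature prefix from the convention above


def node_feature_names(branches):
    """Return FEATNAME for every name of the form feat_FEATNAME."""
    return [b[len(NODE_PREFIX):] for b in branches if b.startswith(NODE_PREFIX)]


# Hypothetical names, for illustration only
branches = ["feat_px", "feat_py", "event_id", "feat_charge"]
print(node_feature_names(branches))
```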

.. note:: The files under root must follow the structure ``root/**/<file_name>.root``,
    with a separate root path for training and validation.
    The ``**/`` handles subdirectories, e.g. ``sub00``.
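
The layout described in the note can be explored with a recursive glob. The sketch below uses only the standard library; the helper ``find_root_files`` and the temporary files are hypothetical, and this is one possible way to traverse such a layout rather than the code used by the class.

```python
import tempfile
from pathlib import Path


def find_root_files(root):
    """Collect every *.root file under root, including subdirectories."""
    return sorted(Path(root).glob("**/*.root"))


with tempfile.TemporaryDirectory() as tmp:
    # Hypothetical layout: one file at the top level, one in a subdirectory
    (Path(tmp) / "sub00").mkdir()
    (Path(tmp) / "a.root").touch()
    (Path(tmp) / "sub00" / "b.root").touch()
    (Path(tmp) / "sub00" / "notes.txt").touch()  # ignored: wrong extension
    found = [p.relative_to(tmp).as_posix() for p in find_root_files(tmp)]
    print(found)
```

The ``**`` component matches zero or more directory levels, so files directly under the root path and files in subdirectories are both found.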

Args:
    root (str): Path to ROOT files.
    n_files (int): Load only ``n_files`` files.
    samples (int): Load only ``samples`` events.
    features (list): List of node feature names; must contain at least one entry.
    edge_features (list): List of edge feature names.
    global_features (list): List of global feature names.
    normalize (bool): Whether to normalize input features.
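
A possible way to call the constructor with these arguments is sketched below. The import path is an assumption inferred from the file location ``analysis/scripts/grafei/model/geometric_datasets.py``, the feature names and numeric values are hypothetical, and the import is deferred into the function because the class is only available inside a Belle II software environment.

```python
# Hypothetical argument values, for illustration only
dataset_kwargs = dict(
    n_files=2,                     # load only two files
    samples=1000,                  # load only 1000 events
    features=["px", "py", "pz"],   # hypothetical node feature names
    edge_features=[],
    global_features=[],
    normalize=True,
)


def make_dataset(root, **kwargs):
    """Build a GraphDataSet for the given root path."""
    # Assumed import path; requires a Belle II software environment
    from grafei.model.geometric_datasets import GraphDataSet

    return GraphDataSet(root=root, **kwargs)


# Usage (not executed here): one dataset per root path
# train_set = make_dataset("path/to/train", **dataset_kwargs)
# val_set = make_dataset("path/to/val", **dataset_kwargs)
```

Note that ``features`` must be a non-empty list, otherwise the assertions in the constructor fail.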

Definition at line 258 of file geometric_datasets.py.

Constructor & Destructor Documentation

◆ __init__()

def __init__ ( self, root, n_files = None, samples = None, features = [], edge_features = [], global_features = [], normalize = None, **kwargs )

Initialization.

Definition at line 279 of file geometric_datasets.py.

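     # Excerpt from the class body; relies on `from pathlib import Path` and `import torch`.
     def __init__(
         self,
         root,
         n_files=None,
         samples=None,
         features=[],
         edge_features=[],
         global_features=[],
         normalize=None,
         **kwargs
     ):
         """
         Initialization.
         """
         assert isinstance(
             features, list
         ), f'Argument "features" must be a list and not {type(features)}'
         assert len(features) > 0, "You need to use at least one node feature"

         # Root path
         self.root = Path(root)
         # Whether to normalize input features
         self.normalize = normalize
         # Number of files
         self.n_files = n_files
         # Node features
         self.node_features = features
         # Edge features
         self.edge_features = edge_features
         # Global features
         self.global_features = global_features
         # Samples
         self.samples = samples

         # Delete previously processed files so that process() runs again
         file_path = Path(self.root, "processed")
         files = list(file_path.glob("*.pt"))
         for f in files:
             f.unlink(missing_ok=True)

         # Needs to be called after having assigned all attributes
         super().__init__(root, None, None, None)

         # Data and slices written by process()
         self.data, self.slices = torch.load(self.processed_paths[0])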

Member Function Documentation

◆ process()

def process ( self )

Processes the data into graph objects and stores them in ``root/processed/processed_data.pt``,
where the root path differs between training and validation.

Called internally by PyTorch Geometric.

Definition at line 334 of file geometric_datasets.py.
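
The processed file holds the graphs in collated form: the ``data`` and ``slices`` pair that the constructor loads. The sketch below is a simplified pure-Python analogy of that idea, assuming each graph is just a list of node values; the real objects are tensors managed by PyTorch Geometric, and the helpers ``collate`` and ``get`` are hypothetical.

```python
def collate(graphs):
    """Concatenate per-graph node lists and record the slice boundaries."""
    data, slices = [], [0]
    for nodes in graphs:
        data.extend(nodes)
        slices.append(len(data))  # end offset of this graph
    return data, slices


def get(data, slices, idx):
    """Recover graph number idx from the concatenated storage."""
    return data[slices[idx]:slices[idx + 1]]


# Three hypothetical graphs with 2, 1 and 3 nodes
graphs = [[1.0, 2.0], [3.0], [4.0, 5.0, 6.0]]
data, slices = collate(graphs)
print(slices)
print(get(data, slices, 2))
```

Storing one concatenated container plus offsets keeps the whole dataset in memory as a single object while still allowing individual graphs to be retrieved by index.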

◆ processed_file_names()

def processed_file_names ( self )

Name of processed file.

Definition at line 328 of file geometric_datasets.py.

The documentation for this class was generated from the following file:

analysis/scripts/grafei/model/geometric_datasets.py
