Bisect Plot Examples¶

Introduction¶

This notebook explains the use case of the prepare_datasets, and plot_bisect functions from the Plaid library. The function is used to generate bisect plots for different scenarios using file paths and PLAID objects.

# Importing Required Libraries
from pathlib import Path
import os

from plaid import Dataset
from plaid.post.bisect import plot_bisect, prepare_datasets
from plaid import ProblemDefinition

# Setting up Directories
try:
    dataset_directory = Path(__file__).parent.parent.parent / "tests" / "post"
except NameError:
    dataset_directory = Path("..") / ".." / ".." / ".." / "tests" / "post"

Prepare Datasets for comparision¶

Assuming you have reference and predicted datasets, and a problem definition, The prepare_datasets function is used to obtain output scalars for subsequent analysis.

# Load PLAID datasets and problem metadata objects
ref_ds = Dataset(dataset_directory / "dataset_ref")
pred_ds = Dataset(dataset_directory / "dataset_near_pred")
problem = ProblemDefinition(dataset_directory / "problem_definition")

# Get output scalars from reference and prediction dataset
ref_out_scalars, pred_out_scalars, out_scalars_names = prepare_datasets(
    ref_ds, pred_ds, problem, verbose=True
)

print(f"{out_scalars_names = }\n")

/tmp/ipykernel_3425/4066271457.py:7: DeprecationWarning: use `get_out_features_identifiers` instead [since v0.1.8] (will be removed in v0.2.0)
  ref_out_scalars, pred_out_scalars, out_scalars_names = prepare_datasets(

  0%|          | 0/30 [00:00<?, ?it/s]

100%|██████████| 30/30 [00:00<00:00, 36588.87it/s]

out_scalars_names = ['feature_2']

# Get output scalar
key = out_scalars_names[0]

print(f"KEY '{key}':\n")
print(f"ID{' ' * 5}--REF_out_scalars--{' ' * 7}--PRED_out_scalars--")

# Print output scalar values for both datasets
index = 0
for item1, item2 in zip(ref_out_scalars[key], pred_out_scalars[key]):
    print(
        f"{str(index).ljust(2)}  |  {str(item1).ljust(20)}  |   {str(item2).ljust(20)}"
    )
    index += 1

KEY 'feature_2':

ID     --REF_out_scalars--       --PRED_out_scalars--
 |  0.6026743950988548    |   0.6026804218428058  
 |  0.4010294552789103    |   0.40103346557346314 
 |  0.4594691101022568    |   0.4594737047933578  
 |  0.1734062211920243    |   0.17340795525423625 
 |  0.21668654508983876   |   0.21668871195528966 
 |  0.2181819183396143    |   0.21818410015879772 
 |  0.5203262339832092    |   0.5203314372455491  
 |  0.6797440087453753    |   0.6797508061854628  
 |  0.7589424187607021    |   0.7589500081848898  
 |  0.6345020332023174    |   0.6345083782226494  
|  0.7550570027405668    |   0.7550645533105942  
|  0.11902182132257      |   0.11902301154078324 
|  0.2277144265635649    |   0.22771670370783054 
|  0.26104884230989456   |   0.26105145279831765 
|  0.7418202872333769    |   0.7418277054362493  
|  0.8670795819513495    |   0.867088252747169   
|  0.9819893923184365    |   0.9819992122123598  
|  0.14881640806200802   |   0.14881789622608865 
|  0.10477886888726429   |   0.10477991667595317 
|  0.8075981575733645    |   0.8076062335549403  
|  0.11251682284184616   |   0.11251794801007459 
|  0.6874297933397363    |   0.6874366676376698  
|  0.6063536260025336    |   0.6063596895387937  
|  0.6013786933270137    |   0.601384707113947   
|  0.30270251139317916   |   0.3027055384182931  
|  0.49927569741356614   |   0.49928069017054033 
|  0.43825290644848336   |   0.4382572889775479  
|  0.46219564786354983   |   0.4622002698200285  
|  0.9643573718446001    |   0.9643670154183186  
|  0.09651135295899105   |   0.09651231807252064 

Plotting with File Paths¶

Here, we load the datasets and problem metadata from file paths and use the plot_bisect function to generate a bisect plot for a specific scalar, in this case, “scalar_2.”

print("=== Plot with file paths ===")

# Load PLAID datasets and problem metadata from files
ref_path = dataset_directory / "dataset_ref"
pred_path = dataset_directory / "dataset_pred"
problem_path = dataset_directory / "problem_definition"

# Using file paths to generate bisect plot on feature_2
plot_bisect(ref_path, pred_path, problem_path, "feature_2", "differ_bisect_plot")

=== Plot with file paths ===

../../../_images/332fef07268fc99a4fa8bc1e39cde81945a171cf1b0d165e6110d9fb5424811c.png

Plotting with PLAID¶

In this section, we demonstrate how to use PLAID objects directly to generate a bisect plot. This can be advantageous when working with PLAID datasets in memory.

print("=== Plot with PLAID objects ===")

# Load PLAID datasets and problem metadata objects
ref_path = Dataset(dataset_directory / "dataset_ref")
pred_path = Dataset(dataset_directory / "dataset_pred")
problem_path = ProblemDefinition(dataset_directory / "problem_definition")

# Using PLAID objects to generate bisect plot on feature_2
plot_bisect(ref_path, pred_path, problem_path, "feature_2", "equal_bisect_plot")

=== Plot with PLAID objects ===

Mixing with Scalar Index and Verbose¶

In this final section, we showcase a mix of file paths and PLAID objects, incorporating a scalar index and enabling the verbose option when generating a bisect plot. This can provide more detailed information during the plotting process.

print("=== Mix with scalar index and verbose ===")

# Mix
ref_path = dataset_directory / "dataset_ref"
pred_path = dataset_directory / "dataset_near_pred"
problem_path = ProblemDefinition(dataset_directory / "problem_definition")

# Using scalar index and verbose option to generate bisect plot
scalar_index = 0
plot_bisect(
    ref_path,
    pred_path,
    problem_path,
    scalar_index,
    "converge_bisect_plot",
    verbose=True,
)

os.remove("converge_bisect_plot.png")
os.remove("differ_bisect_plot.png")
os.remove("equal_bisect_plot.png")

=== Mix with scalar index and verbose ===
Data preprocessing...

  0%|          | 0/30 [00:00<?, ?it/s]

100%|██████████| 30/30 [00:00<00:00, 22786.87it/s]

Bisect graph construction...
Bisect graph saving...

...Bisect plot done

../../../_images/9c5bd2933caf4a2aadbb31f69b1cf2fd5c68b21f90a3c992af1c1cd6a5f1357b.png