The STATS NetCDF files

Contents

The STATS NetCDF files#

How to use Xarray with the -STATS.nc NetCDF files

Access the settings used to process the data#

This is metada contained within the ‘steps’ attribute. pyopia.pipeline.steps_from_xstats() can extract this from the xarray for you:

toml_steps = pyopia.pipeline.steps_from_xstats(xstats)
toml_steps

{'general': {'raw_files': 'raw_data/*.silc', 'pixel_size': 24},
 'steps': {'classifier': {'pipeline_class': 'pyopia.classify.Classify',
   'model_path': 'keras_model.h5'},
  'load': {'pipeline_class': 'pyopia.instrument.silcam.SilCamLoad'},
  'imageprep': {'pipeline_class': 'pyopia.instrument.silcam.ImagePrep',
   'image_level': 'imraw'},
  'segmentation': {'pipeline_class': 'pyopia.process.Segment',
   'threshold': 0.85},
  'statextract': {'pipeline_class': 'pyopia.process.CalculateStats'},
  'output': {'pipeline_class': 'pyopia.io.StatsToDisc',
   'output_datafile': './test'}}}

You can use this to modify settings, or re-process a dataset using pyopia.pipeline.Pipeline

Or you might want to access some other metadata, such as pixel size, for use in analysis:

toml_steps['general']['pixel_size']

We can plot directly from xarray in exactly the same way as from the Pandas DataFrame (so it doesn’t matter which you use here). The benefit of ‘xstats’ as an xarray is that it now contains it’s own metadata

import matplotlib.pyplot as plt

dias, vd = pyopia.statistics.vd_from_stats(xstats, toml_steps['general']['pixel_size'])

plt.plot(dias, vd, label=f"Threshold={toml_steps['steps']['segmentation']['threshold']}")
plt.xscale('log')
plt.xlabel('ECD [um]')
plt.ylabel('Volume Distribution [uL/sample vol.]')
plt.legend()
plt.show()

../_images/3a77f9ce87e0da9dd54c84392f6ecd4025487f59cb0e4bd05db50b38dcb3bbf7.png