Exploratory Data Analysis¶
-
src.actions.eda.extract_full_eda_df(mseq_list, options, output_path_list, lseq_list=None)[source]¶ Creates and saves a list of full size image EDA dataframes from a list MaskSequences.
- Parameters
mseq_list (
List[MaskSequence]) – A list of MaskSequence objectsoptions (
Dict) – the options of what should be included in the dataframe; the option name should be the key and the value should be either true or falseoutput_path_list (
List[str]) – the list of output csv file pathslseq_list (
Optional[List[LeafSequence]]) – A list of LeafSequence objects; this is only required if the linked filename option is used
- Return type
None- Returns
None
-
src.actions.eda.extract_tiles_eda_df(mseq_list, options, output_path_list, lseq_list=None)[source]¶ Creates and saves a list of tile image EDA dataframes from a list MaskSequences.
- Parameters
mseq_list (
List[MaskSequence]) – A list of MaskSequence objectsoptions (
Dict) – the options of what should be included in the dataframe; the option name should be the key and the value should be either true or falseoutput_path_list (
List[str]) – the list of output csv file pathslseq_list (
Optional[List[LeafSequence]]) – A list of LeafSequence objects; this is only required if the linked filename option is used
- Return type
None- Returns
None
-
src.actions.eda.extract_full_databunch_df(lseq_list, mseq_list, output_path_list, embolism_only=False)[source]¶ Extracts a databunch dataframe of full size images. The first field is the leaf path and the second field is the mask name. This is useful for Fastai.
- Parameters
lseq_list (
List[LeafSequence]) – A list of LeafSequence objectsmseq_list (
List[MaskSequence]) – A list of MaskSequence objectsoutput_path_list (
List[str]) – the list of output csv file pathsembolism_only – whether only leaves with embolisms should be used
- Return type
None- Returns
None
-
src.actions.eda.extract_tiles_databunch_df(lseq_list, mseq_list, output_path_list, tile_embolism_only=False, leaf_embolism_only=False)[source]¶ Extracts a databunch dataframe of full size images. The first field is the leaf path and the second field is the mask name. This is useful for Fastai.
- Parameters
lseq_list (
List[LeafSequence]) – A list of LeafSequence objectsmseq_list (
List[MaskSequence]) – A list of MaskSequence objectsoutput_path_list (
List[str]) – the list of output csv file pathstile_embolism_only (
bool) – whether only tiles with embolisms should be usedleaf_embolism_only (
bool) – whether only leaves with embolisms should be used
- Return type
None- Returns
None