ageas.n_iter_extraction
- ageas.n_iter_extraction(hangar: Hangar, operation_name: str = 'extraction', accelerator: str = 'cpu', cuda_devices: list = None, query_dataset=None, test_dataset=None, exp_dataset=None, max_extraction_iter: int = 1, extract_top_n: int = 10, use_gene_names: bool = True, seed: int = 42, verbose: bool = None, selection_args: dict = None, explain_args: dict = None) tuple
N-iteration extraction of top regulatory factors per class.
Each iteration runs
n_kfold_selection(), debriefs the surviving deck to obtain per-class explanation scores, prunes outlier features, and feeds the trimmed dataset back into the next iteration. The per-iteration explanations are L1-aggregated and the top-N factors per class are ranked across iterations.- Parameters:
hangar – Source hangar from which the operating squad is generated.
operation_name – Operation name under which per-iteration reports are stored.
accelerator – Accelerator hint forwarded to selection (
'cpu'or'cuda').cuda_devices – Optional GPU device indices.
query_dataset – Required dataset that drives selection and (by default) the explanation pass.
test_dataset – Optional held-out test dataset.
exp_dataset – Optional explanation dataset. Defaults to
test_datasetwhen provided, otherwisequery_dataset.max_extraction_iter – Number of extraction iterations.
extract_top_n – Number of top factors to retain per class in the final ranking.
use_gene_names – If
True, the returned table usesadata.var['name']as the index instead of feature IDs.seed – Random seed forwarded to selection.
verbose – If
True, emit per-iteration progress logs.selection_args – Extra keyword arguments forwarded to
n_kfold_selection().explain_args – Extra keyword arguments forwarded to
debrief().
- Returns:
(top_factors, final_exp).top_factorsis the per-class ranked factor table (with anOutlier_Itercolumn flagging held-out features), andfinal_expis the L1-normalised integrated explanation table.