AutoMLR 1.0.0
- CRAN pretest fix: replaced the
extreme_surv_screen()
regression test with a deterministic survival-only fixture using
stepwise_cox, so the test no longer depends on optional
glmnet behavior and remains stable on Windows and Debian
R-devel incoming checks.
- Single-cohort report fix: cohort-aware survival, binary, continuous,
and ordinal report tables now use a unified panel rule. A single real
cohort is labelled by its cohort name, a literal
All cohort
is labelled Overall, and only multi-cohort analyses add
Overall plus individual cohorts. This removes duplicated
single-cohort panels such as Overall plus
TCGA_LUAD_PanCan.
- Survival publication figures now adapt more layout settings to the
actual rendered data: feature-importance and SHAP-style plots use
data-driven left margins and point sizes, nomogram row spacing expands
with the number of horizons, calibration and decision-curve legends
choose less occupied plot corners, and the combination benchmark
allocates relatively more space to the side performance bars.
- Binary publication figures now use the same data-aware plotting
utilities for ROC, precision-recall, calibration, decision-curve,
confusion-matrix, feature-importance, and benchmark plots, reducing
label clipping and legend overlap in single- and multi-cohort
outputs.
- Continuous and ordinal publication figures now compute
ranking/importance margins from the longest model or feature label and
reduce point size in observed-vs-predicted or observed-vs-score scatter
plots as sample size increases. Ordinal confusion-matrix labels and cell
counts also scale with the number and length of classes.
- DESCRIPTION wording was tightened for CRAN by removing unnecessary
acronym parentheticals where the full method name is already written
out.
- Routed model-evaluation and ensemble-fitting progress output through
log_message() so initialize_auto_logging()
captures messages such as Evaluating ... and
Fitting ... in the log file as well as the console.
- Changed survival-SVM candidate evaluation to use k-fold resampling
by default (
surv_svm_resampling = "kfold") to avoid known
single-row prediction failures from survivalsvm under
leave-one-out cross-validation.
- Improved publication plotting defaults: heatmaps drop all-NA rows
while marking partial NA cells, heatmap color scales adapt to the finite
metric range, forest plots use data-aware x-axis limits, and continuous
residual histograms use data-driven breaks and axis ranges.
- Added continuous and ordinal ensemble-member fitting progress
messages.
- Exported
automlr_input_to_surv_xy(),
automlr_input_to_binary_xy(),
automlr_input_to_continuous_xy(), and
automlr_input_to_ordinal_xy() so users can call lower-level
evaluation functions without relying on internal :::
helpers.
- Added optional automatic threshold recommendations for
threshold-style ensemble selection:
auto_min_cindex,
auto_min_auc, auto_min_r2, and
auto_min_qwk, controlled by
auto_quantile.
- Added helper functions
recommend_surv_cindex_threshold(),
recommend_binary_auc_threshold(),
recommend_continuous_r2_threshold(), and
recommend_ordinal_qwk_threshold() for explicit threshold
review.
- Moved heavyweight model engines from strong imports to optional
suggested dependencies so installation no longer requires all modelling
backends.
- Added
check_automlr_dependencies() to report available
and missing model backends, optional features, expected skip/degradation
behavior, and install commands.
- Made logging and parallel execution degrade gracefully when optional
log4r, future, or future.apply
packages are unavailable.
- Added continuous-outcome workflows:
prepare_continuous_cohort_input(),
continuousmlr_parameters(),
fit_continuous_ensemble(),
export_continuous_results(), and
render_continuous_report().
- Added ordinal-outcome workflows:
prepare_ordinal_cohort_input(),
ordinalmlr_parameters(),
fit_ordinal_ensemble(),
export_ordinal_results(), and
render_ordinal_report().
- Added 18 default continuous/ordinal model variants across penalized
regression, linear / stepwise linear regression, GBM, random forest,
PCA-linear, and mean-baseline families.
- Continuous model selection defaults to out-of-fold RMSE, with MAE,
R-squared, Pearson, Spearman, cohort performance, observed-vs-predicted,
residual, and feature-importance diagnostics.
- Ordinal model selection defaults to out-of-fold quadratic weighted
kappa, with accuracy, balanced accuracy, class MAE, score RMSE,
Spearman, confusion-matrix, observed-score, and feature-importance
diagnostics.
- Hardened the binary-classification workflow: multi-class outcomes
are now rejected by default unless users explicitly request
collapse_other = TRUE, and negative_class can
be supplied for clear positive / negative mapping.
- Added binary preprocessing audits for missingness filtering, median
imputation, zero / low-variance filtering, and optional feature
standardization.
- Added binary k-fold and repeated k-fold resampling options while
preserving LOOCV as the default.
- Split binary exported predictions into apparent and out-of-fold
probabilities/classes; default diagnostic tables and plots now use
out-of-fold probabilities.
- Fixed binary
strategy = "threshold" report/export
compatibility.
- Added binary
model_performance_forest.csv and
fig9_model_performance_forest with OOF ROC AUC and
approximate 95% CI.
- Added a binary-classification workflow parallel to the survival
workflow:
prepare_binary_cohort_input(),
binarymlr_parameters(), fit_binary_ensemble(),
export_binary_results(), and binary summary helpers.
- Added 18 default binary model variants across 9 algorithm families:
penalized logistic regression, standard / stepwise logistic regression,
GBM, random forest, PCA-logistic, and Gaussian naive Bayes.
- Added binary single-model and single-/two-model
probability-combination ranking by LOOCV ROC AUC, with PR-AUC, Brier
score, threshold metrics, cohort AUC stability, calibration, DCA,
confusion matrix, and feature importance exported as diagnostics.
- Added default explainability and clinical-utility outputs to regular
survival exports: permutation feature importance, SHAP-style
median-ablation summary and dependence plots, risk-score nomogram,
calibration curve, decision curve analysis, and a model C-index forest
plot.
- Refined the nomogram and SHAP-style figure set after FigureYa /
regplot and SHAP documentation review: the nomogram now uses a points /
total-points / event-risk ruler layout with risk-score distribution
marks, while the SHAP figures follow mean-absolute-contribution bar,
beeswarm summary, and dependence-with-density conventions.
- Added corresponding CSV audit tables:
feature_importance.csv,
shap_approx_contributions.csv,
risk_score_nomogram.csv,
calibration_curve.csv, dca_curve.csv,
model_cindex_forest.csv, and
risk_prediction_horizon.csv.
- Embedded the final publication figure set directly in the default
HTML report while keeping diagnostic figures in a separate diagnostic
folder.
- Clarified the four regular-analysis interpretation checkpoints in
the default report bundle: data preparation, base-model screening,
ensemble selection, and explainability / clinical utility.
- Added
summarize_explainability_results() and bilingual
interpretation text explaining that SHAP-style outputs are
median-ablation approximations and that nomogram / calibration / DCA
diagnostics are based on the final risk-score Cox calibration.
- Stored the training feature matrix inside fitted survival ensembles
so exported explainability diagnostics can be regenerated from the
fitted object.
- Validated the regular workflow on a 100-sample TCGA-LUAD test
dataset with all publication figures, CSV tables, HTML report, and
bilingual summaries generated successfully.
AutoMLR 0.1.0
- Added 18 default survival-model variants across 10 registry
entries.
- Added LOOCV C-index evaluation, two-model all-subsets combination
search, and weighted survival-risk ensembles.
- Added fold-level LOOCV parallelism and shared glmnet fits for lambda
variants.
- Added cohort / resampling stability diagnostics while keeping
C-index as the default selection criterion.
- Added direct
fit_surv_ensemble(automlr_input) support
so cohort labels from prepare_cohort_input() are used
automatically for stability diagnostics.
- Added
render_surv_report() to write an HTML report with
separate figures/ and tables/ folders.
- Added
export_surv_results() for batch output, including
publication-ready figures, tables, fitted objects, and session
metadata.
- Added training-set
risk_scores.csv export and a
risk-stratified Kaplan-Meier figure.
- Improved publication figures with Kaplan-Meier number-at-risk
tables, cohort heatmap legends, and optional time-dependent AUC
output.
- Added optional timeROC curve export and a complete all-single-model
cohort C-index heatmap for full variant-level inspection.
- Added IRLS-inspired publication panels: a combination-by-cohort
benchmark matrix, multi-cohort Kaplan-Meier panels, and multi-cohort
timeROC panels.
- Deduplicated the default publication output to a final figure set:
all-model heatmap, combination benchmark matrix, multi-cohort KM where
estimable, and multi-cohort timeROC when
timeROC is
available.
- Added
extreme_surv_screen() for two-stage extreme
screening: apparent full-data upper-bound ranking followed by top-N
70/30 seed search.
- Added
export_extreme_screen_results() with complete
audit tables and a Morandi-toned extreme-screening figure set.
- Added
summarize_extreme_screen_results() and automatic
bilingual English / Chinese summary_report.md export to
explain the best apparent models, seed-search leaders, train /
validation C-index results, cohort diagnostics, and failure notes.
- Added regular-analysis Markdown summary templates for data
preparation, base-model screening, and ensemble selection.
render_surv_report() and export_surv_results()
now write bilingual English / Chinese summaries by default.
- Changed the regular ensemble default to search single- and two-model
candidates (
min_models = 1, max_models = 2) so
users can directly compare the best single model with the best two-model
combination.
- Added a figure-rich tutorial at
inst/tutorials/AutoMLR_tutorial.md, with standalone code
blocks and interpretation guides for regular analysis and extreme
screening.