Initial CRAN release.
gaussian_copula_synthesizer()) that fits a single joint
copula over all modeled columns: numerical,
categorical, and boolean.norm, beta,
gamma, truncnorm, and uniform by
Kolmogorov-Smirnov distance. Per-column overrides via
numerical_distributions; global default via
default_distribution.sample() for unconditional generation and
sample_conditions() for conditional generation on
categorical or boolean values via rejection sampling.metadata(),
set_column_type(), set_primary_key()) with
auto-detection and JSON serialization (metadata_to_json(),
save_metadata()).add_constraint(), check_constraints()),
enforced via rejection sampling.quality_report() aggregates metrics into the
two-property hierarchy used by the Python SDMetrics
library:
correlation_similarity() for numerical pairs,
contingency_similarity() for categorical pairs). ML
efficacy (train-on-synthetic / test-on-real, TSTR/TRTR) is reported
separately, not folded into the overall score.diagnostic_report() checks structural validity:
boundary adherence (numerical ranges), category adherence (categorical
values), and key uniqueness for primary keys.privacy_report() reports the nearest-neighbour distance
ratio (NNDR) and, optionally, attribute disclosure risk.autoplot() methods for quality, diagnostic, and privacy
reports.adult_income — a 500-row sample of the
UCI Adult Income dataset used in examples and vignettes.