dataProfilerR 0.2.1
CRAN release: 2026-06-24
Changes requested during the initial CRAN review:
- Added method references (Shapiro-Wilk, Anderson-Darling, Cramer’s V) to the Description field.
-
normality_tests()no longer touches the global random-number state. Large columns are now reduced with a deterministic, evenly-spaced subsample instead ofset.seed()+sample(); theseedargument has been removed.
dataProfilerR 0.2.0
New analysis and reporting:
-
report()renders a complete profile to a self-contained HTML file (requires pandoc, via ). -
categorical_association()andplot_association()add Cramer’s V between categorical columns (the categorical analogue of the correlation matrix). -
analyze_dates()profiles date/datetime columns: range, unique count, and the largest gap between consecutive timestamps. -
compare_groups()summarises numeric columns within the levels of a grouping column (grouped/comparative profiling).
Pipeline changes:
-
profile_data()gainsgroup_by(adds a grouped comparison to the diagnostics) anddistributions(setFALSEto skip the eager per-column distribution plots on wide data). Association and date results are now part of the returned object, andplot()acceptswhich = "association". -
summary()now also prints date, association and grouped-comparison sections when present.
dataProfilerR 0.1.0
- First version:
profile_data()with type inference, missing-value analysis, summary statistics (incl. skewness/kurtosis), normality tests, outlier detection (IQR/z-score/robust), correlation analysis, a data-quality score, andggplot2figures, returned as adata_profileS3 object withprint(),summary()andplot()methods.