dataProfilerR: automated exploratory data analysis
Source:R/dataProfilerR-package.R
dataProfilerR-package.RddataProfilerR profiles a data frame with a single call. It infers column
types, quantifies missingness, computes distributional statistics, runs
normality tests, detects outliers, measures correlation, and rolls the
findings into a data-quality score. It also builds a set of ggplot2
visualisations. The main entry point is profile_data(), which returns a
data_profile S3 object with print(), summary() and plot() methods.
Design
The package uses the S3 object system. The profiling result is a plain list
with class "data_profile", which keeps the structure transparent and easy
to inspect, serialise, and extend. S4 would add formality (and overhead) that
an EDA result object does not need.
Author
Maintainer: Muhammad Farooqi mqfarooqi@gmail.com
Authors:
Muhammad Farooqi mqfarooqi@gmail.com