A toolkit to simplify data exploration and preparation in R, enabling quicker insights and reducing time spent on tedious data manipulations.
- Identify constant columns to prevent redundant computations.
- Detect duplicate columns to streamline datasets.
- Generate frequency tables for each column, with outputs in Excel format.
- Create a "glimpse" of your dataset for quick overview.
- Determine the primary key of your dataset for efficient data manipulation.
- Calculate percentiles for all numeric columns to understand data distribution.
To install the package from CRAN, use:
install.packages("dataframeexplorer")
Here are some common functions you can use with dataframeexplorer
:
- Detect constant columns:
detect_const_cols(dataset)
- Find duplicate columns:
detect_dupl_cols(dataset)
- Generate a frequency table:
frequency_table(dataset, output_filename = "frequency_table.xlsx")
- Create a dataset glimpse:
glimpse_to_file(dataset, output_filename = "glimpse.txt")
- Determine primary key:
level_of_data(dataset, output_filename = "level.txt")
- Calculate percentiles:
percentiles_table(dataset, output_filename = "percentiles.xlsx")
- R (>= 3.3.0)
- Imports:
tibble
,data.table
,magrittr
,openxlsx
,dplyr
,plyr
,tidyr
,stringr
This package is licensed under the MIT License. See the LICENSE file for details.
Developed and maintained by Ashrith Reddy.
For full documentation and examples, please visit the CRAN page.