-Checklist Solution
1-Duplicates solution
data<-dataset #dataset
#1. Duplicates values: Find the duplicates
#values (only) in primary key better
#or all of the dataset
#packages:
library(skimr)
library(Hmisc)
data_1<-unique(data) #duplicates in primary key
before<-length(data$primarykey)
before
after<-length(data_1$primarykey)
after
different<-before-after
different
before_after_matrix<-cbind(before,after)
before_after_matrix2- Missing Value solution
3-Outliers solution
Last updated