I think only the skew factor of the target tables needs to be considered, because that is where the data distribution actually happens. The source data feeding the target table's primary index column(s) should have high cardinality: the values should be unique or contain few duplicates. If the primary index has many duplicate (non-unique) values, skew increases, because rows with the same primary index value always hash to the same AMP, so most of the data ends up on a single AMP.
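To make the effect concrete, here is a rough Python simulation of the idea, not Teradata itself. The assumptions: `amp_for` uses MD5 as a stand-in for the real row-hash algorithm, the AMP count of 10 is arbitrary, and skew factor is computed with the commonly quoted formula (1 - average rows per AMP / maximum rows per AMP) * 100. All function names are hypothetical.

```python
import hashlib
from collections import Counter


def amp_for(value, num_amps):
    """Map a primary index value to an AMP number.

    Stand-in hash only: this is NOT Teradata's real row-hash algorithm.
    What matters is that equal values always land on the same AMP.
    """
    digest = hashlib.md5(str(value).encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % num_amps


def skew_factor(values, num_amps):
    """Return (1 - avg rows per AMP / max rows per AMP) * 100."""
    counts = Counter(amp_for(v, num_amps) for v in values)
    per_amp = [counts.get(amp, 0) for amp in range(num_amps)]
    busiest = max(per_amp)
    if busiest == 0:
        return 0.0  # no rows at all, so nothing is skewed
    average = sum(per_amp) / num_amps
    return (1 - average / busiest) * 100


# Unique primary index values: rows spread almost evenly.
unique_keys = list(range(100_000))

# Mostly one repeated value: those rows all pile onto a single AMP.
duplicate_keys = [1] * 90_000 + list(range(10_000))

if __name__ == "__main__":
    print("unique PI skew:   ", round(skew_factor(unique_keys, 10), 1))
    print("duplicate PI skew:", round(skew_factor(duplicate_keys, 10), 1))
```

With the unique keys the skew factor stays close to zero, while the heavily duplicated keys push it very high, which is the behaviour described above. On a real system you would check the target table's actual row distribution per AMP rather than rely on a simulation like this.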