Dataset statistics
Number of variables | 19 |
---|---|
Number of observations | 33053760 |
Missing cells | 208735906 |
Missing cells (%) | 33.2% |
Total size in memory | 4.7 GiB |
Average record size in memory | 152.0 B |
Variable types
Numeric | 13 |
---|---|
Text | 6 |
collater_valueofguarantee_1124L has 32583584 (98.6%) missing values | Missing |
collater_valueofguarantee_876L has 31725982 (96.0%) missing values | Missing |
pmts_dpd_1073P has 27121373 (82.1%) missing values | Missing |
pmts_dpd_303P has 18691259 (56.5%) missing values | Missing |
pmts_month_158T has 23804232 (72.0%) missing values | Missing |
pmts_month_706T has 2619852 (7.9%) missing values | Missing |
pmts_overdue_1140A has 27091280 (82.0%) missing values | Missing |
pmts_overdue_1152A has 18674260 (56.5%) missing values | Missing |
pmts_year_1139T has 23804232 (72.0%) missing values | Missing |
pmts_year_507T has 2619852 (7.9%) missing values | Missing |
collater_valueofguarantee_1124L is highly skewed (γ1 = 44.20781194) | Skewed |
collater_valueofguarantee_876L is highly skewed (γ1 = 79.59866574) | Skewed |
pmts_dpd_303P is highly skewed (γ1 = 31.61005875) | Skewed |
pmts_overdue_1140A is highly skewed (γ1 = 1218.634842) | Skewed |
pmts_overdue_1152A is highly skewed (γ1 = 305.0209325) | Skewed |
collater_valueofguarantee_1124L has 430530 (1.3%) zeros | Zeros |
collater_valueofguarantee_876L has 1178949 (3.6%) zeros | Zeros |
num_group1 has 6717721 (20.3%) zeros | Zeros |
num_group2 has 1377202 (4.2%) zeros | Zeros |
pmts_dpd_1073P has 5596903 (16.9%) zeros | Zeros |
pmts_dpd_303P has 12092577 (36.6%) zeros | Zeros |
pmts_overdue_1140A has 5621230 (17.0%) zeros | Zeros |
pmts_overdue_1152A has 12008791 (36.3%) zeros | Zeros |
Reproduction
Analysis started | 2024-02-13 19:48:47.646804 |
---|---|
Analysis finished | 2024-02-13 19:49:42.540205 |
Duration | 54.89 seconds |
Software version | ydata-profiling vv4.6.4 |
Download configuration | config.json |
case_id
Real number (ℝ)
Distinct | 231250 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1457854.653 |
Minimum | 36830 |
---|---|
Maximum | 2658153 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 252.2 MiB |
Quantile statistics
Minimum | 36830 |
---|---|
5-th percentile | 180883 |
Q1 | 917023 |
median | 1666366 |
Q3 | 1716789 |
95-th percentile | 2648671 |
Maximum | 2658153 |
Range | 2621323 |
Interquartile range (IQR) | 799766 |
Descriptive statistics
Standard deviation | 657453.4552 |
---|---|
Coefficient of variation (CV) | 0.4509732531 |
Kurtosis | 0.03472349159 |
Mean | 1457854.653 |
Median Absolute Deviation (MAD) | 58227 |
Skewness | -0.4679276101 |
Sum | 4.81875778 × 1013 |
Variance | 4.322450457 × 1011 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
1706379 | 5986 | < 0.1% |
188150 | 3720 | < 0.1% |
1653560 | 3540 | < 0.1% |
179043 | 2604 | < 0.1% |
887695 | 2363 | < 0.1% |
185622 | 2328 | < 0.1% |
1741318 | 2124 | < 0.1% |
1663882 | 2112 | < 0.1% |
895818 | 2112 | < 0.1% |
869668 | 2040 | < 0.1% |
Other values (231240) | 33024831 |
Value | Count | Frequency (%) |
36830 | 144 | |
36883 | 36 | < 0.1% |
37083 | 60 | |
37128 | 36 | < 0.1% |
37129 | 72 |
Value | Count | Frequency (%) |
2658153 | 36 | < 0.1% |
2658152 | 156 | |
2658151 | 348 | |
2658150 | 228 | |
2658149 | 156 |
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 252.2 MiB |
Value | Count | Frequency (%) |
a55475b1 | 32583584 | |
9a0c095e | 320167 | 1.0% |
8fd95e4b | 149736 | 0.5% |
06fb9ba8 | 273 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 98220655 | |
a | 32904024 | 12.4% |
b | 32733866 | 12.4% |
4 | 32733320 | 12.4% |
7 | 32583584 | 12.3% |
1 | 32583584 | 12.3% |
9 | 790343 | 0.3% |
0 | 640607 | 0.2% |
e | 469903 | 0.2% |
c | 320167 | 0.1% |
Other values (4) | 450027 | 0.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 197702375 | |
Lowercase Letter | 66727705 | 25.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 98220655 | |
4 | 32733320 | 16.6% |
7 | 32583584 | 16.5% |
1 | 32583584 | 16.5% |
9 | 790343 | 0.4% |
0 | 640607 | 0.3% |
8 | 150009 | 0.1% |
6 | 273 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 32904024 | |
b | 32733866 | |
e | 469903 | 0.7% |
c | 320167 | 0.5% |
f | 150009 | 0.2% |
d | 149736 | 0.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 197702375 | |
Latin | 66727705 | 25.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 98220655 | |
4 | 32733320 | 16.6% |
7 | 32583584 | 16.5% |
1 | 32583584 | 16.5% |
9 | 790343 | 0.4% |
0 | 640607 | 0.3% |
8 | 150009 | 0.1% |
6 | 273 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 32904024 | |
b | 32733866 | |
e | 469903 | 0.7% |
c | 320167 | 0.5% |
f | 150009 | 0.2% |
d | 149736 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 264430080 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 98220655 | |
a | 32904024 | 12.4% |
b | 32733866 | 12.4% |
4 | 32733320 | 12.4% |
7 | 32583584 | 12.3% |
1 | 32583584 | 12.3% |
9 | 790343 | 0.3% |
0 | 640607 | 0.2% |
e | 469903 | 0.2% |
c | 320167 | 0.1% |
Other values (4) | 450027 | 0.2% |
Distinct | 7 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 252.2 MiB |
Value | Count | Frequency (%) |
a55475b1 | 31725982 | |
9a0c095e | 721659 | 2.2% |
8fd95e4b | 603656 | 1.8% |
06fb9ba8 | 2129 | < 0.1% |
3cbe86ba | 322 | < 0.1% |
c7a5ad39 | 10 | < 0.1% |
9276e4bb | 2 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 96503271 | |
a | 32450112 | 12.3% |
b | 32334544 | 12.2% |
4 | 32329640 | 12.2% |
7 | 31725994 | 12.0% |
1 | 31725982 | 12.0% |
9 | 2049115 | 0.8% |
0 | 1445447 | 0.5% |
e | 1325639 | 0.5% |
c | 721991 | 0.3% |
Other values (6) | 1818345 | 0.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 196388343 | |
Lowercase Letter | 68041737 | 25.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 96503271 | |
4 | 32329640 | 16.5% |
7 | 31725994 | 16.2% |
1 | 31725982 | 16.2% |
9 | 2049115 | 1.0% |
0 | 1445447 | 0.7% |
8 | 606107 | 0.3% |
6 | 2453 | < 0.1% |
3 | 332 | < 0.1% |
2 | 2 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 32450112 | |
b | 32334544 | |
e | 1325639 | 1.9% |
c | 721991 | 1.1% |
f | 605785 | 0.9% |
d | 603666 | 0.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 196388343 | |
Latin | 68041737 | 25.7% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 96503271 | |
4 | 32329640 | 16.5% |
7 | 31725994 | 16.2% |
1 | 31725982 | 16.2% |
9 | 2049115 | 1.0% |
0 | 1445447 | 0.7% |
8 | 606107 | 0.3% |
6 | 2453 | < 0.1% |
3 | 332 | < 0.1% |
2 | 2 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 32450112 | |
b | 32334544 | |
e | 1325639 | 1.9% |
c | 721991 | 1.1% |
f | 605785 | 0.9% |
d | 603666 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 264430080 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 96503271 | |
a | 32450112 | 12.3% |
b | 32334544 | 12.2% |
4 | 32329640 | 12.2% |
7 | 31725994 | 12.0% |
1 | 31725982 | 12.0% |
9 | 2049115 | 0.8% |
0 | 1445447 | 0.5% |
e | 1325639 | 0.5% |
c | 721991 | 0.3% |
Other values (6) | 1818345 | 0.7% |
collater_valueofguarantee_1124L
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 22269 |
---|---|
Distinct (%) | 4.7% |
Missing | 32583584 |
Missing (%) | 98.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10444179.1 |
Minimum | 0 |
---|---|
Maximum | 1.875995509 × 1010 |
Zeros | 430530 |
Zeros (%) | 1.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 252.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 5887446.5 |
Maximum | 1.875995509 × 1010 |
Range | 1.875995509 × 1010 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 279746416.4 |
---|---|
Coefficient of variation (CV) | 26.78491183 |
Kurtosis | 2151.88332 |
Mean | 10444179.1 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 44.20781194 |
Sum | 4.910602354 × 1012 |
Variance | 7.825805749 × 1016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 430530 | 1.3% |
1300000000 | 427 | < 0.1% |
3000000 | 250 | < 0.1% |
4000000 | 211 | < 0.1% |
1 | 196 | < 0.1% |
2000000 | 187 | < 0.1% |
5000000 | 183 | < 0.1% |
10000000 | 173 | < 0.1% |
500000000 | 160 | < 0.1% |
20000000 | 160 | < 0.1% |
Other values (22259) | 37699 | 0.1% |
(Missing) | 32583584 |
Value | Count | Frequency (%) |
0 | 430530 | |
0.1 | 2 | < 0.1% |
1 | 196 | < 0.1% |
62.87 | 1 | < 0.1% |
600 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1.875995509 × 1010 | 2 | < 0.1% |
1.758995509 × 1010 | 2 | < 0.1% |
1.59 × 1010 | 2 | < 0.1% |
1.43 × 1010 | 126 | |
1.29 × 1010 | 2 | < 0.1% |
collater_valueofguarantee_876L
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 45779 |
---|---|
Distinct (%) | 3.4% |
Missing | 31725982 |
Missing (%) | 96.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4160211.827 |
Minimum | 0 |
---|---|
Maximum | 1.758995509 × 1010 |
Zeros | 1178949 |
Zeros (%) | 3.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 252.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 205000 |
Maximum | 1.758995509 × 1010 |
Range | 1.758995509 × 1010 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 153782034.7 |
---|---|
Coefficient of variation (CV) | 36.96495301 |
Kurtosis | 7171.495339 |
Mean | 4160211.827 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 79.59866574 |
Sum | 5.523837739 × 1012 |
Variance | 2.364891419 × 1016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1178949 | 3.6% |
60000 | 3771 | < 0.1% |
130000 | 2951 | < 0.1% |
100000 | 2693 | < 0.1% |
50000 | 1947 | < 0.1% |
65000 | 1855 | < 0.1% |
80000 | 1310 | < 0.1% |
150000 | 1247 | < 0.1% |
70000 | 1238 | < 0.1% |
200000 | 1193 | < 0.1% |
Other values (45769) | 130624 | 0.4% |
(Missing) | 31725982 |
Value | Count | Frequency (%) |
0 | 1178949 | |
0.01 | 12 | < 0.1% |
0.02 | 50 | < 0.1% |
0.03 | 28 | < 0.1% |
0.04 | 3 | < 0.1% |
Value | Count | Frequency (%) |
1.758995509 × 1010 | 2 | < 0.1% |
1.59 × 1010 | 2 | < 0.1% |
1.43 × 1010 | 117 | |
6903800000 | 8 | < 0.1% |
6804986362 | 32 | < 0.1% |
Distinct | 15 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 252.2 MiB |
Value | Count | Frequency (%) |
a55475b1 | 31725982 | |
c7a5ad39 | 1002634 | 3.0% |
3cbe86ba | 223621 | 0.7% |
9276e4bb | 35479 | 0.1% |
0e63c0f0 | 20524 | 0.1% |
168ad9f3 | 9980 | < 0.1% |
7b62420e | 8534 | < 0.1% |
5224034a | 7598 | < 0.1% |
940efad7 | 6724 | < 0.1% |
2fd21cf1 | 3780 | < 0.1% |
Other values (5) | 8904 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 96193183 | |
a | 33987913 | 12.9% |
7 | 32784858 | 12.4% |
b | 32255626 | 12.2% |
4 | 31802888 | 12.0% |
1 | 31744990 | 12.0% |
3 | 1267894 | 0.5% |
c | 1255564 | 0.5% |
9 | 1061891 | 0.4% |
d | 1025575 | 0.4% |
Other values (6) | 1049698 | 0.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 195561669 | |
Lowercase Letter | 68868411 | 26.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 96193183 | |
7 | 32784858 | 16.8% |
4 | 31802888 | 16.3% |
1 | 31744990 | 16.2% |
3 | 1267894 | 0.6% |
9 | 1061891 | 0.5% |
6 | 301048 | 0.2% |
8 | 237526 | 0.1% |
0 | 89631 | < 0.1% |
2 | 77760 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 33987913 | |
b | 32255626 | |
c | 1255564 | 1.8% |
d | 1025575 | 1.5% |
e | 296350 | 0.4% |
f | 47383 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 195561669 | |
Latin | 68868411 | 26.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 96193183 | |
7 | 32784858 | 16.8% |
4 | 31802888 | 16.3% |
1 | 31744990 | 16.2% |
3 | 1267894 | 0.6% |
9 | 1061891 | 0.5% |
6 | 301048 | 0.2% |
8 | 237526 | 0.1% |
0 | 89631 | < 0.1% |
2 | 77760 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 33987913 | |
b | 32255626 | |
c | 1255564 | 1.8% |
d | 1025575 | 1.5% |
e | 296350 | 0.4% |
f | 47383 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 264430080 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 96193183 | |
a | 33987913 | 12.9% |
7 | 32784858 | 12.4% |
b | 32255626 | 12.2% |
4 | 31802888 | 12.0% |
1 | 31744990 | 12.0% |
3 | 1267894 | 0.5% |
c | 1255564 | 0.5% |
9 | 1061891 | 0.4% |
d | 1025575 | 0.4% |
Other values (6) | 1049698 | 0.4% |
Distinct | 15 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 252.2 MiB |
Value | Count | Frequency (%) |
a55475b1 | 32583584 | |
c7a5ad39 | 416281 | 1.3% |
9276e4bb | 20888 | 0.1% |
0e63c0f0 | 14267 | < 0.1% |
168ad9f3 | 7741 | < 0.1% |
7b62420e | 5328 | < 0.1% |
3cbe86ba | 1803 | < 0.1% |
f4d8a027 | 1344 | < 0.1% |
940efad7 | 1223 | < 0.1% |
2fd21cf1 | 575 | < 0.1% |
Other values (5) | 726 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 98167722 | |
a | 33428888 | 12.6% |
7 | 33028817 | 12.5% |
b | 32634458 | 12.3% |
4 | 32613528 | 12.3% |
1 | 32592602 | 12.3% |
9 | 446501 | 0.2% |
3 | 440654 | 0.2% |
c | 433237 | 0.2% |
d | 427164 | 0.2% |
Other values (6) | 216509 | 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 197436967 | |
Lowercase Letter | 66993113 | 25.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 98167722 | |
7 | 33028817 | 16.7% |
4 | 32613528 | 16.5% |
1 | 32592602 | 16.5% |
9 | 446501 | 0.2% |
3 | 440654 | 0.2% |
0 | 51143 | < 0.1% |
6 | 50191 | < 0.1% |
2 | 34794 | < 0.1% |
8 | 11015 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 33428888 | |
b | 32634458 | |
c | 433237 | 0.6% |
d | 427164 | 0.6% |
e | 43636 | 0.1% |
f | 25730 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 197436967 | |
Latin | 66993113 | 25.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 98167722 | |
7 | 33028817 | 16.7% |
4 | 32613528 | 16.5% |
1 | 32592602 | 16.5% |
9 | 446501 | 0.2% |
3 | 440654 | 0.2% |
0 | 51143 | < 0.1% |
6 | 50191 | < 0.1% |
2 | 34794 | < 0.1% |
8 | 11015 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 33428888 | |
b | 32634458 | |
c | 433237 | 0.6% |
d | 427164 | 0.6% |
e | 43636 | 0.1% |
f | 25730 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 264430080 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 98167722 | |
a | 33428888 | 12.6% |
7 | 33028817 | 12.5% |
b | 32634458 | 12.3% |
4 | 32613528 | 12.3% |
1 | 32592602 | 12.3% |
9 | 446501 | 0.2% |
3 | 440654 | 0.2% |
c | 433237 | 0.2% |
d | 427164 | 0.2% |
Other values (6) | 216509 | 0.1% |
num_group1
Real number (ℝ)
ZEROS
 
Distinct | 272 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.701797224 |
Minimum | 0 |
---|---|
Maximum | 271 |
Zeros | 6717721 |
Zeros (%) | 20.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 252.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 3 |
Q3 | 6 |
95-th percentile | 15 |
Maximum | 271 |
Range | 271 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 6.427976775 |
---|---|
Coefficient of variation (CV) | 1.367131858 |
Kurtosis | 106.8148704 |
Mean | 4.701797224 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 6.169206137 |
Sum | 155412077 |
Variance | 41.31888542 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 6717721 | |
1 | 4900392 | |
2 | 3807814 | |
3 | 3071154 | |
4 | 2522337 | 7.6% |
5 | 2085478 | 6.3% |
6 | 1727995 | 5.2% |
7 | 1425804 | 4.3% |
8 | 1175784 | 3.6% |
9 | 975470 | 3.0% |
Other values (262) | 4643811 |
Value | Count | Frequency (%) |
0 | 6717721 | |
1 | 4900392 | |
2 | 3807814 | |
3 | 3071154 | |
4 | 2522337 | 7.6% |
Value | Count | Frequency (%) |
271 | 12 | |
270 | 12 | |
269 | 12 | |
268 | 12 | |
267 | 12 |
num_group2
Real number (ℝ)
ZEROS
 
Distinct | 101 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.20154557 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 1377202 |
Zeros (%) | 4.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 252.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 6 |
median | 12 |
Q3 | 20 |
95-th percentile | 31 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 14 |
Descriptive statistics
Standard deviation | 9.278772653 |
---|---|
Coefficient of variation (CV) | 0.7028550257 |
Kurtosis | -0.4330814259 |
Mean | 13.20154557 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 0.545312801 |
Sum | 436360719 |
Variance | 86.09562195 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1377202 | 4.2% |
1 | 1377070 | 4.2% |
7 | 1377069 | 4.2% |
11 | 1377069 | 4.2% |
10 | 1377069 | 4.2% |
9 | 1377069 | 4.2% |
8 | 1377069 | 4.2% |
6 | 1377069 | 4.2% |
5 | 1377069 | 4.2% |
4 | 1377069 | 4.2% |
Other values (91) | 19282936 |
Value | Count | Frequency (%) |
0 | 1377202 | |
1 | 1377070 | |
2 | 1377069 | |
3 | 1377069 | |
4 | 1377069 |
Value | Count | Frequency (%) |
100 | 1 | < 0.1% |
99 | 1 | < 0.1% |
98 | 60 | |
97 | 60 | |
96 | 60 |
pmts_dpd_1073P
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 3971 |
---|---|
Distinct (%) | 0.1% |
Missing | 27121373 |
Missing (%) | 82.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12.62910764 |
Minimum | 0 |
---|---|
Maximum | 4595 |
Zeros | 5596903 |
Zeros (%) | 16.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 252.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 1 |
Maximum | 4595 |
Range | 4595 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 143.0431327 |
---|---|
Coefficient of variation (CV) | 11.32646397 |
Kurtosis | 325.6255625 |
Mean | 12.62910764 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 16.44645718 |
Sum | 74920754 |
Variance | 20461.33781 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 5596903 | 16.9% |
1 | 55318 | 0.2% |
2 | 20595 | 0.1% |
3 | 19988 | 0.1% |
4 | 16944 | 0.1% |
7 | 10950 | < 0.1% |
5 | 10159 | < 0.1% |
6 | 9495 | < 0.1% |
8 | 7566 | < 0.1% |
10 | 7525 | < 0.1% |
Other values (3961) | 176944 | 0.5% |
(Missing) | 27121373 |
Value | Count | Frequency (%) |
0 | 5596903 | |
1 | 55318 | 0.2% |
2 | 20595 | 0.1% |
3 | 19988 | 0.1% |
4 | 16944 | 0.1% |
Value | Count | Frequency (%) |
4595 | 1 | |
4590 | 1 | |
4562 | 1 | |
4537 | 1 | |
4536 | 2 |
pmts_dpd_303P
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 4338 |
---|---|
Distinct (%) | < 0.1% |
Missing | 18691259 |
Missing (%) | 56.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 54.52469796 |
Minimum | -30 |
---|---|
Maximum | 117000 |
Zeros | 12092577 |
Zeros (%) | 36.6% |
Negative | 2579 |
Negative (%) | < 0.1% |
Memory size | 252.2 MiB |
Quantile statistics
Minimum | -30 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 322 |
Maximum | 117000 |
Range | 117030 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 274.7676134 |
---|---|
Coefficient of variation (CV) | 5.039323897 |
Kurtosis | 8288.849719 |
Mean | 54.52469796 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 31.61005875 |
Sum | 783111029 |
Variance | 75497.24137 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 12092577 | |
1 | 333585 | 1.0% |
3 | 87221 | 0.3% |
2 | 82689 | 0.3% |
4 | 71130 | 0.2% |
6 | 58203 | 0.2% |
5 | 46779 | 0.1% |
7 | 45654 | 0.1% |
9 | 32873 | 0.1% |
8 | 31830 | 0.1% |
Other values (4328) | 1479960 | 4.5% |
(Missing) | 18691259 |
Value | Count | Frequency (%) |
-30 | 1 | < 0.1% |
-10 | 11 | < 0.1% |
-9 | 2 | < 0.1% |
-8 | 24 | |
-7 | 57 |
Value | Count | Frequency (%) |
117000 | 1 | < 0.1% |
84575 | 3 | |
84574 | 1 | < 0.1% |
84561 | 1 | < 0.1% |
84560 | 1 | < 0.1% |
pmts_month_158T
Real number (ℝ)
MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 23804232 |
Missing (%) | 72.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 252.2 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.452052716 |
---|---|
Coefficient of variation (CV) | 0.5310850333 |
Kurtosis | -1.216783226 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 60121932 |
Variance | 11.91666796 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 770794 | 2.3% |
3 | 770794 | 2.3% |
4 | 770794 | 2.3% |
5 | 770794 | 2.3% |
6 | 770794 | 2.3% |
7 | 770794 | 2.3% |
8 | 770794 | 2.3% |
9 | 770794 | 2.3% |
10 | 770794 | 2.3% |
11 | 770794 | 2.3% |
Other values (2) | 1541588 | 4.7% |
(Missing) | 23804232 |
Value | Count | Frequency (%) |
1 | 770794 | |
2 | 770794 | |
3 | 770794 | |
4 | 770794 | |
5 | 770794 |
Value | Count | Frequency (%) |
12 | 770794 | |
11 | 770794 | |
10 | 770794 | |
9 | 770794 | |
8 | 770794 |
pmts_month_706T
Real number (ℝ)
MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 2619852 |
Missing (%) | 7.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 252.2 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.452052586 |
---|---|
Coefficient of variation (CV) | 0.5310850133 |
Kurtosis | -1.21678322 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 197820402 |
Variance | 11.91666706 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 2536159 | |
3 | 2536159 | |
4 | 2536159 | |
5 | 2536159 | |
6 | 2536159 | |
7 | 2536159 | |
8 | 2536159 | |
9 | 2536159 | |
10 | 2536159 | |
11 | 2536159 | |
Other values (2) | 5072318 | |
(Missing) | 2619852 |
Value | Count | Frequency (%) |
1 | 2536159 | |
2 | 2536159 | |
3 | 2536159 | |
4 | 2536159 | |
5 | 2536159 |
Value | Count | Frequency (%) |
12 | 2536159 | |
11 | 2536159 | |
10 | 2536159 | |
9 | 2536159 | |
8 | 2536159 |
pmts_overdue_1140A
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 241676 |
---|---|
Distinct (%) | 4.1% |
Missing | 27091280 |
Missing (%) | 82.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2047.064177 |
Minimum | 0 |
---|---|
Maximum | 401980320 |
Zeros | 5621230 |
Zeros (%) | 17.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 252.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 464.67798 |
Maximum | 401980320 |
Range | 401980320 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 265823.9755 |
---|---|
Coefficient of variation (CV) | 129.8562001 |
Kurtosis | 1777281.069 |
Mean | 2047.064177 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1218.634842 |
Sum | 1.220557921 × 1010 |
Variance | 7.066238593 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 5621230 | 17.0% |
400 | 470 | < 0.1% |
14 | 388 | < 0.1% |
1000 | 387 | < 0.1% |
10 | 372 | < 0.1% |
2000 | 296 | < 0.1% |
0.8 | 251 | < 0.1% |
2600 | 250 | < 0.1% |
0.2 | 244 | < 0.1% |
2 | 231 | < 0.1% |
Other values (241666) | 338361 | 1.0% |
(Missing) | 27091280 |
Value | Count | Frequency (%) |
0 | 5621230 | |
0.002 | 24 | < 0.1% |
0.004 | 30 | < 0.1% |
0.006 | 25 | < 0.1% |
0.008 | 36 | < 0.1% |
Value | Count | Frequency (%) |
401980320 | 2 | < 0.1% |
132118310 | 2 | < 0.1% |
50082108 | 5 | |
48865336 | 1 | < 0.1% |
48513324 | 1 | < 0.1% |
pmts_overdue_1152A
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 946534 |
---|---|
Distinct (%) | 6.6% |
Missing | 18674260 |
Missing (%) | 56.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4190.192563 |
Minimum | 0 |
---|---|
Maximum | 50906470 |
Zeros | 12008791 |
Zeros (%) | 36.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 252.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 14885.33825 |
Maximum | 50906470 |
Range | 50906470 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 105340.4401 |
---|---|
Coefficient of variation (CV) | 25.13976113 |
Kurtosis | 122510.518 |
Mean | 4190.192563 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 305.0209325 |
Sum | 6.025287397 × 1010 |
Variance | 1.109660832 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 12008791 | |
0.2 | 7194 | < 0.1% |
1000 | 4299 | < 0.1% |
0.4 | 3012 | < 0.1% |
2000 | 2910 | < 0.1% |
3000 | 2481 | < 0.1% |
0.8 | 2258 | < 0.1% |
0.6 | 2066 | < 0.1% |
2 | 2066 | < 0.1% |
1.6 | 1957 | < 0.1% |
Other values (946524) | 2342466 | 7.1% |
(Missing) | 18674260 |
Value | Count | Frequency (%) |
0 | 12008791 | |
0.002 | 215 | < 0.1% |
0.004 | 113 | < 0.1% |
0.006 | 111 | < 0.1% |
0.008 | 89 | < 0.1% |
Value | Count | Frequency (%) |
50906470 | 1 | |
48977290 | 1 | |
48927640 | 1 | |
48927636 | 2 | |
48807636 | 1 |
pmts_year_1139T
Real number (ℝ)
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 23804232 |
Missing (%) | 72.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2018.580244 |
Minimum | 2016 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 252.2 MiB |
Quantile statistics
Minimum | 2016 |
---|---|
5-th percentile | 2017 |
Q1 | 2018 |
median | 2019 |
Q3 | 2019 |
95-th percentile | 2020 |
Maximum | 2021 |
Range | 5 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 0.7459096442 |
---|---|
Coefficient of variation (CV) | 0.0003695219184 |
Kurtosis | -0.04878475407 |
Mean | 2018.580244 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -0.534206743 |
Sum | 1.867091449 × 1010 |
Variance | 0.5563811973 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2019 | 5182120 | 15.7% |
2018 | 2635644 | 8.0% |
2017 | 894940 | 2.7% |
2020 | 529346 | 1.6% |
2021 | 7214 | < 0.1% |
2016 | 264 | < 0.1% |
(Missing) | 23804232 |
Value | Count | Frequency (%) |
2016 | 264 | < 0.1% |
2017 | 894940 | 2.7% |
2018 | 2635644 | |
2019 | 5182120 | |
2020 | 529346 | 1.6% |
Value | Count | Frequency (%) |
2021 | 7214 | < 0.1% |
2020 | 529346 | 1.6% |
2019 | 5182120 | |
2018 | 2635644 | |
2017 | 894940 | 2.7% |
pmts_year_507T
Real number (ℝ)
MISSING
 
Distinct | 21 |
---|---|
Distinct (%) | < 0.1% |
Missing | 2619852 |
Missing (%) | 7.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2014.328105 |
Minimum | 2001 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 252.2 MiB |
Quantile statistics
Minimum | 2001 |
---|---|
5-th percentile | 2007 |
Q1 | 2012 |
median | 2015 |
Q3 | 2018 |
95-th percentile | 2019 |
Maximum | 2021 |
Range | 20 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 3.93218005 |
---|---|
Coefficient of variation (CV) | 0.001952105042 |
Kurtosis | -0.6057916566 |
Mean | 2014.328105 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.6855102705 |
Sum | 6.130387622 × 1010 |
Variance | 15.46203994 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2018 | 4590473 | |
2017 | 3791656 | |
2019 | 3340832 | |
2016 | 2736638 | |
2015 | 2443386 | |
2014 | 2274555 | |
2013 | 1980486 | 6.0% |
2012 | 1628101 | 4.9% |
2011 | 1421849 | 4.3% |
2007 | 1196710 | 3.6% |
Other values (11) | 5029222 | |
(Missing) | 2619852 |
Value | Count | Frequency (%) |
2001 | 209 | < 0.1% |
2002 | 1108 | < 0.1% |
2003 | 2761 | < 0.1% |
2004 | 71995 | 0.2% |
2005 | 342980 |
Value | Count | Frequency (%) |
2021 | 157 | < 0.1% |
2020 | 270177 | 0.8% |
2019 | 3340832 | |
2018 | 4590473 | |
2017 | 3791656 |
Distinct | 10 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 252.2 MiB |
Value | Count | Frequency (%) |
a55475b1 | 31739743 | |
ab3c25cf | 1288784 | 3.9% |
15f04f45 | 13783 | < 0.1% |
be4fd70b | 6821 | < 0.1% |
daf49a8a | 4530 | < 0.1% |
71ddaa88 | 49 | < 0.1% |
0c42a10e | 24 | < 0.1% |
1d94eac1 | 20 | < 0.1% |
9ba4314a | 4 | < 0.1% |
652d52e3 | 2 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 96535583 | |
a | 33042267 | 12.5% |
b | 33042173 | 12.5% |
4 | 31778712 | 12.0% |
1 | 31753643 | 12.0% |
7 | 31746613 | 12.0% |
c | 2577612 | 1.0% |
f | 1327701 | 0.5% |
2 | 1288812 | 0.5% |
3 | 1288790 | 0.5% |
Other values (6) | 48174 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 194421989 | |
Lowercase Letter | 70008091 | 26.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 96535583 | |
4 | 31778712 | 16.3% |
1 | 31753643 | 16.3% |
7 | 31746613 | 16.3% |
2 | 1288812 | 0.7% |
3 | 1288790 | 0.7% |
0 | 20652 | < 0.1% |
8 | 4628 | < 0.1% |
9 | 4554 | < 0.1% |
6 | 2 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 33042267 | |
b | 33042173 | |
c | 2577612 | 3.7% |
f | 1327701 | 1.9% |
d | 11471 | < 0.1% |
e | 6867 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 194421989 | |
Latin | 70008091 | 26.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 96535583 | |
4 | 31778712 | 16.3% |
1 | 31753643 | 16.3% |
7 | 31746613 | 16.3% |
2 | 1288812 | 0.7% |
3 | 1288790 | 0.7% |
0 | 20652 | < 0.1% |
8 | 4628 | < 0.1% |
9 | 4554 | < 0.1% |
6 | 2 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 33042267 | |
b | 33042173 | |
c | 2577612 | 3.7% |
f | 1327701 | 1.9% |
d | 11471 | < 0.1% |
e | 6867 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 264430080 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 96535583 | |
a | 33042267 | 12.5% |
b | 33042173 | 12.5% |
4 | 31778712 | 12.0% |
1 | 31753643 | 12.0% |
7 | 31746613 | 12.0% |
c | 2577612 | 1.0% |
f | 1327701 | 0.5% |
2 | 1288812 | 0.5% |
3 | 1288790 | 0.5% |
Other values (6) | 48174 | < 0.1% |
Distinct | 9 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 252.2 MiB |
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 8.000020784 |
Min length | 8 |
Characters and Unicode
Total characters | 264430767 |
---|---|
Distinct characters | 18 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | ab3c25cf |
---|---|
2nd row | a55475b1 |
3rd row | a55475b1 |
4th row | a55475b1 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 32600314 | |
ab3c25cf | 441403 | 1.3% |
be4fd70b | 4584 | < 0.1% |
15f04f45 | 3496 | < 0.1% |
daf49a8a | 3257 | < 0.1% |
p28_48_88 | 687 | < 0.1% |
0c42a10e | 11 | < 0.1% |
71ddaa88 | 7 | < 0.1% |
652d52e3 | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 98249339 | |
a | 33051513 | 12.5% |
b | 33050885 | 12.5% |
4 | 32615845 | 12.3% |
7 | 32604905 | 12.3% |
1 | 32603828 | 12.3% |
c | 882817 | 0.3% |
f | 456236 | 0.2% |
2 | 442103 | 0.2% |
3 | 441404 | 0.2% |
Other values (8) | 31892 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 196974803 | |
Lowercase Letter | 67453903 | 25.5% |
Connector Punctuation | 1374 | < 0.1% |
Uppercase Letter | 687 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 98249339 | |
4 | 32615845 | 16.6% |
7 | 32604905 | 16.6% |
1 | 32603828 | 16.6% |
2 | 442103 | 0.2% |
3 | 441404 | 0.2% |
0 | 8102 | < 0.1% |
8 | 6019 | < 0.1% |
9 | 3257 | < 0.1% |
6 | 1 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 33051513 | |
b | 33050885 | |
c | 882817 | 1.3% |
f | 456236 | 0.7% |
d | 7856 | < 0.1% |
e | 4596 | < 0.1% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1374 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 687 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 196976177 | |
Latin | 67454590 | 25.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 98249339 | |
4 | 32615845 | 16.6% |
7 | 32604905 | 16.6% |
1 | 32603828 | 16.6% |
2 | 442103 | 0.2% |
3 | 441404 | 0.2% |
0 | 8102 | < 0.1% |
8 | 6019 | < 0.1% |
9 | 3257 | < 0.1% |
_ | 1374 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 33051513 | |
b | 33050885 | |
c | 882817 | 1.3% |
f | 456236 | 0.7% |
d | 7856 | < 0.1% |
e | 4596 | < 0.1% |
P | 687 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 264430767 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 98249339 | |
a | 33051513 | 12.5% |
b | 33050885 | 12.5% |
4 | 32615845 | 12.3% |
7 | 32604905 | 12.3% |
1 | 32603828 | 12.3% |
c | 882817 | 0.3% |
f | 456236 | 0.2% |
2 | 442103 | 0.2% |
3 | 441404 | 0.2% |
Other values (8) | 31892 | < 0.1% |