Dataset statistics
Number of variables | 19 |
---|---|
Number of observations | 4386062 |
Missing cells | 28029984 |
Missing cells (%) | 33.6% |
Total size in memory | 635.8 MiB |
Average record size in memory | 152.0 B |
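The percentage and per-record figures in this table can be recomputed from the raw counts. The following is a minimal sketch, assuming that "Missing cells (%)" is missing cells divided by variables × observations and that 1 MiB is 2^20 bytes; the variable names are illustrative and not part of the report.

```python
# Counts copied from the dataset statistics table.
n_variables = 19
n_observations = 4_386_062
missing_cells = 28_029_984
total_size_mib = 635.8

# Assumed definitions: share of all cells that are missing,
# and total memory divided evenly over the observations.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
bytes_per_record = total_size_mib * 2**20 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {bytes_per_record:.1f} B")
```

Under these assumptions both values round to the figures shown in the table.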
Variable types
Numeric | 13 |
---|---|
Text | 6 |
collater_valueofguarantee_1124L has 4340725 (99.0%) missing values | Missing |
collater_valueofguarantee_876L has 4209154 (96.0%) missing values | Missing |
pmts_dpd_1073P has 3729399 (85.0%) missing values | Missing |
pmts_dpd_303P has 2445060 (55.7%) missing values | Missing |
pmts_month_158T has 3280058 (74.8%) missing values | Missing |
pmts_month_706T has 288770 (6.6%) missing values | Missing |
pmts_overdue_1140A has 3725455 (84.9%) missing values | Missing |
pmts_overdue_1152A has 2442535 (55.7%) missing values | Missing |
pmts_year_1139T has 3280058 (74.8%) missing values | Missing |
pmts_year_507T has 288770 (6.6%) missing values | Missing |
collater_valueofguarantee_1124L is highly skewed (γ1 = 81.69064668) | Skewed |
collater_valueofguarantee_876L is highly skewed (γ1 = 80.87769684) | Skewed |
pmts_dpd_1073P is highly skewed (γ1 = 26.9832622) | Skewed |
pmts_dpd_303P is highly skewed (γ1 = 92.41077498) | Skewed |
pmts_overdue_1140A is highly skewed (γ1 = 105.9926051) | Skewed |
pmts_overdue_1152A is highly skewed (γ1 = 186.8551368) | Skewed |
collater_valueofguarantee_876L has 161055 (3.7%) zeros | Zeros |
num_group1 has 755928 (17.2%) zeros | Zeros |
num_group2 has 178326 (4.1%) zeros | Zeros |
pmts_dpd_1073P has 624821 (14.2%) zeros | Zeros |
pmts_dpd_303P has 1622067 (37.0%) zeros | Zeros |
pmts_overdue_1140A has 628168 (14.3%) zeros | Zeros |
pmts_overdue_1152A has 1612253 (36.8%) zeros | Zeros |
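The "Skewed" alerts report a skewness coefficient γ1. The sketch below shows one common estimator, the adjusted Fisher-Pearson coefficient, on a made-up column that is almost entirely zeros with a single large value, which is the shape the flagged columns have. That the report uses exactly this estimator is an assumption, and the toy data is hypothetical.

```python
import math

def sample_skewness(values):
    """Adjusted Fisher-Pearson skewness (needs n >= 3 and nonzero variance)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    g1 = m3 / m2 ** 1.5                            # biased coefficient
    return math.sqrt(n * (n - 1)) / (n - 2) * g1   # small-sample adjustment

# Hypothetical toy column: 99 zeros and one large value.
toy = [0.0] * 99 + [100.0]
# A symmetric sample for comparison.
symmetric = [1.0, 2.0, 3.0, 4.0, 5.0]

print(sample_skewness(toy))
print(sample_skewness(symmetric))
```

A single extreme value among many zeros is enough to push the coefficient far above that of a symmetric sample, which is consistent with the columns above being both zero-heavy and skewed.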
Reproduction
Analysis started | 2024-02-13 19:44:21.892070 |
---|---|
Analysis finished | 2024-02-13 19:44:33.608998 |
Duration | 11.72 seconds |
Software version | ydata-profiling v4.6.4 |
Download configuration | config.json |
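The reported duration can be checked against the two timestamps. A minimal sketch using only the Python standard library; the format string is an assumption that matches the timestamps as printed here.

```python
from datetime import datetime

# Timestamp layout as printed in the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-02-13 19:44:21.892070", FMT)
finished = datetime.strptime("2024-02-13 19:44:33.608998", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```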
case_id
Real number (ℝ)
Distinct | 23734 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1570147.672 |
Minimum | 56408 |
---|---|
Maximum | 2703454 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 33.5 MiB |
Quantile statistics
Minimum | 56408 |
---|---|
5-th percentile | 256297 |
Q1 | 1023878 |
median | 1939098 |
Q3 | 1944519 |
95-th percentile | 2702346 |
Maximum | 2703454 |
Range | 2647046 |
Interquartile range (IQR) | 920641 |
Descriptive statistics
Standard deviation | 819484.4816 |
---|---|
Coefficient of variation (CV) | 0.5219155474 |
Kurtosis | -0.9099583482 |
Mean | 1570147.672 |
Median Absolute Deviation (MAD) | 6973 |
Skewness | -0.5919843554 |
Sum | 6.886765038 × 10^12 |
Variance | 6.715548156 × 10^11 |
Monotonicity | Increasing |
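Several of the case_id statistics are derived from others in the same tables. The sketch below recomputes the range, the interquartile range, and the coefficient of variation from the reported minimum, maximum, quartiles, mean, and standard deviation; the variable names are illustrative.

```python
# Values copied from the case_id tables.
minimum, maximum = 56_408, 2_703_454
q1, q3 = 1_023_878, 1_944_519
mean, std = 1570147.672, 819484.4816

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)

print(value_range, iqr, round(cv, 10))
```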
Value | Count | Frequency (%) |
1936653 | 2004 | < 0.1% |
257343 | 1716 | < 0.1% |
257052 | 1704 | < 0.1% |
1939829 | 1572 | < 0.1% |
1025405 | 1332 | < 0.1% |
255943 | 1320 | < 0.1% |
1945583 | 1272 | < 0.1% |
1937123 | 1224 | < 0.1% |
1939842 | 1212 | < 0.1% |
1942430 | 1212 | < 0.1% |
Other values (23724) | 4371494 |
Value | Count | Frequency (%) |
56408 | 108 | |
56451 | 24 | < 0.1% |
56556 | 192 | |
56579 | 84 | |
56703 | 24 | < 0.1% |
Value | Count | Frequency (%) |
2703454 | 252 | |
2703453 | 348 | |
2703452 | 72 | < 0.1% |
2703451 | 96 | < 0.1% |
2703450 | 252 |
Distinct | 5 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 33.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 4340725 | |
9a0c095e | 31104 | 0.7% |
8fd95e4b | 14216 | 0.3% |
06fb9ba8 | 14 | < 0.1% |
26cf31be | 3 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 13067495 | |
a | 4371843 | 12.5% |
b | 4354972 | 12.4% |
4 | 4354941 | 12.4% |
1 | 4340728 | 12.4% |
7 | 4340725 | 12.4% |
9 | 76438 | 0.2% |
0 | 62222 | 0.2% |
e | 45323 | 0.1% |
c | 31107 | 0.1% |
Other values (6) | 42702 | 0.1% |
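The character counts in this table follow from the value counts of the same variable: each character of a value is counted once per row holding that value. A minimal sketch with `collections.Counter`, using the five values and counts listed for this variable:

```python
from collections import Counter

# Value counts copied from the table for this text variable.
value_counts = {
    "a55475b1": 4_340_725,
    "9a0c095e": 31_104,
    "8fd95e4b": 14_216,
    "06fb9ba8": 14,
    "26cf31be": 3,
}

# Weight every character by the number of rows containing its value.
char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        char_counts[ch] += count

total_chars = sum(char_counts.values())
for ch, count in char_counts.most_common(3):
    print(ch, count, f"{100 * count / total_chars:.1f}%")
```

The same loop also gives the share for rows where the frequency column was not captured, since each share is just the count divided by `total_chars`.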
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 26256802 | |
Lowercase Letter | 8831694 | 25.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 13067495 | |
4 | 4354941 | 16.6% |
1 | 4340728 | 16.5% |
7 | 4340725 | 16.5% |
9 | 76438 | 0.3% |
0 | 62222 | 0.2% |
8 | 14230 | 0.1% |
6 | 17 | < 0.1% |
2 | 3 | < 0.1% |
3 | 3 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 4371843 | |
b | 4354972 | |
e | 45323 | 0.5% |
c | 31107 | 0.4% |
f | 14233 | 0.2% |
d | 14216 | 0.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 26256802 | |
Latin | 8831694 | 25.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 13067495 | |
4 | 4354941 | 16.6% |
1 | 4340728 | 16.5% |
7 | 4340725 | 16.5% |
9 | 76438 | 0.3% |
0 | 62222 | 0.2% |
8 | 14230 | 0.1% |
6 | 17 | < 0.1% |
2 | 3 | < 0.1% |
3 | 3 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 4371843 | |
b | 4354972 | |
e | 45323 | 0.5% |
c | 31107 | 0.4% |
f | 14233 | 0.2% |
d | 14216 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35088496 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 13067495 | |
a | 4371843 | 12.5% |
b | 4354972 | 12.4% |
4 | 4354941 | 12.4% |
1 | 4340728 | 12.4% |
7 | 4340725 | 12.4% |
9 | 76438 | 0.2% |
0 | 62222 | 0.2% |
e | 45323 | 0.1% |
c | 31107 | 0.1% |
Other values (6) | 42702 | 0.1% |
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 33.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 4209154 | |
9a0c095e | 94501 | 2.2% |
8fd95e4b | 82177 | 1.9% |
06fb9ba8 | 197 | < 0.1% |
3cbe86ba | 32 | < 0.1% |
9276e4bb | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 12804140 | |
a | 4303884 | 12.3% |
b | 4291791 | 12.2% |
4 | 4291332 | 12.2% |
7 | 4209155 | 12.0% |
1 | 4209154 | 12.0% |
9 | 271377 | 0.8% |
0 | 189199 | 0.5% |
e | 176711 | 0.5% |
c | 94533 | 0.3% |
Other values (6) | 247220 | 0.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 26057026 | |
Lowercase Letter | 9031470 | 25.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 12804140 | |
4 | 4291332 | 16.5% |
7 | 4209155 | 16.2% |
1 | 4209154 | 16.2% |
9 | 271377 | 1.0% |
0 | 189199 | 0.7% |
8 | 82406 | 0.3% |
6 | 230 | < 0.1% |
3 | 32 | < 0.1% |
2 | 1 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 4303884 | |
b | 4291791 | |
e | 176711 | 2.0% |
c | 94533 | 1.0% |
f | 82374 | 0.9% |
d | 82177 | 0.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 26057026 | |
Latin | 9031470 | 25.7% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 12804140 | |
4 | 4291332 | 16.5% |
7 | 4209155 | 16.2% |
1 | 4209154 | 16.2% |
9 | 271377 | 1.0% |
0 | 189199 | 0.7% |
8 | 82406 | 0.3% |
6 | 230 | < 0.1% |
3 | 32 | < 0.1% |
2 | 1 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 4303884 | |
b | 4291791 | |
e | 176711 | 2.0% |
c | 94533 | 1.0% |
f | 82374 | 0.9% |
d | 82177 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35088496 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 12804140 | |
a | 4303884 | 12.3% |
b | 4291791 | 12.2% |
4 | 4291332 | 12.2% |
7 | 4209155 | 12.0% |
1 | 4209154 | 12.0% |
9 | 271377 | 0.8% |
0 | 189199 | 0.5% |
e | 176711 | 0.5% |
c | 94533 | 0.3% |
Other values (6) | 247220 | 0.7% |
collater_valueofguarantee_1124L
Real number (ℝ)
MISSING, SKEWED
Distinct | 2374 |
---|---|
Distinct (%) | 5.2% |
Missing | 4340725 |
Missing (%) | 99.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1250201.297 |
Minimum | 0 |
---|---|
Maximum | 3515294000 |
Zeros | 42324 |
Zeros (%) | 1.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 33.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 3007364.784 |
Maximum | 3515294000 |
Range | 3515294000 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 32106519.85 |
---|---|
Coefficient of variation (CV) | 25.68108027 |
Kurtosis | 7871.848916 |
Mean | 1250201.297 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 81.69064668 |
Sum | 5.668037618 × 10^10 |
Variance | 1.030828617 × 10^15 |
Monotonicity | Not monotonic |
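For a column that is almost entirely missing, it matters which denominator each percentage uses. The sketch below reproduces the figures for collater_valueofguarantee_1124L under the assumption, inferred from the numbers in this report rather than from the tool's documentation, that "Distinct (%)" is relative to non-missing rows while "Missing (%)" and "Zeros (%)" are relative to all rows.

```python
# Counts copied from the overview of collater_valueofguarantee_1124L.
n_rows = 4_386_062
n_missing = 4_340_725
n_distinct = 2_374
n_zeros = 42_324
mean = 1250201.297

n_present = n_rows - n_missing                # rows with a value

distinct_pct = 100 * n_distinct / n_present   # assumed: non-missing rows only
missing_pct = 100 * n_missing / n_rows        # assumed: all rows
zeros_pct = 100 * n_zeros / n_rows            # assumed: all rows
approx_sum = mean * n_present                 # mean ignores missing values

print(n_present, round(distinct_pct, 1), round(missing_pct, 1), round(zeros_pct, 1))
print(f"{approx_sum:.4e}")
```

Only 45,337 rows carry a value, so the mean, sum, and quantiles describe about 1% of the dataset.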
Value | Count | Frequency (%) |
0 | 42324 | 1.0% |
500000 | 20 | < 0.1% |
12000000 | 19 | < 0.1% |
200000 | 15 | < 0.1% |
4000000 | 15 | < 0.1% |
11385000 | 14 | < 0.1% |
3000000 | 14 | < 0.1% |
2000000 | 13 | < 0.1% |
5000000 | 11 | < 0.1% |
30000000 | 11 | < 0.1% |
Other values (2364) | 2881 | 0.1% |
(Missing) | 4340725 |
Value | Count | Frequency (%) |
0 | 42324 | |
1 | 10 | < 0.1% |
1150 | 1 | < 0.1% |
2712 | 1 | < 0.1% |
3800 | 1 | < 0.1% |
Value | Count | Frequency (%) |
3515294000 | 1 | < 0.1% |
3200000000 | 2 | |
1800000000 | 1 | < 0.1% |
1050000000 | 1 | < 0.1% |
1000000000 | 4 |
collater_valueofguarantee_876L
Real number (ℝ)
MISSING, SKEWED, ZEROS
Distinct | 6556 |
---|---|
Distinct (%) | 3.7% |
Missing | 4209154 |
Missing (%) | 96.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 786298.2358 |
Minimum | 0 |
---|---|
Maximum | 3250000000 |
Zeros | 161055 |
Zeros (%) | 3.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 33.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 130000 |
Maximum | 3250000000 |
Range | 3250000000 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 31452025.18 |
---|---|
Coefficient of variation (CV) | 40.00012178 |
Kurtosis | 7380.550392 |
Mean | 786298.2358 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 80.87769684 |
Sum | 1.391024483 × 10^11 |
Variance | 9.892298882 × 10^14 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 161055 | 3.7% |
60000 | 464 | < 0.1% |
130000 | 359 | < 0.1% |
100000 | 345 | < 0.1% |
30000000 | 249 | < 0.1% |
50000 | 205 | < 0.1% |
65000 | 192 | < 0.1% |
70000 | 161 | < 0.1% |
150000 | 161 | < 0.1% |
80000 | 160 | < 0.1% |
Other values (6546) | 13557 | 0.3% |
(Missing) | 4209154 |
Value | Count | Frequency (%) |
0 | 161055 | |
0.99 | 3 | < 0.1% |
1 | 46 | < 0.1% |
1.8 | 2 | < 0.1% |
4 | 1 | < 0.1% |
Value | Count | Frequency (%) |
3250000000 | 5 | |
3200000000 | 5 | |
2000000000 | 11 | |
1200000000 | 7 | |
1015106700 | 1 | < 0.1% |
Distinct | 15 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 33.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 4209249 | |
c7a5ad39 | 140275 | 3.2% |
3cbe86ba | 24588 | 0.6% |
9276e4bb | 3686 | 0.1% |
0e63c0f0 | 3265 | 0.1% |
168ad9f3 | 1180 | < 0.1% |
5224034a | 899 | < 0.1% |
7b62420e | 837 | < 0.1% |
940efad7 | 755 | < 0.1% |
2fd21cf1 | 481 | < 0.1% |
Other values (5) | 847 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 12769513 | |
a | 4517896 | 12.9% |
7 | 4355347 | 12.4% |
b | 4267036 | 12.2% |
4 | 4217196 | 12.0% |
1 | 4211675 | 12.0% |
3 | 170515 | 0.5% |
c | 169201 | 0.5% |
9 | 146512 | 0.4% |
d | 142828 | 0.4% |
Other values (6) | 120777 | 0.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 25951815 | |
Lowercase Letter | 9136681 | 26.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 12769513 | |
7 | 4355347 | 16.8% |
4 | 4217196 | 16.3% |
1 | 4211675 | 16.2% |
3 | 170515 | 0.7% |
9 | 146512 | 0.6% |
6 | 33958 | 0.1% |
8 | 26189 | 0.1% |
0 | 12653 | < 0.1% |
2 | 8257 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 4517896 | |
b | 4267036 | |
c | 169201 | 1.9% |
d | 142828 | 1.6% |
e | 33415 | 0.4% |
f | 6305 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 25951815 | |
Latin | 9136681 | 26.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 12769513 | |
7 | 4355347 | 16.8% |
4 | 4217196 | 16.3% |
1 | 4211675 | 16.2% |
3 | 170515 | 0.7% |
9 | 146512 | 0.6% |
6 | 33958 | 0.1% |
8 | 26189 | 0.1% |
0 | 12653 | < 0.1% |
2 | 8257 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 4517896 | |
b | 4267036 | |
c | 169201 | 1.9% |
d | 142828 | 1.6% |
e | 33415 | 0.4% |
f | 6305 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35088496 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 12769513 | |
a | 4517896 | 12.9% |
7 | 4355347 | 12.4% |
b | 4267036 | 12.2% |
4 | 4217196 | 12.0% |
1 | 4211675 | 12.0% |
3 | 170515 | 0.5% |
c | 169201 | 0.5% |
9 | 146512 | 0.4% |
d | 142828 | 0.4% |
Other values (6) | 120777 | 0.3% |
Distinct | 14 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 33.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 4340785 | |
c7a5ad39 | 41366 | 0.9% |
9276e4bb | 1591 | < 0.1% |
0e63c0f0 | 1109 | < 0.1% |
168ad9f3 | 442 | < 0.1% |
7b62420e | 439 | < 0.1% |
940efad7 | 116 | < 0.1% |
f4d8a027 | 79 | < 0.1% |
3cbe86ba | 51 | < 0.1% |
2fd21cf1 | 41 | < 0.1% |
Other values (4) | 43 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 13063762 | |
a | 4424232 | 12.6% |
7 | 4384396 | 12.5% |
b | 4344528 | 12.4% |
4 | 4343058 | 12.4% |
1 | 4341327 | 12.4% |
9 | 43523 | 0.1% |
3 | 42991 | 0.1% |
c | 42589 | 0.1% |
d | 42044 | 0.1% |
Other values (6) | 16046 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 26229951 | |
Lowercase Letter | 8858545 | 25.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 13063762 | |
7 | 4384396 | 16.7% |
4 | 4343058 | 16.6% |
1 | 4341327 | 16.6% |
9 | 43523 | 0.2% |
3 | 42991 | 0.2% |
0 | 3984 | < 0.1% |
6 | 3652 | < 0.1% |
2 | 2668 | < 0.1% |
8 | 590 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 4424232 | |
b | 4344528 | |
c | 42589 | 0.5% |
d | 42044 | 0.5% |
e | 3324 | < 0.1% |
f | 1828 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 26229951 | |
Latin | 8858545 | 25.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 13063762 | |
7 | 4384396 | 16.7% |
4 | 4343058 | 16.6% |
1 | 4341327 | 16.6% |
9 | 43523 | 0.2% |
3 | 42991 | 0.2% |
0 | 3984 | < 0.1% |
6 | 3652 | < 0.1% |
2 | 2668 | < 0.1% |
8 | 590 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 4424232 | |
b | 4344528 | |
c | 42589 | 0.5% |
d | 42044 | 0.5% |
e | 3324 | < 0.1% |
f | 1828 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35088496 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 13063762 | |
a | 4424232 | 12.6% |
7 | 4384396 | 12.5% |
b | 4344528 | 12.4% |
4 | 4343058 | 12.4% |
1 | 4341327 | 12.4% |
9 | 43523 | 0.1% |
3 | 42991 | 0.1% |
c | 42589 | 0.1% |
d | 42044 | 0.1% |
Other values (6) | 16046 | < 0.1% |
num_group1
Real number (ℝ)
ZEROS
 
Distinct | 120 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.491723555 |
Minimum | 0 |
---|---|
Maximum | 119 |
Zeros | 755928 |
Zeros (%) | 17.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 33.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 3 |
Q3 | 8 |
95-th percentile | 18 |
Maximum | 119 |
Range | 119 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 6.988234071 |
---|---|
Coefficient of variation (CV) | 1.272502886 |
Kurtosis | 29.00008007 |
Mean | 5.491723555 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 3.8286012 |
Sum | 24087040 |
Variance | 48.83541543 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 755928 | |
1 | 587413 | |
2 | 476894 | |
3 | 397025 | |
4 | 336111 | |
5 | 286647 | 6.5% |
6 | 242773 | 5.5% |
7 | 205885 | 4.7% |
8 | 173654 | 4.0% |
9 | 145200 | 3.3% |
Other values (110) | 778532 |
Value | Count | Frequency (%) |
0 | 755928 | |
1 | 587413 | |
2 | 476894 | |
3 | 397025 | |
4 | 336111 |
Value | Count | Frequency (%) |
119 | 36 | |
118 | 36 | |
117 | 36 | |
116 | 48 | |
115 | 36 |
num_group2
Real number (ℝ)
ZEROS
 
Distinct | 36 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.46609897 |
Minimum | 0 |
---|---|
Maximum | 35 |
Zeros | 178326 |
Zeros (%) | 4.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 33.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 6 |
median | 12 |
Q3 | 20 |
95-th percentile | 32 |
Maximum | 35 |
Range | 35 |
Interquartile range (IQR) | 14 |
Descriptive statistics
Standard deviation | 9.366761932 |
---|---|
Coefficient of variation (CV) | 0.6955809511 |
Kurtosis | -0.6928494774 |
Mean | 13.46609897 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 0.4895278852 |
Sum | 59063145 |
Variance | 87.73622909 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 178326 | 4.1% |
6 | 178306 | 4.1% |
11 | 178306 | 4.1% |
10 | 178306 | 4.1% |
8 | 178306 | 4.1% |
7 | 178306 | 4.1% |
9 | 178306 | 4.1% |
5 | 178306 | 4.1% |
4 | 178306 | 4.1% |
3 | 178306 | 4.1% |
Other values (26) | 2602982 |
Value | Count | Frequency (%) |
0 | 178326 | |
1 | 178306 | |
2 | 178306 | |
3 | 178306 | |
4 | 178306 |
Value | Count | Frequency (%) |
35 | 55441 | |
34 | 55441 | |
33 | 55441 | |
32 | 55441 | |
31 | 55441 |
pmts_dpd_1073P
Real number (ℝ)
MISSING, SKEWED, ZEROS
Distinct | 1678 |
---|---|
Distinct (%) | 0.3% |
Missing | 3729399 |
Missing (%) | 85.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.19676455 |
Minimum | 0 |
---|---|
Maximum | 4155 |
Zeros | 624821 |
Zeros (%) | 14.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 33.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 4155 |
Range | 4155 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 90.26236627 |
---|---|
Coefficient of variation (CV) | 17.36895435 |
Kurtosis | 864.1461394 |
Mean | 5.19676455 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 26.9832622 |
Sum | 3412523 |
Variance | 8147.294764 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 624821 | 14.2% |
1 | 5855 | 0.1% |
2 | 2542 | 0.1% |
3 | 1905 | < 0.1% |
4 | 1790 | < 0.1% |
7 | 1160 | < 0.1% |
5 | 1084 | < 0.1% |
6 | 1017 | < 0.1% |
8 | 1007 | < 0.1% |
10 | 838 | < 0.1% |
Other values (1668) | 14644 | 0.3% |
(Missing) | 3729399 |
Value | Count | Frequency (%) |
0 | 624821 | |
1 | 5855 | 0.1% |
2 | 2542 | 0.1% |
3 | 1905 | < 0.1% |
4 | 1790 | < 0.1% |
Value | Count | Frequency (%) |
4155 | 1 | |
4136 | 1 | |
4110 | 1 | |
4079 | 1 | |
4053 | 1 |
pmts_dpd_303P
Real number (ℝ)
MISSING, SKEWED, ZEROS
Distinct | 3930 |
---|---|
Distinct (%) | 0.2% |
Missing | 2445060 |
Missing (%) | 55.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 60.13521573 |
Minimum | -4 |
---|---|
Maximum | 144000 |
Zeros | 1622067 |
Zeros (%) | 37.0% |
Negative | 257 |
Negative (%) | < 0.1% |
Memory size | 33.5 MiB |
Quantile statistics
Minimum | -4 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 383 |
Maximum | 144000 |
Range | 144004 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 329.9638175 |
---|---|
Coefficient of variation (CV) | 5.48703141 |
Kurtosis | 30124.70291 |
Mean | 60.13521573 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 92.41077498 |
Sum | 116722574 |
Variance | 108876.1209 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1622067 | |
1 | 43495 | 1.0% |
3 | 11351 | 0.3% |
2 | 11135 | 0.3% |
4 | 9405 | 0.2% |
6 | 7649 | 0.2% |
7 | 6425 | 0.1% |
5 | 6305 | 0.1% |
9 | 4578 | 0.1% |
8 | 4463 | 0.1% |
Other values (3920) | 214129 | 4.9% |
(Missing) | 2445060 |
Value | Count | Frequency (%) |
-4 | 11 | < 0.1% |
-3 | 25 | < 0.1% |
-2 | 68 | < 0.1% |
-1 | 153 | < 0.1% |
0 | 1622067 |
Value | Count | Frequency (%) |
144000 | 1 | |
84574 | 1 | |
84561 | 2 | |
84532 | 1 | |
84505 | 1 |
pmts_month_158T
Real number (ℝ)
MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 3280058 |
Missing (%) | 74.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 33.5 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.45205409 |
---|---|
Coefficient of variation (CV) | 0.5310852446 |
Kurtosis | -1.216783293 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 7189026 |
Variance | 11.91667744 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 92167 | 2.1% |
3 | 92167 | 2.1% |
4 | 92167 | 2.1% |
5 | 92167 | 2.1% |
6 | 92167 | 2.1% |
7 | 92167 | 2.1% |
8 | 92167 | 2.1% |
9 | 92167 | 2.1% |
10 | 92167 | 2.1% |
11 | 92167 | 2.1% |
Other values (2) | 184334 | 4.2% |
(Missing) | 3280058 |
Value | Count | Frequency (%) |
1 | 92167 | |
2 | 92167 | |
3 | 92167 | |
4 | 92167 | |
5 | 92167 |
Value | Count | Frequency (%) |
12 | 92167 | |
11 | 92167 | |
10 | 92167 | |
9 | 92167 | |
8 | 92167 |
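The pmts_month_158T tables show every month from 1 to 12 with the same count, so the summary statistics can be rebuilt from the frequency table alone. A minimal sketch, assuming the reported variance is the sample variance with an n − 1 denominator:

```python
# Each month 1..12 appears 92,167 times in the frequency tables.
counts = {month: 92_167 for month in range(1, 13)}

n = sum(counts.values())                       # non-missing rows
total = sum(m * c for m, c in counts.items())  # Sum
mean = total / n                               # Mean
# Sample variance (n - 1 denominator), weighted by the counts.
variance = sum(c * (m - mean) ** 2 for m, c in counts.items()) / (n - 1)

print(n, total, mean, round(variance, 8))
```

A perfectly uniform distribution also accounts for the skewness of 0 and the negative kurtosis reported for this column.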
pmts_month_706T
Real number (ℝ)
MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 288770 |
Missing (%) | 6.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 33.5 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.452052951 |
---|---|
Coefficient of variation (CV) | 0.5310850694 |
Kurtosis | -1.216783237 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 26632398 |
Variance | 11.91666958 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 341441 | |
3 | 341441 | |
4 | 341441 | |
5 | 341441 | |
6 | 341441 | |
7 | 341441 | |
8 | 341441 | |
9 | 341441 | |
10 | 341441 | |
11 | 341441 | |
Other values (2) | 682882 |
Value | Count | Frequency (%) |
1 | 341441 | |
2 | 341441 | |
3 | 341441 | |
4 | 341441 | |
5 | 341441 |
Value | Count | Frequency (%) |
12 | 341441 | |
11 | 341441 | |
10 | 341441 | |
9 | 341441 | |
8 | 341441 |
pmts_overdue_1140A
Real number (ℝ)
MISSING, SKEWED, ZEROS
Distinct | 26436 |
---|---|
Distinct (%) | 4.0% |
Missing | 3725455 |
Missing (%) | 84.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 924.5240739 |
Minimum | 0 |
---|---|
Maximum | 8737926 |
Zeros | 628168 |
Zeros (%) | 14.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 33.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 8737926 |
Range | 8737926 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 38970.96235 |
---|---|
Coefficient of variation (CV) | 42.15245817 |
Kurtosis | 13629.83893 |
Mean | 924.5240739 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 105.9926051 |
Sum | 610747074.9 |
Variance | 1518735906 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 628168 | 14.3% |
99.8 | 89 | < 0.1% |
10 | 52 | < 0.1% |
14 | 45 | < 0.1% |
400 | 31 | < 0.1% |
0.2 | 29 | < 0.1% |
10.400001 | 27 | < 0.1% |
4 | 26 | < 0.1% |
10500 | 24 | < 0.1% |
69126.055 | 24 | < 0.1% |
Other values (26426) | 32092 | 0.7% |
(Missing) | 3725455 |
Value | Count | Frequency (%) |
0 | 628168 | |
0.002 | 1 | < 0.1% |
0.004 | 6 | < 0.1% |
0.006 | 2 | < 0.1% |
0.008 | 4 | < 0.1% |
Value | Count | Frequency (%) |
8737926 | 1 | |
6350786 | 1 | |
5523147 | 1 | |
5315128 | 1 | |
5182516 | 1 |
pmts_overdue_1152A
Real number (ℝ)
MISSING, SKEWED, ZEROS
Distinct | 170033 |
---|---|
Distinct (%) | 8.7% |
Missing | 2442535 |
Missing (%) | 55.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4111.655686 |
Minimum | 0 |
---|---|
Maximum | 17317146 |
Zeros | 1612253 |
Zeros (%) | 36.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 33.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 16963.4 |
Maximum | 17317146 |
Range | 17317146 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 64789.7787 |
---|---|
Coefficient of variation (CV) | 15.75758858 |
Kurtosis | 45567.63126 |
Mean | 4111.655686 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 186.8551368 |
Sum | 7991113840 |
Variance | 4197715424 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1612253 | |
0.2 | 894 | < 0.1% |
1000 | 471 | < 0.1% |
0.4 | 373 | < 0.1% |
2000 | 333 | < 0.1% |
0.6 | 310 | < 0.1% |
3000 | 297 | < 0.1% |
0.8 | 294 | < 0.1% |
1.6 | 286 | < 0.1% |
2 | 278 | < 0.1% |
Other values (170023) | 327738 | 7.5% |
(Missing) | 2442535 |
Value | Count | Frequency (%) |
0 | 1612253 | |
0.002 | 27 | < 0.1% |
0.004 | 18 | < 0.1% |
0.006 | 16 | < 0.1% |
0.008 | 19 | < 0.1% |
Value | Count | Frequency (%) |
17317146 | 1 | |
17230230 | 1 | |
17141206 | 1 | |
17051322 | 1 | |
16963518 | 1 |
pmts_year_1139T
Real number (ℝ)
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 3280058 |
Missing (%) | 74.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2019.388432 |
Minimum | 2016 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 33.5 MiB |
Quantile statistics
Minimum | 2016 |
---|---|
5-th percentile | 2018 |
Q1 | 2019 |
median | 2020 |
Q3 | 2020 |
95-th percentile | 2020 |
Maximum | 2021 |
Range | 5 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 0.8013438654 |
---|---|
Coefficient of variation (CV) | 0.0003968250253 |
Kurtosis | -0.697380051 |
Mean | 2019.388432 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.3462998149 |
Sum | 2233451683 |
Variance | 0.6421519906 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2020 | 520011 | 11.9% |
2019 | 362429 | 8.3% |
2018 | 179094 | 4.1% |
2021 | 44413 | 1.0% |
2017 | 35 | < 0.1% |
2016 | 22 | < 0.1% |
(Missing) | 3280058 |
Value | Count | Frequency (%) |
2016 | 22 | < 0.1% |
2017 | 35 | < 0.1% |
2018 | 179094 | 4.1% |
2019 | 362429 | |
2020 | 520011 |
Value | Count | Frequency (%) |
2021 | 44413 | 1.0% |
2020 | 520011 | |
2019 | 362429 | |
2018 | 179094 | 4.1% |
2017 | 35 | < 0.1% |
pmts_year_507T
Real number (ℝ)
MISSING
 
Distinct | 20 |
---|---|
Distinct (%) | < 0.1% |
Missing | 288770 |
Missing (%) | 6.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2015.066429 |
Minimum | 2002 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 33.5 MiB |
Quantile statistics
Minimum | 2002 |
---|---|
5-th percentile | 2007 |
Q1 | 2012 |
median | 2016 |
Q3 | 2018 |
95-th percentile | 2020 |
Maximum | 2021 |
Range | 19 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 4.072742933 |
---|---|
Coefficient of variation (CV) | 0.002021145743 |
Kurtosis | -0.4521212835 |
Mean | 2015.066429 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.76809658 |
Sum | 8256315557 |
Variance | 16.587235 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2019 | 603853 | |
2018 | 583318 | |
2017 | 432657 | |
2020 | 322828 | |
2016 | 316104 | |
2015 | 281667 | 6.4% |
2014 | 263154 | 6.0% |
2013 | 229842 | 5.2% |
2012 | 188788 | 4.3% |
2011 | 163820 | 3.7% |
Other values (10) | 711261 | |
(Missing) | 288770 |
Value | Count | Frequency (%) |
2002 | 88 | < 0.1% |
2003 | 316 | < 0.1% |
2004 | 7827 | 0.2% |
2005 | 39143 | 0.9% |
2006 | 99161 |
Value | Count | Frequency (%) |
2021 | 24768 | 0.6% |
2020 | 322828 | |
2019 | 603853 | |
2018 | 583318 | |
2017 | 432657 |
Distinct | 8 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 33.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 4210385 | |
ab3c25cf | 172826 | 3.9% |
15f04f45 | 1431 | < 0.1% |
be4fd70b | 917 | < 0.1% |
daf49a8a | 490 | < 0.1% |
71ddaa88 | 9 | < 0.1% |
0c42a10e | 2 | < 0.1% |
9ba4314a | 2 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 12806843 | |
b | 4385047 | 12.5% |
a | 4384705 | 12.5% |
4 | 4214660 | 12.0% |
1 | 4211829 | 12.0% |
7 | 4211311 | 12.0% |
c | 345654 | 1.0% |
f | 177095 | 0.5% |
3 | 172828 | 0.5% |
2 | 172828 | 0.5% |
Other values (5) | 5696 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 25793651 | |
Lowercase Letter | 9294845 | 26.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 12806843 | |
4 | 4214660 | 16.3% |
1 | 4211829 | 16.3% |
7 | 4211311 | 16.3% |
3 | 172828 | 0.7% |
2 | 172828 | 0.7% |
0 | 2352 | < 0.1% |
8 | 508 | < 0.1% |
9 | 492 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 4385047 | |
a | 4384705 | |
c | 345654 | 3.7% |
f | 177095 | 1.9% |
d | 1425 | < 0.1% |
e | 919 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 25793651 | |
Latin | 9294845 | 26.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 12806843 | |
4 | 4214660 | 16.3% |
1 | 4211829 | 16.3% |
7 | 4211311 | 16.3% |
3 | 172828 | 0.7% |
2 | 172828 | 0.7% |
0 | 2352 | < 0.1% |
8 | 508 | < 0.1% |
9 | 492 | < 0.1% |
Latin
Value | Count | Frequency (%) |
b | 4385047 | |
a | 4384705 | |
c | 345654 | 3.7% |
f | 177095 | 1.9% |
d | 1425 | < 0.1% |
e | 919 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35088496 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 12806843 | |
b | 4385047 | 12.5% |
a | 4384705 | 12.5% |
4 | 4214660 | 12.0% |
1 | 4211829 | 12.0% |
7 | 4211311 | 12.0% |
c | 345654 | 1.0% |
f | 177095 | 0.5% |
3 | 172828 | 0.5% |
2 | 172828 | 0.5% |
Other values (5) | 5696 | < 0.1% |
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 33.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 4341390 | |
ab3c25cf | 43626 | 1.0% |
be4fd70b | 408 | < 0.1% |
daf49a8a | 299 | < 0.1% |
15f04f45 | 269 | < 0.1% |
P28_48_88 | 70 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 13068334 | |
a | 4385913 | 12.5% |
b | 4385832 | 12.5% |
4 | 4342705 | 12.4% |
7 | 4341798 | 12.4% |
1 | 4341659 | 12.4% |
c | 87252 | 0.2% |
f | 44871 | 0.1% |
2 | 43696 | 0.1% |
3 | 43626 | 0.1% |
Other values (7) | 2880 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 26183373 | |
Lowercase Letter | 8904983 | 25.4% |
Connector Punctuation | 140 | < 0.1% |
Uppercase Letter | 70 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 13068334 | |
4 | 4342705 | 16.6% |
7 | 4341798 | 16.6% |
1 | 4341659 | 16.6% |
2 | 43696 | 0.2% |
3 | 43626 | 0.2% |
0 | 677 | < 0.1% |
8 | 579 | < 0.1% |
9 | 299 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 4385913 | |
b | 4385832 | |
c | 87252 | 1.0% |
f | 44871 | 0.5% |
d | 707 | < 0.1% |
e | 408 | < 0.1% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 140 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 70 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 26183513 | |
Latin | 8905053 | 25.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 13068334 | |
4 | 4342705 | 16.6% |
7 | 4341798 | 16.6% |
1 | 4341659 | 16.6% |
2 | 43696 | 0.2% |
3 | 43626 | 0.2% |
0 | 677 | < 0.1% |
8 | 579 | < 0.1% |
9 | 299 | < 0.1% |
_ | 140 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 4385913 | |
b | 4385832 | |
c | 87252 | 1.0% |
f | 44871 | 0.5% |
d | 707 | < 0.1% |
e | 408 | < 0.1% |
P | 70 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35088566 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 13068334 | |
a | 4385913 | 12.5% |
b | 4385832 | 12.5% |
4 | 4342705 | 12.4% |
7 | 4341798 | 12.4% |
1 | 4341659 | 12.4% |
c | 87252 | 0.2% |
f | 44871 | 0.1% |
2 | 43696 | 0.1% |
3 | 43626 | 0.1% |
Other values (7) | 2880 | < 0.1% |