Dataset statistics
Number of variables | 19 |
---|---|
Number of observations | 8055986 |
Missing cells | 51201435 |
Missing cells (%) | 33.5% |
Total size in memory | 1.1 GiB |
Average record size in memory | 152.0 B |
Variable types
Numeric | 13 |
---|---|
Text | 6 |
collater_valueofguarantee_1124L has 7956618 (98.8%) missing values | Missing |
collater_valueofguarantee_876L has 7720433 (95.8%) missing values | Missing |
pmts_dpd_1073P has 6871133 (85.3%) missing values | Missing |
pmts_dpd_303P has 4608761 (57.2%) missing values | Missing |
pmts_month_158T has 5585750 (69.3%) missing values | Missing |
pmts_month_706T has 702590 (8.7%) missing values | Missing |
pmts_overdue_1140A has 6863527 (85.2%) missing values | Missing |
pmts_overdue_1152A has 4604283 (57.2%) missing values | Missing |
pmts_year_1139T has 5585750 (69.3%) missing values | Missing |
pmts_year_507T has 702590 (8.7%) missing values | Missing |
collater_valueofguarantee_876L is highly skewed (γ1 = 46.93143835) | Skewed |
pmts_dpd_303P is highly skewed (γ1 = 36.80773901) | Skewed |
pmts_overdue_1140A is highly skewed (γ1 = 344.9531162) | Skewed |
pmts_overdue_1152A is highly skewed (γ1 = 259.3446397) | Skewed |
collater_valueofguarantee_1124L has 90662 (1.1%) zeros | Zeros |
collater_valueofguarantee_876L has 293947 (3.6%) zeros | Zeros |
num_group1 has 1462147 (18.1%) zeros | Zeros |
num_group2 has 325360 (4.0%) zeros | Zeros |
pmts_dpd_1073P has 1115719 (13.8%) zeros | Zeros |
pmts_dpd_303P has 2884014 (35.8%) zeros | Zeros |
pmts_overdue_1140A has 1122141 (13.9%) zeros | Zeros |
pmts_overdue_1152A has 2864665 (35.6%) zeros | Zeros |
Reproduction
Analysis started | 2024-02-13 19:51:08.860476 |
---|---|
Analysis finished | 2024-02-13 19:51:28.115434 |
Duration | 19.25 seconds |
Software version | ydata-profiling vv4.6.4 |
Download configuration | config.json |
case_id
Real number (ℝ)
Distinct | 45056 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1480036.192 |
Minimum | 49417 |
---|---|
Maximum | 2681255 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 61.5 MiB |
Quantile statistics
Minimum | 49417 |
---|---|
5-th percentile | 218775 |
Q1 | 977210 |
median | 1825914 |
Q3 | 1836249 |
95-th percentile | 2678919 |
Maximum | 2681255 |
Range | 2631838 |
Interquartile range (IQR) | 859039 |
Descriptive statistics
Standard deviation | 747088.2698 |
---|---|
Coefficient of variation (CV) | 0.5047770275 |
Kurtosis | -0.6460760485 |
Mean | 1480036.192 |
Median Absolute Deviation (MAD) | 13367 |
Skewness | -0.587070045 |
Sum | 1.192315085 × 1013 |
Variance | 5.581408829 × 1011 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
221467 | 7859 | 0.1% |
973076 | 6764 | 0.1% |
220977 | 4269 | 0.1% |
1825173 | 2268 | < 0.1% |
219624 | 1800 | < 0.1% |
1835890 | 1776 | < 0.1% |
1836084 | 1776 | < 0.1% |
1831201 | 1656 | < 0.1% |
982012 | 1632 | < 0.1% |
1842598 | 1572 | < 0.1% |
Other values (45046) | 8024614 |
Value | Count | Frequency (%) |
49417 | 444 | |
49444 | 240 | |
49450 | 108 | < 0.1% |
49538 | 156 | < 0.1% |
49632 | 60 | < 0.1% |
Value | Count | Frequency (%) |
2681255 | 492 | |
2681252 | 84 | < 0.1% |
2681251 | 72 | < 0.1% |
2681250 | 132 | < 0.1% |
2681249 | 108 | < 0.1% |
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 61.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 7956618 | |
9a0c095e | 61356 | 0.8% |
8fd95e4b | 37968 | 0.5% |
06fb9ba8 | 44 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 23969178 | |
a | 8018018 | 12.4% |
b | 7994674 | 12.4% |
4 | 7994586 | 12.4% |
7 | 7956618 | 12.3% |
1 | 7956618 | 12.3% |
9 | 160724 | 0.2% |
0 | 122756 | 0.2% |
e | 99324 | 0.2% |
c | 61356 | 0.1% |
Other values (4) | 114036 | 0.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 48198536 | |
Lowercase Letter | 16249352 | 25.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 23969178 | |
4 | 7994586 | 16.6% |
7 | 7956618 | 16.5% |
1 | 7956618 | 16.5% |
9 | 160724 | 0.3% |
0 | 122756 | 0.3% |
8 | 38012 | 0.1% |
6 | 44 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 8018018 | |
b | 7994674 | |
e | 99324 | 0.6% |
c | 61356 | 0.4% |
f | 38012 | 0.2% |
d | 37968 | 0.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 48198536 | |
Latin | 16249352 | 25.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 23969178 | |
4 | 7994586 | 16.6% |
7 | 7956618 | 16.5% |
1 | 7956618 | 16.5% |
9 | 160724 | 0.3% |
0 | 122756 | 0.3% |
8 | 38012 | 0.1% |
6 | 44 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 8018018 | |
b | 7994674 | |
e | 99324 | 0.6% |
c | 61356 | 0.4% |
f | 38012 | 0.2% |
d | 37968 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 64447888 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 23969178 | |
a | 8018018 | 12.4% |
b | 7994674 | 12.4% |
4 | 7994586 | 12.4% |
7 | 7956618 | 12.3% |
1 | 7956618 | 12.3% |
9 | 160724 | 0.2% |
0 | 122756 | 0.2% |
e | 99324 | 0.2% |
c | 61356 | 0.1% |
Other values (4) | 114036 | 0.2% |
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 61.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 7720433 | |
9a0c095e | 172448 | 2.1% |
8fd95e4b | 162673 | 2.0% |
06fb9ba8 | 386 | < 0.1% |
3cbe86ba | 45 | < 0.1% |
9276e4bb | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 23496420 | |
a | 7893312 | 12.2% |
b | 7883970 | 12.2% |
4 | 7883107 | 12.2% |
7 | 7720434 | 12.0% |
1 | 7720433 | 12.0% |
9 | 507956 | 0.8% |
0 | 345282 | 0.5% |
e | 335167 | 0.5% |
c | 172493 | 0.3% |
Other values (6) | 489314 | 0.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 47837214 | |
Lowercase Letter | 16610674 | 25.8% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 23496420 | |
4 | 7883107 | 16.5% |
7 | 7720434 | 16.1% |
1 | 7720433 | 16.1% |
9 | 507956 | 1.1% |
0 | 345282 | 0.7% |
8 | 163104 | 0.3% |
6 | 432 | < 0.1% |
3 | 45 | < 0.1% |
2 | 1 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 7893312 | |
b | 7883970 | |
e | 335167 | 2.0% |
c | 172493 | 1.0% |
f | 163059 | 1.0% |
d | 162673 | 1.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 47837214 | |
Latin | 16610674 | 25.8% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 23496420 | |
4 | 7883107 | 16.5% |
7 | 7720434 | 16.1% |
1 | 7720433 | 16.1% |
9 | 507956 | 1.1% |
0 | 345282 | 0.7% |
8 | 163104 | 0.3% |
6 | 432 | < 0.1% |
3 | 45 | < 0.1% |
2 | 1 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 7893312 | |
b | 7883970 | |
e | 335167 | 2.0% |
c | 172493 | 1.0% |
f | 163059 | 1.0% |
d | 162673 | 1.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 64447888 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 23496420 | |
a | 7893312 | 12.2% |
b | 7883970 | 12.2% |
4 | 7883107 | 12.2% |
7 | 7720434 | 12.0% |
1 | 7720433 | 12.0% |
9 | 507956 | 0.8% |
0 | 345282 | 0.5% |
e | 335167 | 0.5% |
c | 172493 | 0.3% |
Other values (6) | 489314 | 0.8% |
collater_valueofguarantee_1124L
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 4833 |
---|---|
Distinct (%) | 4.9% |
Missing | 7956618 |
Missing (%) | 98.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9499369.01 |
Minimum | 0 |
---|---|
Maximum | 1750000000 |
Zeros | 90662 |
Zeros (%) | 1.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 61.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 7351210 |
Maximum | 1750000000 |
Range | 1750000000 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 98302336.89 |
---|---|
Coefficient of variation (CV) | 10.34830174 |
Kurtosis | 156.858693 |
Mean | 9499369.01 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 12.42348167 |
Sum | 9.439332998 × 1011 |
Variance | 9.663349438 × 1015 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 90662 | 1.1% |
1300000000 | 504 | < 0.1% |
31000000 | 74 | < 0.1% |
938159252.8 | 72 | < 0.1% |
14536950 | 72 | < 0.1% |
10175865 | 72 | < 0.1% |
11108369.7 | 72 | < 0.1% |
456217535.2 | 72 | < 0.1% |
42669516.65 | 72 | < 0.1% |
59320800 | 72 | < 0.1% |
Other values (4823) | 7624 | 0.1% |
(Missing) | 7956618 |
Value | Count | Frequency (%) |
0 | 90662 | |
1 | 23 | < 0.1% |
1000 | 6 | < 0.1% |
2500 | 1 | < 0.1% |
4264.7 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1750000000 | 2 | < 0.1% |
1300000000 | 504 | |
1268000000 | 1 | < 0.1% |
1135000000 | 1 | < 0.1% |
938159252.8 | 72 | < 0.1% |
collater_valueofguarantee_876L
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 12031 |
---|---|
Distinct (%) | 3.6% |
Missing | 7720433 |
Missing (%) | 95.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10095376.44 |
Minimum | 0 |
---|---|
Maximum | 1.576941824 × 1010 |
Zeros | 293947 |
Zeros (%) | 3.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 61.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 2304245.6 |
Maximum | 1.576941824 × 1010 |
Range | 1.576941824 × 1010 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 106342638.4 |
---|---|
Coefficient of variation (CV) | 10.53379623 |
Kurtosis | 5862.181633 |
Mean | 10095376.44 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 46.93143835 |
Sum | 3.387533852 × 1012 |
Variance | 1.130875673 × 1016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 293947 | 3.6% |
60000 | 907 | < 0.1% |
740000000 | 702 | < 0.1% |
130000 | 568 | < 0.1% |
100000 | 545 | < 0.1% |
975308630 | 464 | < 0.1% |
50000 | 417 | < 0.1% |
1300000000 | 396 | < 0.1% |
65000 | 333 | < 0.1% |
80000 | 319 | < 0.1% |
Other values (12021) | 36955 | 0.5% |
(Missing) | 7720433 |
Value | Count | Frequency (%) |
0 | 293947 | |
0.01 | 1 | < 0.1% |
0.95 | 1 | < 0.1% |
0.99 | 1 | < 0.1% |
1 | 107 | < 0.1% |
Value | Count | Frequency (%) |
1.576941824 × 1010 | 4 | < 0.1% |
3250000000 | 8 | < 0.1% |
3200000000 | 8 | < 0.1% |
2000000000 | 44 | < 0.1% |
1300000000 | 396 |
Distinct | 15 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 61.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 7720433 | |
c7a5ad39 | 251636 | 3.1% |
3cbe86ba | 47608 | 0.6% |
9276e4bb | 13668 | 0.2% |
0e63c0f0 | 8809 | 0.1% |
168ad9f3 | 4344 | 0.1% |
5224034a | 2078 | < 0.1% |
5994c34a | 1836 | < 0.1% |
7b62420e | 1620 | < 0.1% |
940efad7 | 1514 | < 0.1% |
Other values (5) | 2440 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 23417366 | |
a | 8282191 | 12.9% |
7 | 7990201 | 12.4% |
b | 7845443 | 12.2% |
4 | 7745862 | 12.0% |
1 | 7727542 | 12.0% |
3 | 316311 | 0.5% |
c | 311530 | 0.5% |
9 | 274834 | 0.4% |
d | 259096 | 0.4% |
Other values (6) | 277512 | 0.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 47658485 | |
Lowercase Letter | 16789403 | 26.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 23417366 | |
7 | 7990201 | 16.8% |
4 | 7745862 | 16.3% |
1 | 7727542 | 16.2% |
3 | 316311 | 0.7% |
9 | 274834 | 0.6% |
6 | 76887 | 0.2% |
8 | 52947 | 0.1% |
0 | 32745 | 0.1% |
2 | 23790 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 8282191 | |
b | 7845443 | |
c | 311530 | 1.9% |
d | 259096 | 1.5% |
e | 73736 | 0.4% |
f | 17407 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 47658485 | |
Latin | 16789403 | 26.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 23417366 | |
7 | 7990201 | 16.8% |
4 | 7745862 | 16.3% |
1 | 7727542 | 16.2% |
3 | 316311 | 0.7% |
9 | 274834 | 0.6% |
6 | 76887 | 0.2% |
8 | 52947 | 0.1% |
0 | 32745 | 0.1% |
2 | 23790 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 8282191 | |
b | 7845443 | |
c | 311530 | 1.9% |
d | 259096 | 1.5% |
e | 73736 | 0.4% |
f | 17407 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 64447888 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 23417366 | |
a | 8282191 | 12.9% |
7 | 7990201 | 12.4% |
b | 7845443 | 12.2% |
4 | 7745862 | 12.0% |
1 | 7727542 | 12.0% |
3 | 316311 | 0.5% |
c | 311530 | 0.5% |
9 | 274834 | 0.4% |
d | 259096 | 0.4% |
Other values (6) | 277512 | 0.4% |
Distinct | 15 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 61.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 7956618 | |
c7a5ad39 | 83555 | 1.0% |
0e63c0f0 | 4714 | 0.1% |
9276e4bb | 4544 | 0.1% |
168ad9f3 | 3835 | < 0.1% |
3cbe86ba | 1242 | < 0.1% |
7b62420e | 861 | < 0.1% |
940efad7 | 196 | < 0.1% |
2fd21cf1 | 148 | < 0.1% |
f4d8a027 | 113 | < 0.1% |
Other values (5) | 160 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 23953565 | |
a | 8129214 | 12.6% |
7 | 8045955 | 12.5% |
b | 7969118 | 12.4% |
4 | 7962522 | 12.4% |
1 | 7960812 | 12.4% |
3 | 93439 | 0.1% |
9 | 92172 | 0.1% |
c | 89743 | 0.1% |
d | 87847 | 0.1% |
Other values (6) | 63501 | 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 48151191 | |
Lowercase Letter | 16296697 | 25.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 23953565 | |
7 | 8045955 | 16.7% |
4 | 7962522 | 16.5% |
1 | 7960812 | 16.5% |
3 | 93439 | 0.2% |
9 | 92172 | 0.2% |
0 | 15391 | < 0.1% |
6 | 15263 | < 0.1% |
2 | 6819 | < 0.1% |
8 | 5253 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 8129214 | |
b | 7969118 | |
c | 89743 | 0.6% |
d | 87847 | 0.5% |
e | 11620 | 0.1% |
f | 9155 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 48151191 | |
Latin | 16296697 | 25.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 23953565 | |
7 | 8045955 | 16.7% |
4 | 7962522 | 16.5% |
1 | 7960812 | 16.5% |
3 | 93439 | 0.2% |
9 | 92172 | 0.2% |
0 | 15391 | < 0.1% |
6 | 15263 | < 0.1% |
2 | 6819 | < 0.1% |
8 | 5253 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 8129214 | |
b | 7969118 | |
c | 89743 | 0.6% |
d | 87847 | 0.5% |
e | 11620 | 0.1% |
f | 9155 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 64447888 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 23953565 | |
a | 8129214 | 12.6% |
7 | 8045955 | 12.5% |
b | 7969118 | 12.4% |
4 | 7962522 | 12.4% |
1 | 7960812 | 12.4% |
3 | 93439 | 0.1% |
9 | 92172 | 0.1% |
c | 89743 | 0.1% |
d | 87847 | 0.1% |
Other values (6) | 63501 | 0.1% |
num_group1
Real number (ℝ)
ZEROS
 
Distinct | 294 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.533827392 |
Minimum | 0 |
---|---|
Maximum | 293 |
Zeros | 1462147 |
Zeros (%) | 18.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 61.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 3 |
Q3 | 7 |
95-th percentile | 18 |
Maximum | 293 |
Range | 293 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 9.19827738 |
---|---|
Coefficient of variation (CV) | 1.662190872 |
Kurtosis | 221.1698905 |
Mean | 5.533827392 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 10.80047156 |
Sum | 44580436 |
Variance | 84.60830676 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1462147 | |
1 | 1129278 | |
2 | 893984 | |
3 | 728098 | |
4 | 604925 | |
5 | 512020 | 6.4% |
6 | 433456 | 5.4% |
7 | 361361 | 4.5% |
8 | 303040 | 3.8% |
9 | 255339 | 3.2% |
Other values (284) | 1372338 |
Value | Count | Frequency (%) |
0 | 1462147 | |
1 | 1129278 | |
2 | 893984 | |
3 | 728098 | |
4 | 604925 |
Value | Count | Frequency (%) |
293 | 22 | |
292 | 22 | |
291 | 22 | |
290 | 22 | |
289 | 22 |
num_group2
Real number (ℝ)
ZEROS
 
Distinct | 101 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.56430969 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 325360 |
Zeros (%) | 4.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 61.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 6 |
median | 12 |
Q3 | 20 |
95-th percentile | 32 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 14 |
Descriptive statistics
Standard deviation | 9.4929668 |
---|---|
Coefficient of variation (CV) | 0.6998488691 |
Kurtosis | 0.2571096344 |
Mean | 13.56430969 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 0.6049061605 |
Sum | 109273889 |
Variance | 90.11641867 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 325360 | 4.0% |
1 | 325333 | 4.0% |
2 | 325333 | 4.0% |
3 | 325333 | 4.0% |
4 | 325333 | 4.0% |
5 | 325333 | 4.0% |
6 | 325333 | 4.0% |
7 | 325333 | 4.0% |
8 | 325333 | 4.0% |
9 | 325333 | 4.0% |
Other values (91) | 4802629 |
Value | Count | Frequency (%) |
0 | 325360 | |
1 | 325333 | |
2 | 325333 | |
3 | 325333 | |
4 | 325333 |
Value | Count | Frequency (%) |
100 | 1 | < 0.1% |
99 | 1 | < 0.1% |
98 | 79 | |
97 | 79 | |
96 | 79 |
pmts_dpd_1073P
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 3060 |
---|---|
Distinct (%) | 0.3% |
Missing | 6871133 |
Missing (%) | 85.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12.11641529 |
Minimum | 0 |
---|---|
Maximum | 4520 |
Zeros | 1115719 |
Zeros (%) | 13.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 61.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 1 |
Maximum | 4520 |
Range | 4520 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 146.1369082 |
---|---|
Coefficient of variation (CV) | 12.06106796 |
Kurtosis | 359.4909907 |
Mean | 12.11641529 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 17.57120307 |
Sum | 14356171 |
Variance | 21355.99592 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1115719 | 13.8% |
1 | 11291 | 0.1% |
2 | 4384 | 0.1% |
3 | 3939 | < 0.1% |
4 | 3536 | < 0.1% |
7 | 2052 | < 0.1% |
5 | 2032 | < 0.1% |
6 | 1932 | < 0.1% |
8 | 1684 | < 0.1% |
9 | 1614 | < 0.1% |
Other values (3050) | 36670 | 0.5% |
(Missing) | 6871133 |
Value | Count | Frequency (%) |
0 | 1115719 | |
1 | 11291 | 0.1% |
2 | 4384 | 0.1% |
3 | 3939 | < 0.1% |
4 | 3536 | < 0.1% |
Value | Count | Frequency (%) |
4520 | 1 | |
4507 | 1 | |
4475 | 1 | |
4454 | 1 | |
4422 | 1 |
pmts_dpd_303P
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 4042 |
---|---|
Distinct (%) | 0.1% |
Missing | 4608761 |
Missing (%) | 57.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 57.02591302 |
Minimum | -12 |
---|---|
Maximum | 84574 |
Zeros | 2884014 |
Zeros (%) | 35.8% |
Negative | 531 |
Negative (%) | < 0.1% |
Memory size | 61.5 MiB |
Quantile statistics
Minimum | -12 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 360 |
Maximum | 84574 |
Range | 84586 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 282.7228433 |
---|---|
Coefficient of variation (CV) | 4.957795997 |
Kurtosis | 9299.912895 |
Mean | 57.02591302 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 36.80773901 |
Sum | 196581153 |
Variance | 79932.20611 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 2884014 | |
1 | 79122 | 1.0% |
3 | 20657 | 0.3% |
2 | 19157 | 0.2% |
4 | 17056 | 0.2% |
6 | 13852 | 0.2% |
5 | 11317 | 0.1% |
7 | 11133 | 0.1% |
9 | 8081 | 0.1% |
8 | 7862 | 0.1% |
Other values (4032) | 374974 | 4.7% |
(Missing) | 4608761 |
Value | Count | Frequency (%) |
-12 | 1 | < 0.1% |
-9 | 1 | < 0.1% |
-8 | 7 | < 0.1% |
-7 | 13 | |
-6 | 23 |
Value | Count | Frequency (%) |
84574 | 1 | |
84560 | 1 | |
84533 | 1 | |
84505 | 1 | |
4668 | 1 |
pmts_month_158T
Real number (ℝ)
MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 5585750 |
Missing (%) | 69.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 61.5 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.452053228 |
---|---|
Coefficient of variation (CV) | 0.531085112 |
Kurtosis | -1.216783251 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 16056534 |
Variance | 11.91667149 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 205853 | 2.6% |
3 | 205853 | 2.6% |
4 | 205853 | 2.6% |
5 | 205853 | 2.6% |
6 | 205853 | 2.6% |
7 | 205853 | 2.6% |
8 | 205853 | 2.6% |
9 | 205853 | 2.6% |
10 | 205853 | 2.6% |
11 | 205853 | 2.6% |
Other values (2) | 411706 | 5.1% |
(Missing) | 5585750 |
Value | Count | Frequency (%) |
1 | 205853 | |
2 | 205853 | |
3 | 205853 | |
4 | 205853 | |
5 | 205853 |
Value | Count | Frequency (%) |
12 | 205853 | |
11 | 205853 | |
10 | 205853 | |
9 | 205853 | |
8 | 205853 |
pmts_month_706T
Real number (ℝ)
MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 702590 |
Missing (%) | 8.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 61.5 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.452052764 |
---|---|
Coefficient of variation (CV) | 0.5310850407 |
Kurtosis | -1.216783228 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 47797074 |
Variance | 11.91666829 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 612783 | |
3 | 612783 | |
4 | 612783 | |
5 | 612783 | |
6 | 612783 | |
7 | 612783 | |
8 | 612783 | |
9 | 612783 | |
10 | 612783 | |
11 | 612783 | |
Other values (2) | 1225566 | |
(Missing) | 702590 |
Value | Count | Frequency (%) |
1 | 612783 | |
2 | 612783 | |
3 | 612783 | |
4 | 612783 | |
5 | 612783 |
Value | Count | Frequency (%) |
12 | 612783 | |
11 | 612783 | |
10 | 612783 | |
9 | 612783 | |
8 | 612783 |
pmts_overdue_1140A
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 52480 |
---|---|
Distinct (%) | 4.4% |
Missing | 6863527 |
Missing (%) | 85.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2961.235791 |
Minimum | 0 |
---|---|
Maximum | 149930380 |
Zeros | 1122141 |
Zeros (%) | 13.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 61.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 529.833842 |
Maximum | 149930380 |
Range | 149930380 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 388144.9616 |
---|---|
Coefficient of variation (CV) | 131.075331 |
Kurtosis | 131049.0785 |
Mean | 2961.235791 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 344.9531162 |
Sum | 3531152270 |
Variance | 1.506565112 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1122141 | 13.9% |
10 | 107 | < 0.1% |
14 | 107 | < 0.1% |
400 | 89 | < 0.1% |
3350.3801 | 72 | < 0.1% |
0.4 | 68 | < 0.1% |
1000 | 64 | < 0.1% |
28000 | 64 | < 0.1% |
20 | 56 | < 0.1% |
1 | 52 | < 0.1% |
Other values (52470) | 69639 | 0.9% |
(Missing) | 6863527 |
Value | Count | Frequency (%) |
0 | 1122141 | |
0.002 | 2 | < 0.1% |
0.004 | 5 | < 0.1% |
0.006 | 4 | < 0.1% |
0.008 | 5 | < 0.1% |
Value | Count | Frequency (%) |
149930380 | 7 | < 0.1% |
23045082 | 32 | |
15555000 | 18 | |
8150148 | 5 | < 0.1% |
5045893.5 | 3 | < 0.1% |
pmts_overdue_1152A
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 280697 |
---|---|
Distinct (%) | 8.1% |
Missing | 4604283 |
Missing (%) | 57.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4299.084887 |
Minimum | 0 |
---|---|
Maximum | 46413904 |
Zeros | 2864665 |
Zeros (%) | 35.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 61.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 16413.764 |
Maximum | 46413904 |
Range | 46413904 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 89787.06774 |
---|---|
Coefficient of variation (CV) | 20.88515814 |
Kurtosis | 104914.0842 |
Mean | 4299.084887 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 259.3446397 |
Sum | 1.48391642 × 1010 |
Variance | 8061717534 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 2864665 | |
0.2 | 1719 | < 0.1% |
1000 | 894 | < 0.1% |
0.4 | 854 | < 0.1% |
0.8 | 641 | < 0.1% |
2000 | 595 | < 0.1% |
3000 | 534 | < 0.1% |
2 | 519 | < 0.1% |
1.6 | 511 | < 0.1% |
0.6 | 479 | < 0.1% |
Other values (280687) | 580292 | 7.2% |
(Missing) | 4604283 |
Value | Count | Frequency (%) |
0 | 2864665 | |
0.002 | 39 | < 0.1% |
0.004 | 25 | < 0.1% |
0.006 | 17 | < 0.1% |
0.008 | 22 | < 0.1% |
Value | Count | Frequency (%) |
46413904 | 4 | |
27888398 | 2 | |
26678742 | 2 | |
24440374 | 4 | |
20743684 | 1 | < 0.1% |
pmts_year_1139T
Real number (ℝ)
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 5585750 |
Missing (%) | 69.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2019.345637 |
Minimum | 2016 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 61.5 MiB |
Quantile statistics
Minimum | 2016 |
---|---|
5-th percentile | 2018 |
Q1 | 2019 |
median | 2019 |
Q3 | 2020 |
95-th percentile | 2020 |
Maximum | 2021 |
Range | 5 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 0.7899714889 |
---|---|
Coefficient of variation (CV) | 0.0003912017212 |
Kurtosis | -0.6975550172 |
Mean | 2019.345637 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.2526182713 |
Sum | 4988260289 |
Variance | 0.6240549533 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2020 | 1073524 | 13.3% |
2019 | 906216 | 11.2% |
2018 | 399723 | 5.0% |
2021 | 90404 | 1.1% |
2017 | 303 | < 0.1% |
2016 | 66 | < 0.1% |
(Missing) | 5585750 |
Value | Count | Frequency (%) |
2016 | 66 | < 0.1% |
2017 | 303 | < 0.1% |
2018 | 399723 | 5.0% |
2019 | 906216 | |
2020 | 1073524 |
Value | Count | Frequency (%) |
2021 | 90404 | 1.1% |
2020 | 1073524 | |
2019 | 906216 | |
2018 | 399723 | 5.0% |
2017 | 303 | < 0.1% |
pmts_year_507T
Real number (ℝ)
MISSING
 
Distinct | 22 |
---|---|
Distinct (%) | < 0.1% |
Missing | 702590 |
Missing (%) | 8.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2014.818264 |
Minimum | 2000 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 61.5 MiB |
Quantile statistics
Minimum | 2000 |
---|---|
5-th percentile | 2007 |
Q1 | 2012 |
median | 2016 |
Q3 | 2018 |
95-th percentile | 2019 |
Maximum | 2021 |
Range | 21 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 3.971782424 |
---|---|
Coefficient of variation (CV) | 0.001971285696 |
Kurtosis | -0.4506686967 |
Mean | 2014.818264 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.763033686 |
Sum | 1.481575656 × 1010 |
Variance | 15.77505562 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2018 | 1119912 | |
2019 | 1017290 | |
2017 | 864903 | |
2016 | 620562 | |
2015 | 547132 | |
2014 | 505515 | 6.3% |
2013 | 436475 | 5.4% |
2012 | 353630 | 4.4% |
2011 | 304694 | 3.8% |
2020 | 293980 | 3.6% |
Other values (12) | 1289303 | |
(Missing) | 702590 |
Value | Count | Frequency (%) |
2000 | 11 | < 0.1% |
2001 | 34 | < 0.1% |
2002 | 135 | < 0.1% |
2003 | 683 | < 0.1% |
2004 | 13954 |
Value | Count | Frequency (%) |
2021 | 19104 | 0.2% |
2020 | 293980 | 3.6% |
2019 | 1017290 | |
2018 | 1119912 | |
2017 | 864903 |
Distinct | 10 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 61.5 MiB |
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 8.000000248 |
Min length | 8 |
Characters and Unicode
Total characters | 64447890 |
---|---|
Distinct characters | 17 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 2 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | ab3c25cf |
---|---|
2nd row | a55475b1 |
3rd row | a55475b1 |
4th row | a55475b1 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 7736685 | |
ab3c25cf | 312948 | 3.9% |
15f04f45 | 3164 | < 0.1% |
be4fd70b | 2204 | < 0.1% |
daf49a8a | 965 | < 0.1% |
71ddaa88 | 10 | < 0.1% |
0c42a10e | 6 | < 0.1% |
p28_48_88 | 2 | < 0.1% |
9ba4314a | 1 | < 0.1% |
1d94eac1 | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 23529331 | |
b | 8054042 | 12.5% |
a | 8052557 | 12.5% |
4 | 7746193 | 12.0% |
1 | 7739868 | 12.0% |
7 | 7738899 | 12.0% |
c | 625903 | 1.0% |
f | 322445 | 0.5% |
2 | 312956 | 0.5% |
3 | 312949 | 0.5% |
Other values (7) | 12747 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 47387536 | |
Lowercase Letter | 17060348 | 26.5% |
Connector Punctuation | 4 | < 0.1% |
Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 23529331 | |
4 | 7746193 | 16.3% |
1 | 7739868 | 16.3% |
7 | 7738899 | 16.3% |
2 | 312956 | 0.7% |
3 | 312949 | 0.7% |
0 | 5380 | < 0.1% |
8 | 993 | < 0.1% |
9 | 967 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 8054042 | |
a | 8052557 | |
c | 625903 | 3.7% |
f | 322445 | 1.9% |
d | 3190 | < 0.1% |
e | 2211 | < 0.1% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 4 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 47387540 | |
Latin | 17060350 | 26.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 23529331 | |
4 | 7746193 | 16.3% |
1 | 7739868 | 16.3% |
7 | 7738899 | 16.3% |
2 | 312956 | 0.7% |
3 | 312949 | 0.7% |
0 | 5380 | < 0.1% |
8 | 993 | < 0.1% |
9 | 967 | < 0.1% |
_ | 4 | < 0.1% |
Latin
Value | Count | Frequency (%) |
b | 8054042 | |
a | 8052557 | |
c | 625903 | 3.7% |
f | 322445 | 1.9% |
d | 3190 | < 0.1% |
e | 2211 | < 0.1% |
P | 2 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 64447890 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 23529331 | |
b | 8054042 | 12.5% |
a | 8052557 | 12.5% |
4 | 7746193 | 12.0% |
1 | 7739868 | 12.0% |
7 | 7738899 | 12.0% |
c | 625903 | 1.0% |
f | 322445 | 0.5% |
2 | 312956 | 0.5% |
3 | 312949 | 0.5% |
Other values (7) | 12747 | < 0.1% |
Distinct | 7 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 61.5 MiB |
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 8.000013654 |
Min length | 8 |
Characters and Unicode
Total characters | 64447998 |
---|---|
Distinct characters | 17 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | ab3c25cf |
---|---|
2nd row | a55475b1 |
3rd row | a55475b1 |
4th row | a55475b1 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 7964485 | |
ab3c25cf | 88586 | 1.1% |
15f04f45 | 1250 | < 0.1% |
be4fd70b | 1011 | < 0.1% |
daf49a8a | 543 | < 0.1% |
p28_48_88 | 110 | < 0.1% |
71ddaa88 | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 23984541 | |
b | 8055093 | 12.5% |
a | 8054702 | 12.5% |
4 | 7968649 | 12.4% |
1 | 7965736 | 12.4% |
7 | 7965497 | 12.4% |
c | 177172 | 0.3% |
f | 92640 | 0.1% |
2 | 88696 | 0.1% |
3 | 88586 | 0.1% |
Other values (7) | 6686 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 48065494 | |
Lowercase Letter | 16382174 | 25.4% |
Connector Punctuation | 220 | < 0.1% |
Uppercase Letter | 110 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 23984541 | |
4 | 7968649 | 16.6% |
1 | 7965736 | 16.6% |
7 | 7965497 | 16.6% |
2 | 88696 | 0.2% |
3 | 88586 | 0.2% |
0 | 2261 | < 0.1% |
8 | 985 | < 0.1% |
9 | 543 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 8055093 | |
a | 8054702 | |
c | 177172 | 1.1% |
f | 92640 | 0.6% |
d | 1556 | < 0.1% |
e | 1011 | < 0.1% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 220 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 110 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 48065714 | |
Latin | 16382284 | 25.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 23984541 | |
4 | 7968649 | 16.6% |
1 | 7965736 | 16.6% |
7 | 7965497 | 16.6% |
2 | 88696 | 0.2% |
3 | 88586 | 0.2% |
0 | 2261 | < 0.1% |
8 | 985 | < 0.1% |
9 | 543 | < 0.1% |
_ | 220 | < 0.1% |
Latin
Value | Count | Frequency (%) |
b | 8055093 | |
a | 8054702 | |
c | 177172 | 1.1% |
f | 92640 | 0.6% |
d | 1556 | < 0.1% |
e | 1011 | < 0.1% |
P | 110 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 64447998 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 23984541 | |
b | 8055093 | 12.5% |
a | 8054702 | 12.5% |
4 | 7968649 | 12.4% |
1 | 7965736 | 12.4% |
7 | 7965497 | 12.4% |
c | 177172 | 0.3% |
f | 92640 | 0.1% |
2 | 88696 | 0.1% |
3 | 88586 | 0.1% |
Other values (7) | 6686 | < 0.1% |