Dataset statistics
Number of variables | 19 |
---|---|
Number of observations | 13927071 |
Missing cells | 88764540 |
Missing cells (%) | 33.5% |
Total size in memory | 2.0 GiB |
Average record size in memory | 152.0 B |
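The derived figures in the table above can be rechecked from the raw counts. The sketch below is only a consistency check, not the profiler's own code; the 8-byte width per stored value is an assumption inferred from the 152.0 B record size spread over 19 variables.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 19
n_observations = 13_927_071
missing_cells = 88_764_540

# Share of missing cells over the full observations x variables grid.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption (not stated in the report): each column stores one 8-byte value per row.
bytes_per_value = 8
record_size = n_variables * bytes_per_value
total_gib = n_observations * record_size / 2**30
column_mib = n_observations * bytes_per_value / 2**20

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {record_size} B")
print(f"Total size in memory: {total_gib:.1f} GiB")
print(f"Memory size per variable: {column_mib:.1f} MiB")
```

Under that assumption the per-variable result agrees with the 106.3 MiB memory size listed for each variable.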
Variable types
Numeric | 13 |
---|---|
Text | 6 |
collater_valueofguarantee_1124L has 13778078 (98.9%) missing values | Missing |
collater_valueofguarantee_876L has 13373863 (96.0%) missing values | Missing |
pmts_dpd_1073P has 11883298 (85.3%) missing values | Missing |
pmts_dpd_303P has 7816731 (56.1%) missing values | Missing |
pmts_month_158T has 10075311 (72.3%) missing values | Missing |
pmts_month_706T has 1040751 (7.5%) missing values | Missing |
pmts_overdue_1140A has 11871027 (85.2%) missing values | Missing |
pmts_overdue_1152A has 7809419 (56.1%) missing values | Missing |
pmts_year_1139T has 10075311 (72.3%) missing values | Missing |
pmts_year_507T has 1040751 (7.5%) missing values | Missing |
collater_valueofguarantee_1124L is highly skewed (γ1 = 47.0809369) | Skewed |
collater_valueofguarantee_876L is highly skewed (γ1 = 68.99623008) | Skewed |
pmts_dpd_1073P is highly skewed (γ1 = 36.69774178) | Skewed |
pmts_overdue_1140A is highly skewed (γ1 = 137.8117801) | Skewed |
pmts_overdue_1152A is highly skewed (γ1 = 314.5818664) | Skewed |
collater_valueofguarantee_1124L has 139355 (1.0%) zeros | Zeros |
collater_valueofguarantee_876L has 504614 (3.6%) zeros | Zeros |
num_group1 has 2482094 (17.8%) zeros | Zeros |
num_group2 has 560081 (4.0%) zeros | Zeros |
pmts_dpd_1073P has 1958853 (14.1%) zeros | Zeros |
pmts_dpd_303P has 5124725 (36.8%) zeros | Zeros |
pmts_overdue_1140A has 1969331 (14.1%) zeros | Zeros |
pmts_overdue_1152A has 5090863 (36.6%) zeros | Zeros |
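The Skewed alerts above can be reproduced from the per-variable skewness values with a simple cutoff. The sketch below is illustrative only: the threshold of 20 is an assumption (the report does not state it), and any cutoff between 16.2 and 36.7 yields the same list for these variables.

```python
# Skewness (γ1) values copied from the per-variable statistics in this report.
skewness = {
    "case_id": -0.4369387253,
    "collater_valueofguarantee_1124L": 47.0809369,
    "collater_valueofguarantee_876L": 68.99623008,
    "num_group1": 4.909283745,
    "num_group2": 0.4768722509,
    "pmts_dpd_1073P": 36.69774178,
    "pmts_dpd_303P": 16.20937066,
    "pmts_overdue_1140A": 137.8117801,
    "pmts_overdue_1152A": 314.5818664,
}

# Hypothetical cutoff; the value the profiler used is not given in the report.
SKEW_THRESHOLD = 20

flagged = sorted(name for name, g1 in skewness.items() if abs(g1) > SKEW_THRESHOLD)
for name in flagged:
    print(f"{name} is highly skewed (γ1 = {skewness[name]})")
```

Note that pmts_dpd_303P (γ1 ≈ 16.2) falls below this cutoff, which matches its absence from the Skewed alerts.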
Reproduction
Analysis started | 2024-02-13 19:51:41.397929 |
---|---|
Analysis finished | 2024-02-13 19:52:06.212156 |
Duration | 24.81 seconds |
Software version | ydata-profiling v4.6.4 |
Download configuration | config.json |
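The reported duration follows directly from the two timestamps. A minimal check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-02-13 19:51:41.397929", FMT)
finished = datetime.strptime("2024-02-13 19:52:06.212156", FMT)

duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```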
case_id
Real number (ℝ)
Distinct | 77457 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1438578.907 |
Minimum | 51083 |
---|---|
Maximum | 2688744 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 106.3 MiB |
Quantile statistics
Minimum | 51083 |
---|---|
5-th percentile | 226210 |
Q1 | 238681 |
median | 1852943 |
Q3 | 1870815 |
95-th percentile | 2685193 |
Maximum | 2688744 |
Range | 2637661 |
Interquartile range (IQR) | 1632134 |
Descriptive statistics
Standard deviation | 806896.0686 |
---|---|
Coefficient of variation (CV) | 0.5608980255 |
Kurtosis | -1.045670522 |
Mean | 1438578.907 |
Median Absolute Deviation (MAD) | 25736 |
Skewness | -0.4369387253 |
Sum | 2.003519058 × 10^13 |
Variance | 6.510812655 × 10^11 |
Monotonicity | Increasing |
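Several case_id statistics are derived from others in the same tables. The sketch below recomputes them from the reported values as a sanity check; it is not the profiler's implementation.

```python
# Values copied from the case_id quantile and descriptive statistics.
minimum, maximum = 51083, 2688744
q1, q3 = 238681, 1870815
mean, std = 1438578.907, 806896.0686

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance is the squared standard deviation

print(f"Range: {value_range}")
print(f"IQR: {iqr}")
print(f"CV: {cv:.10f}")
print(f"Variance: {variance:.9e}")
```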
Value | Count | Frequency (%) |
1863457 | 2772 | < 0.1% |
1856049 | 2388 | < 0.1% |
1845383 | 2076 | < 0.1% |
236981 | 1944 | < 0.1% |
225313 | 1872 | < 0.1% |
52198 | 1668 | < 0.1% |
1845023 | 1668 | < 0.1% |
1849279 | 1608 | < 0.1% |
1846434 | 1584 | < 0.1% |
237301 | 1584 | < 0.1% |
Other values (77447) | 13907907 | 99.9% |
Value | Count | Frequency (%) |
51083 | 96 | < 0.1% |
51099 | 120 | < 0.1% |
51103 | 84 | < 0.1% |
51106 | 60 | < 0.1% |
51115 | 240 | < 0.1% |
Value | Count | Frequency (%) |
2688744 | 24 | < 0.1% |
2688743 | 156 | < 0.1% |
2688742 | 96 | < 0.1% |
2688741 | 96 | < 0.1% |
2688740 | 48 | < 0.1% |
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 106.3 MiB |
Value | Count | Frequency (%) |
a55475b1 | 13778078 | 98.9% |
9a0c095e | 99894 | 0.7% |
8fd95e4b | 49026 | 0.4% |
06fb9ba8 | 73 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 41483154 | 37.2% |
a | 13878045 | 12.5% |
b | 13827250 | 12.4% |
4 | 13827104 | 12.4% |
7 | 13778078 | 12.4% |
1 | 13778078 | 12.4% |
9 | 248887 | 0.2% |
0 | 199861 | 0.2% |
e | 148920 | 0.1% |
c | 99894 | 0.1% |
Other values (4) | 147297 | 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 83364334 | 74.8% |
Lowercase Letter | 28052234 | 25.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 41483154 | 49.8% |
4 | 13827104 | 16.6% |
7 | 13778078 | 16.5% |
1 | 13778078 | 16.5% |
9 | 248887 | 0.3% |
0 | 199861 | 0.2% |
8 | 49099 | 0.1% |
6 | 73 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 13878045 | 49.5% |
b | 13827250 | 49.3% |
e | 148920 | 0.5% |
c | 99894 | 0.4% |
f | 49099 | 0.2% |
d | 49026 | 0.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 83364334 | 74.8% |
Latin | 28052234 | 25.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 41483154 | 49.8% |
4 | 13827104 | 16.6% |
7 | 13778078 | 16.5% |
1 | 13778078 | 16.5% |
9 | 248887 | 0.3% |
0 | 199861 | 0.2% |
8 | 49099 | 0.1% |
6 | 73 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 13878045 | 49.5% |
b | 13827250 | 49.3% |
e | 148920 | 0.5% |
c | 99894 | 0.4% |
f | 49099 | 0.2% |
d | 49026 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 111416568 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 41483154 | 37.2% |
a | 13878045 | 12.5% |
b | 13827250 | 12.4% |
4 | 13827104 | 12.4% |
7 | 13778078 | 12.4% |
1 | 13778078 | 12.4% |
9 | 248887 | 0.2% |
0 | 199861 | 0.2% |
e | 148920 | 0.1% |
c | 99894 | 0.1% |
Other values (4) | 147297 | 0.1% |
Distinct | 7 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 106.3 MiB |
Value | Count | Frequency (%) |
a55475b1 | 13373863 | 96.0% |
9a0c095e | 299054 | 2.1% |
8fd95e4b | 253478 | 1.8% |
06fb9ba8 | 568 | < 0.1% |
3cbe86ba | 105 | < 0.1% |
c7a5ad39 | 2 | < 0.1% |
9276e4bb | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 40674123 | 36.5% |
a | 13673594 | 12.3% |
b | 13628689 | 12.2% |
4 | 13627342 | 12.2% |
7 | 13373866 | 12.0% |
1 | 13373863 | 12.0% |
9 | 852157 | 0.8% |
0 | 598676 | 0.5% |
e | 552638 | 0.5% |
c | 299161 | 0.3% |
Other values (6) | 762459 | 0.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 82754960 | 74.3% |
Lowercase Letter | 28661608 | 25.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 40674123 | 49.2% |
4 | 13627342 | 16.5% |
7 | 13373866 | 16.2% |
1 | 13373863 | 16.2% |
9 | 852157 | 1.0% |
0 | 598676 | 0.7% |
8 | 254151 | 0.3% |
6 | 674 | < 0.1% |
3 | 107 | < 0.1% |
2 | 1 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 13673594 | 47.7% |
b | 13628689 | 47.6% |
e | 552638 | 1.9% |
c | 299161 | 1.0% |
f | 254046 | 0.9% |
d | 253480 | 0.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 82754960 | 74.3% |
Latin | 28661608 | 25.7% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 40674123 | 49.2% |
4 | 13627342 | 16.5% |
7 | 13373866 | 16.2% |
1 | 13373863 | 16.2% |
9 | 852157 | 1.0% |
0 | 598676 | 0.7% |
8 | 254151 | 0.3% |
6 | 674 | < 0.1% |
3 | 107 | < 0.1% |
2 | 1 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 13673594 | 47.7% |
b | 13628689 | 47.6% |
e | 552638 | 1.9% |
c | 299161 | 1.0% |
f | 254046 | 0.9% |
d | 253480 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 111416568 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 40674123 | 36.5% |
a | 13673594 | 12.3% |
b | 13628689 | 12.2% |
4 | 13627342 | 12.2% |
7 | 13373866 | 12.0% |
1 | 13373863 | 12.0% |
9 | 852157 | 0.8% |
0 | 598676 | 0.5% |
e | 552638 | 0.5% |
c | 299161 | 0.3% |
Other values (6) | 762459 | 0.7% |
collater_valueofguarantee_1124L
Real number (ℝ)
MISSING, SKEWED, ZEROS
Distinct | 6658 |
---|---|
Distinct (%) | 4.5% |
Missing | 13778078 |
Missing (%) | 98.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1917304.433 |
Minimum | 0 |
---|---|
Maximum | 3200000000 |
Zeros | 139355 |
Zeros (%) | 1.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 106.3 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 2774896.824 |
Maximum | 3200000000 |
Range | 3200000000 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 36878205.82 |
---|---|
Coefficient of variation (CV) | 19.23440283 |
Kurtosis | 3065.259628 |
Mean | 1917304.433 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 47.0809369 |
Sum | 2.856649394 × 10^11 |
Variance | 1.360002064 × 10^15 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 139355 | 1.0% |
5000000 | 73 | < 0.1% |
200000000 | 68 | < 0.1% |
400000000 | 60 | < 0.1% |
1200000000 | 48 | < 0.1% |
3000000 | 42 | < 0.1% |
4000000 | 40 | < 0.1% |
10000000 | 39 | < 0.1% |
7000000 | 36 | < 0.1% |
6000000 | 35 | < 0.1% |
Other values (6648) | 9197 | 0.1% |
(Missing) | 13778078 | 98.9% |
Value | Count | Frequency (%) |
0 | 139355 | 1.0% |
1 | 21 | < 0.1% |
383 | 1 | < 0.1% |
1484 | 1 | < 0.1% |
1866 | 1 | < 0.1% |
Value | Count | Frequency (%) |
3200000000 | 3 | < 0.1% |
3000000000 | 5 | < 0.1% |
1200000000 | 48 | < 0.1% |
1139827000 | 6 | < 0.1% |
983057000 | 6 | < 0.1% |
collater_valueofguarantee_876L
Real number (ℝ)
MISSING, SKEWED, ZEROS
Distinct | 17093 |
---|---|
Distinct (%) | 3.1% |
Missing | 13373863 |
Missing (%) | 96.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1960659.196 |
Minimum | 0 |
---|---|
Maximum | 6804986362 |
Zeros | 504614 |
Zeros (%) | 3.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 106.3 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 118744.55 |
Maximum | 6804986362 |
Range | 6804986362 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 69301291.8 |
---|---|
Coefficient of variation (CV) | 35.34591425 |
Kurtosis | 5916.584668 |
Mean | 1960659.196 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 68.99623008 |
Sum | 1.084652352 × 10^12 |
Variance | 4.802669045 × 10^15 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 504614 | 3.6% |
60000 | 1427 | < 0.1% |
130000 | 1061 | < 0.1% |
100000 | 1053 | < 0.1% |
50000 | 762 | < 0.1% |
65000 | 640 | < 0.1% |
300000 | 511 | < 0.1% |
70000 | 507 | < 0.1% |
80000 | 500 | < 0.1% |
150000 | 494 | < 0.1% |
Other values (17083) | 41639 | 0.3% |
(Missing) | 13373863 | 96.0% |
Value | Count | Frequency (%) |
0 | 504614 | 3.6% |
0.14 | 1 | < 0.1% |
0.99 | 1 | < 0.1% |
1 | 159 | < 0.1% |
1.2 | 1 | < 0.1% |
Value | Count | Frequency (%) |
6804986362 | 32 | < 0.1% |
3250000000 | 39 | < 0.1% |
3200000000 | 8 | < 0.1% |
2000000000 | 101 | < 0.1% |
1200000000 | 46 | < 0.1% |
Distinct | 15 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 106.3 MiB |
Value | Count | Frequency (%) |
a55475b1 | 13373863 | 96.0% |
c7a5ad39 | 437459 | 3.1% |
3cbe86ba | 80058 | 0.6% |
9276e4bb | 10883 | 0.1% |
0e63c0f0 | 8992 | 0.1% |
168ad9f3 | 3545 | < 0.1% |
5224034a | 2668 | < 0.1% |
940efad7 | 2520 | < 0.1% |
7b62420e | 2338 | < 0.1% |
2fd21cf1 | 1539 | < 0.1% |
Other values (5) | 3206 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 40563772 | 36.4% |
a | 14340330 | 12.9% |
7 | 13829035 | 12.4% |
b | 13559262 | 12.2% |
4 | 13398592 | 12.0% |
1 | 13381291 | 12.0% |
3 | 533973 | 0.5% |
c | 530104 | 0.5% |
9 | 456909 | 0.4% |
d | 445839 | 0.4% |
Other values (6) | 377461 | 0.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 82416509 | 74.0% |
Lowercase Letter | 29000059 | 26.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 40563772 | 49.2% |
7 | 13829035 | 16.8% |
4 | 13398592 | 16.3% |
1 | 13381291 | 16.2% |
3 | 533973 | 0.6% |
9 | 456909 | 0.6% |
6 | 106995 | 0.1% |
8 | 85184 | 0.1% |
0 | 36009 | < 0.1% |
2 | 24749 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 14340330 | 49.4% |
b | 13559262 | 46.8% |
c | 530104 | 1.8% |
d | 445839 | 1.5% |
e | 105596 | 0.4% |
f | 18928 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 82416509 | 74.0% |
Latin | 29000059 | 26.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 40563772 | 49.2% |
7 | 13829035 | 16.8% |
4 | 13398592 | 16.3% |
1 | 13381291 | 16.2% |
3 | 533973 | 0.6% |
9 | 456909 | 0.6% |
6 | 106995 | 0.1% |
8 | 85184 | 0.1% |
0 | 36009 | < 0.1% |
2 | 24749 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 14340330 | 49.4% |
b | 13559262 | 46.8% |
c | 530104 | 1.8% |
d | 445839 | 1.5% |
e | 105596 | 0.4% |
f | 18928 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 111416568 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 40563772 | 36.4% |
a | 14340330 | 12.9% |
7 | 13829035 | 12.4% |
b | 13559262 | 12.2% |
4 | 13398592 | 12.0% |
1 | 13381291 | 12.0% |
3 | 533973 | 0.5% |
c | 530104 | 0.5% |
9 | 456909 | 0.4% |
d | 445839 | 0.4% |
Other values (6) | 377461 | 0.3% |
Distinct | 14 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 106.3 MiB |
Value | Count | Frequency (%) |
a55475b1 | 13778078 | 98.9% |
c7a5ad39 | 136255 | 1.0% |
9276e4bb | 5003 | < 0.1% |
0e63c0f0 | 3727 | < 0.1% |
7b62420e | 1393 | < 0.1% |
168ad9f3 | 1379 | < 0.1% |
940efad7 | 360 | < 0.1% |
f4d8a027 | 324 | < 0.1% |
2fd21cf1 | 193 | < 0.1% |
3cbe86ba | 166 | < 0.1% |
Other values (4) | 193 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 41470681 | 37.2% |
a | 14052918 | 12.6% |
7 | 13921507 | 12.5% |
b | 13789902 | 12.4% |
4 | 13785359 | 12.4% |
1 | 13779935 | 12.4% |
9 | 143067 | 0.1% |
3 | 141627 | 0.1% |
c | 140468 | 0.1% |
d | 138511 | 0.1% |
Other values (6) | 52593 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 83277851 | 74.7% |
Lowercase Letter | 28138717 | 25.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 41470681 | 49.8% |
7 | 13921507 | 16.7% |
4 | 13785359 | 16.6% |
1 | 13779935 | 16.5% |
9 | 143067 | 0.2% |
3 | 141627 | 0.2% |
0 | 13324 | < 0.1% |
6 | 11761 | < 0.1% |
2 | 8629 | < 0.1% |
8 | 1961 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 14052918 | 49.9% |
b | 13789902 | 49.0% |
c | 140468 | 0.5% |
d | 138511 | 0.5% |
e | 10741 | < 0.1% |
f | 6177 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 83277851 | 74.7% |
Latin | 28138717 | 25.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 41470681 | 49.8% |
7 | 13921507 | 16.7% |
4 | 13785359 | 16.6% |
1 | 13779935 | 16.5% |
9 | 143067 | 0.2% |
3 | 141627 | 0.2% |
0 | 13324 | < 0.1% |
6 | 11761 | < 0.1% |
2 | 8629 | < 0.1% |
8 | 1961 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 14052918 | 49.9% |
b | 13789902 | 49.0% |
c | 140468 | 0.5% |
d | 138511 | 0.5% |
e | 10741 | < 0.1% |
f | 6177 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 111416568 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 41470681 | 37.2% |
a | 14052918 | 12.6% |
7 | 13921507 | 12.5% |
b | 13789902 | 12.4% |
4 | 13785359 | 12.4% |
1 | 13779935 | 12.4% |
9 | 143067 | 0.1% |
3 | 141627 | 0.1% |
c | 140468 | 0.1% |
d | 138511 | 0.1% |
Other values (6) | 52593 | < 0.1% |
num_group1
Real number (ℝ)
ZEROS
 
Distinct | 198 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.265338204 |
Minimum | 0 |
---|---|
Maximum | 197 |
Zeros | 2482094 |
Zeros (%) | 17.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 106.3 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 3 |
Q3 | 7 |
95-th percentile | 17 |
Maximum | 197 |
Range | 197 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 6.875070202 |
---|---|
Coefficient of variation (CV) | 1.305722431 |
Kurtosis | 57.95678216 |
Mean | 5.265338204 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 4.909283745 |
Sum | 73330739 |
Variance | 47.26659028 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 2482094 | 17.8% |
1 | 1935612 | 13.9% |
2 | 1542782 | 11.1% |
3 | 1270423 | 9.1% |
4 | 1064545 | 7.6% |
5 | 898181 | 6.4% |
6 | 761863 | 5.5% |
7 | 641975 | 4.6% |
8 | 539828 | 3.9% |
9 | 453673 | 3.3% |
Other values (188) | 2336095 | 16.8% |
Value | Count | Frequency (%) |
0 | 2482094 | 17.8% |
1 | 1935612 | 13.9% |
2 | 1542782 | 11.1% |
3 | 1270423 | 9.1% |
4 | 1064545 | 7.6% |
Value | Count | Frequency (%) |
197 | 12 | < 0.1% |
196 | 12 | < 0.1% |
195 | 12 | < 0.1% |
194 | 12 | < 0.1% |
193 | 12 | < 0.1% |
num_group2
Real number (ℝ)
ZEROS
 
Distinct | 40 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.55590016 |
Minimum | 0 |
---|---|
Maximum | 39 |
Zeros | 560081 |
Zeros (%) | 4.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 106.3 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 6 |
median | 12 |
Q3 | 20 |
95-th percentile | 32 |
Maximum | 39 |
Range | 39 |
Interquartile range (IQR) | 14 |
Descriptive statistics
Standard deviation | 9.381094055 |
---|---|
Coefficient of variation (CV) | 0.6920303295 |
Kurtosis | -0.7071473224 |
Mean | 13.55590016 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 0.4768722509 |
Sum | 188793984 |
Variance | 88.00492566 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 560081 | 4.0% |
1 | 560031 | 4.0% |
2 | 560030 | 4.0% |
7 | 560030 | 4.0% |
3 | 560030 | 4.0% |
9 | 560030 | 4.0% |
8 | 560030 | 4.0% |
11 | 560030 | 4.0% |
6 | 560030 | 4.0% |
5 | 560030 | 4.0% |
Other values (30) | 8326719 | 59.8% |
Value | Count | Frequency (%) |
0 | 560081 | 4.0% |
1 | 560031 | 4.0% |
2 | 560030 | 4.0% |
3 | 560030 | 4.0% |
4 | 560030 | 4.0% |
Value | Count | Frequency (%) |
39 | 1 | < 0.1% |
38 | 1 | < 0.1% |
37 | 1 | < 0.1% |
36 | 1 | < 0.1% |
35 | 178580 | 1.3% |
pmts_dpd_1073P
Real number (ℝ)
MISSING, SKEWED, ZEROS
Distinct | 2146 |
---|---|
Distinct (%) | 0.1% |
Missing | 11883298 |
Missing (%) | 85.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.107236469 |
Minimum | 0 |
---|---|
Maximum | 4520 |
Zeros | 1958853 |
Zeros (%) | 14.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 106.3 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 4520 |
Range | 4520 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 70.10703701 |
---|---|
Coefficient of variation (CV) | 22.56250456 |
Kurtosis | 1614.321289 |
Mean | 3.107236469 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 36.69774178 |
Sum | 6350486 |
Variance | 4914.996638 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1958853 | 14.1% |
1 | 17075 | 0.1% |
2 | 6974 | 0.1% |
3 | 6007 | < 0.1% |
4 | 5304 | < 0.1% |
7 | 3365 | < 0.1% |
5 | 3236 | < 0.1% |
6 | 2993 | < 0.1% |
8 | 2758 | < 0.1% |
10 | 2533 | < 0.1% |
Other values (2136) | 34675 | 0.2% |
(Missing) | 11883298 | 85.3% |
Value | Count | Frequency (%) |
0 | 1958853 | 14.1% |
1 | 17075 | 0.1% |
2 | 6974 | 0.1% |
3 | 6007 | < 0.1% |
4 | 5304 | < 0.1% |
Value | Count | Frequency (%) |
4520 | 1 | < 0.1% |
4497 | 1 | < 0.1% |
4460 | 1 | < 0.1% |
4446 | 1 | < 0.1% |
4415 | 1 | < 0.1% |
pmts_dpd_303P
Real number (ℝ)
MISSING, ZEROS
Distinct | 4145 |
---|---|
Distinct (%) | 0.1% |
Missing | 7816731 |
Missing (%) | 56.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 58.6952402 |
Minimum | -9 |
---|---|
Maximum | 84575 |
Zeros | 5124725 |
Zeros (%) | 36.8% |
Negative | 779 |
Negative (%) | < 0.1% |
Memory size | 106.3 MiB |
Quantile statistics
Minimum | -9 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 378 |
Maximum | 84575 |
Range | 84584 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 279.0666744 |
---|---|
Coefficient of variation (CV) | 4.754502639 |
Kurtosis | 2883.58435 |
Mean | 58.6952402 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 16.20937066 |
Sum | 358647874 |
Variance | 77878.20878 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 5124725 | 36.8% |
1 | 137651 | 1.0% |
3 | 35474 | 0.3% |
2 | 33307 | 0.2% |
4 | 29262 | 0.2% |
6 | 24181 | 0.2% |
7 | 19331 | 0.1% |
5 | 19320 | 0.1% |
9 | 14088 | 0.1% |
8 | 13653 | 0.1% |
Other values (4135) | 659348 | 4.7% |
(Missing) | 7816731 | 56.1% |
Value | Count | Frequency (%) |
-9 | 6 | < 0.1% |
-8 | 7 | < 0.1% |
-7 | 2 | < 0.1% |
-6 | 12 | < 0.1% |
-5 | 3 | < 0.1% |
Value | Count | Frequency (%) |
84575 | 1 | < 0.1% |
84573 | 1 | < 0.1% |
29250 | 4 | < 0.1% |
5184 | 1 | < 0.1% |
5164 | 1 | < 0.1% |
pmts_month_158T
Real number (ℝ)
MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 10075311 |
Missing (%) | 72.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 106.3 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.452052978 |
---|---|
Coefficient of variation (CV) | 0.5310850735 |
Kurtosis | -1.216783239 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 25036440 |
Variance | 11.91666976 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 320980 | 2.3% |
3 | 320980 | 2.3% |
4 | 320980 | 2.3% |
5 | 320980 | 2.3% |
6 | 320980 | 2.3% |
7 | 320980 | 2.3% |
8 | 320980 | 2.3% |
9 | 320980 | 2.3% |
10 | 320980 | 2.3% |
11 | 320980 | 2.3% |
Other values (2) | 641960 | 4.6% |
(Missing) | 10075311 | 72.3% |
Value | Count | Frequency (%) |
1 | 320980 | 2.3% |
2 | 320980 | 2.3% |
3 | 320980 | 2.3% |
4 | 320980 | 2.3% |
5 | 320980 | 2.3% |
Value | Count | Frequency (%) |
12 | 320980 | 2.3% |
11 | 320980 | 2.3% |
10 | 320980 | 2.3% |
9 | 320980 | 2.3% |
8 | 320980 | 2.3% |
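Because every month from 1 to 12 occurs exactly 320980 times, pmts_month_158T is a discrete uniform distribution, and its mean, variance, and kurtosis follow from closed-form moments. The sketch below computes the population moments exactly; the report's values differ in the last digits, which is consistent with sample-size corrections (an assumption about how the profiler computes them, not something the report states).

```python
from fractions import Fraction

n = 12                 # months 1..12
count_per_month = 320980
n_obs = n * count_per_month

values = range(1, n + 1)
mean = Fraction(sum(values), n)
# Population (biased) central moments of the uniform distribution on 1..12.
variance = Fraction(sum((v - mean) ** 2 for v in values), n)
m4 = Fraction(sum((v - mean) ** 4 for v in values), n)
excess_kurtosis = m4 / variance ** 2 - 3

# Sample variance applies the n/(n-1) correction to the population variance.
sample_variance = float(variance) * n_obs / (n_obs - 1)

print(f"Mean: {float(mean)}")
print(f"Population variance: {float(variance):.8f}")
print(f"Sample variance: {sample_variance:.8f}")
print(f"Excess kurtosis: {float(excess_kurtosis):.9f}")
```

The symmetric layout of the counts also explains the reported skewness of 0.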
pmts_month_706T
Real number (ℝ)
MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 1040751 |
Missing (%) | 7.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 106.3 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.452052663 |
---|---|
Coefficient of variation (CV) | 0.5310850252 |
Kurtosis | -1.216783223 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 83761080 |
Variance | 11.91666759 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 1073860 | 7.7% |
3 | 1073860 | 7.7% |
4 | 1073860 | 7.7% |
5 | 1073860 | 7.7% |
6 | 1073860 | 7.7% |
7 | 1073860 | 7.7% |
8 | 1073860 | 7.7% |
9 | 1073860 | 7.7% |
10 | 1073860 | 7.7% |
11 | 1073860 | 7.7% |
Other values (2) | 2147720 | 15.4% |
Value | Count | Frequency (%) |
1 | 1073860 | 7.7% |
2 | 1073860 | 7.7% |
3 | 1073860 | 7.7% |
4 | 1073860 | 7.7% |
5 | 1073860 | 7.7% |
Value | Count | Frequency (%) |
12 | 1073860 | 7.7% |
11 | 1073860 | 7.7% |
10 | 1073860 | 7.7% |
9 | 1073860 | 7.7% |
8 | 1073860 | 7.7% |
pmts_overdue_1140A
Real number (ℝ)
MISSING, SKEWED, ZEROS
Distinct | 69983 |
---|---|
Distinct (%) | 3.4% |
Missing | 11871027 |
Missing (%) | 85.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 599.1327471 |
Minimum | 0 |
---|---|
Maximum | 5881917.5 |
Zeros | 1969331 |
Zeros (%) | 14.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 106.3 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 5881917.5 |
Range | 5881917.5 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 30430.77891 |
---|---|
Coefficient of variation (CV) | 50.79137981 |
Kurtosis | 22225.41324 |
Mean | 599.1327471 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 137.8117801 |
Sum | 1231843290 |
Variance | 926032305.1 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1969331 | 14.1% |
10 | 207 | < 0.1% |
14 | 111 | < 0.1% |
400 | 96 | < 0.1% |
99.8 | 94 | < 0.1% |
0.2 | 89 | < 0.1% |
4 | 80 | < 0.1% |
0.4 | 80 | < 0.1% |
1.6 | 71 | < 0.1% |
1.2 | 66 | < 0.1% |
Other values (69973) | 85819 | 0.6% |
(Missing) | 11871027 | 85.2% |
Value | Count | Frequency (%) |
0 | 1969331 | 14.1% |
0.002 | 10 | < 0.1% |
0.004 | 6 | < 0.1% |
0.006 | 4 | < 0.1% |
0.008 | 9 | < 0.1% |
Value | Count | Frequency (%) |
5881917.5 | 17 | < 0.1% |
5881667.5 | 1 | < 0.1% |
5881552.5 | 1 | < 0.1% |
5881277.5 | 3 | < 0.1% |
5878422 | 2 | < 0.1% |
pmts_overdue_1152A
Real number (ℝ)
MISSING, SKEWED, ZEROS
Distinct | 448986 |
---|---|
Distinct (%) | 7.3% |
Missing | 7809419 |
Missing (%) | 56.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3954.427561 |
Minimum | 0 |
---|---|
Maximum | 51793576 |
Zeros | 5090863 |
Zeros (%) | 36.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 106.3 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 16063.50675 |
Maximum | 51793576 |
Range | 51793576 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 87539.96896 |
---|---|
Coefficient of variation (CV) | 22.13720383 |
Kurtosis | 142902.378 |
Mean | 3954.427561 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 314.5818664 |
Sum | 2.419181168 × 10^10 |
Variance | 7663246166 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 5090863 | 36.6% |
0.2 | 2936 | < 0.1% |
1000 | 1581 | < 0.1% |
0.4 | 1453 | < 0.1% |
0.8 | 1110 | < 0.1% |
2000 | 1086 | < 0.1% |
2 | 1025 | < 0.1% |
1.6 | 1005 | < 0.1% |
0.6 | 930 | < 0.1% |
1 | 927 | < 0.1% |
Other values (448976) | 1014736 | 7.3% |
(Missing) | 7809419 | 56.1% |
Value | Count | Frequency (%) |
0 | 5090863 | 36.6% |
0.002 | 106 | < 0.1% |
0.004 | 36 | < 0.1% |
0.006 | 56 | < 0.1% |
0.008 | 40 | < 0.1% |
Value | Count | Frequency (%) |
51793576 | 1 | < 0.1% |
51169692 | 1 | < 0.1% |
49788236 | 1 | < 0.1% |
48674156 | 1 | < 0.1% |
47470956 | 1 | < 0.1% |
pmts_year_1139T
Real number (ℝ)
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 10075311 |
Missing (%) | 72.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2019.364831 |
Minimum | 2016 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 106.3 MiB |
Quantile statistics
Minimum | 2016 |
---|---|
5-th percentile | 2018 |
Q1 | 2019 |
median | 2019 |
Q3 | 2020 |
95-th percentile | 2020 |
Maximum | 2021 |
Range | 5 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 0.7904653126 |
---|---|
Coefficient of variation (CV) | 0.000391442547 |
Kurtosis | -0.6875733162 |
Mean | 2019.364831 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.2852906722 |
Sum | 7778108680 |
Variance | 0.6248354104 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2020 | 1724349 | 12.4% |
2019 | 1370885 | 9.8% |
2018 | 610446 | 4.4% |
2021 | 145888 | 1.0% |
2017 | 137 | < 0.1% |
2016 | 55 | < 0.1% |
(Missing) | 10075311 | 72.3% |
Value | Count | Frequency (%) |
2016 | 55 | < 0.1% |
2017 | 137 | < 0.1% |
2018 | 610446 | 4.4% |
2019 | 1370885 | 9.8% |
2020 | 1724349 | 12.4% |
Value | Count | Frequency (%) |
2021 | 145888 | 1.0% |
2020 | 1724349 | 12.4% |
2019 | 1370885 | 9.8% |
2018 | 610446 | 4.4% |
2017 | 137 | < 0.1% |
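With only six distinct years, the Sum, Mean, and Missing figures for pmts_year_1139T can be recomputed from the frequency table alone. A minimal check, using the total of 13927071 observations from the dataset statistics:

```python
# Year -> count, copied from the pmts_year_1139T value table.
counts = {
    2016: 55,
    2017: 137,
    2018: 610446,
    2019: 1370885,
    2020: 1724349,
    2021: 145888,
}
n_rows = 13_927_071  # number of observations in the dataset

n_present = sum(counts.values())
total = sum(year * count for year, count in counts.items())
mean = total / n_present          # count-weighted mean over non-missing rows
missing = n_rows - n_present

print(f"Sum: {total}")
print(f"Mean: {mean:.6f}")
print(f"Missing: {missing} ({100 * missing / n_rows:.1f}%)")
```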
pmts_year_507T
Real number (ℝ)
MISSING
 
Distinct | 21 |
---|---|
Distinct (%) | < 0.1% |
Missing | 1040751 |
Missing (%) | 7.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2014.879039 |
Minimum | 2001 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 106.3 MiB |
Quantile statistics
Minimum | 2001 |
---|---|
5-th percentile | 2007 |
Q1 | 2012 |
median | 2016 |
Q3 | 2018 |
95-th percentile | 2020 |
Maximum | 2021 |
Range | 20 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 4.02004628 |
---|---|
Coefficient of variation (CV) | 0.001995179959 |
Kurtosis | -0.4722344126 |
Mean | 2014.879039 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.7492984691 |
Sum | 2.596437606 × 10^10 |
Variance | 16.16077209 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2018 | 1882027 | 13.5% |
2019 | 1801228 | 12.9% |
2017 | 1445462 | 10.4% |
2016 | 1061050 | 7.6% |
2015 | 945567 | 6.8% |
2014 | 869514 | 6.2% |
2013 | 751105 | 5.4% |
2020 | 707133 | 5.1% |
2012 | 614324 | 4.4% |
2011 | 531763 | 3.8% |
Other values (11) | 2277147 | 16.4% |
(Missing) | 1040751 | 7.5% |
Value | Count | Frequency (%) |
2001 | 44 | < 0.1% |
2002 | 257 | < 0.1% |
2003 | 760 | < 0.1% |
2004 | 25543 | 0.2% |
2005 | 126528 | 0.9% |
Value | Count | Frequency (%) |
2021 | 50720 | 0.4% |
2020 | 707133 | 5.1% |
2019 | 1801228 | 12.9% |
2018 | 1882027 | 13.5% |
2017 | 1445462 | 10.4% |
Distinct | 10 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 106.3 MiB |
Value | Count | Frequency (%) |
a55475b1 | 13377486 | 96.1% |
ab3c25cf | 541038 | 3.9% |
15f04f45 | 4328 | < 0.1% |
be4fd70b | 2821 | < 0.1% |
daf49a8a | 1355 | < 0.1% |
0c42a10e | 20 | < 0.1% |
71ddaa88 | 14 | < 0.1% |
1d94eac1 | 6 | < 0.1% |
652d52e3 | 2 | < 0.1% |
9ba4314a | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 40682156 | 36.5% |
b | 13924167 | 12.5% |
a | 13922645 | 12.5% |
4 | 13390346 | 12.0% |
1 | 13381861 | 12.0% |
7 | 13380321 | 12.0% |
c | 1082102 | 1.0% |
f | 553870 | 0.5% |
2 | 541062 | 0.5% |
3 | 541041 | 0.5% |
Other values (6) | 16997 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 81926723 | 73.5% |
Lowercase Letter | 29489845 | 26.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 40682156 | 49.7% |
4 | 13390346 | 16.3% |
1 | 13381861 | 16.3% |
7 | 13380321 | 16.3% |
2 | 541062 | 0.7% |
3 | 541041 | 0.7% |
0 | 7189 | < 0.1% |
8 | 1383 | < 0.1% |
9 | 1362 | < 0.1% |
6 | 2 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 13924167 | 47.2% |
a | 13922645 | 47.2% |
c | 1082102 | 3.7% |
f | 553870 | 1.9% |
d | 4212 | < 0.1% |
e | 2849 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 81926723 | 73.5% |
Latin | 29489845 | 26.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 40682156 | 49.7% |
4 | 13390346 | 16.3% |
1 | 13381861 | 16.3% |
7 | 13380321 | 16.3% |
2 | 541062 | 0.7% |
3 | 541041 | 0.7% |
0 | 7189 | < 0.1% |
8 | 1383 | < 0.1% |
9 | 1362 | < 0.1% |
6 | 2 | < 0.1% |
Latin
Value | Count | Frequency (%) |
b | 13924167 | 47.2% |
a | 13922645 | 47.2% |
c | 1082102 | 3.7% |
f | 553870 | 1.9% |
d | 4212 | < 0.1% |
e | 2849 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 111416568 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 40682156 | 36.5% |
b | 13924167 | 12.5% |
a | 13922645 | 12.5% |
4 | 13390346 | 12.0% |
1 | 13381861 | 12.0% |
7 | 13380321 | 12.0% |
c | 1082102 | 1.0% |
f | 553870 | 0.5% |
2 | 541062 | 0.5% |
3 | 541041 | 0.5% |
Other values (6) | 16997 | < 0.1% |
Distinct | 7 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 106.3 MiB |
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 8.00001594 |
Min length | 8 |
Characters and Unicode
Total characters | 111416790 |
---|---|
Distinct characters | 17 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | ab3c25cf |
---|---|
2nd row | a55475b1 |
3rd row | a55475b1 |
4th row | a55475b1 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 13780319 | 98.9% |
ab3c25cf | 143426 | 1.0% |
be4fd70b | 1339 | < 0.1% |
15f04f45 | 894 | < 0.1% |
daf49a8a | 870 | < 0.1% |
p28_48_88 | 222 | < 0.1% |
71ddaa88 | 1 | < 0.1% |
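The length and character totals for this variable follow from the value table above: every value is 8 characters long except the single 9-character value, which occurs 222 times. The sketch below recomputes them; the keys are used exactly as they appear in the table.

```python
# Value -> count, copied from the value table above.
value_counts = {
    "a55475b1": 13780319,
    "ab3c25cf": 143426,
    "be4fd70b": 1339,
    "15f04f45": 894,
    "daf49a8a": 870,
    "p28_48_88": 222,
    "71ddaa88": 1,
}

n_rows = sum(value_counts.values())
total_chars = sum(len(value) * count for value, count in value_counts.items())
mean_length = total_chars / n_rows
lengths = [len(value) for value in value_counts]
n_unique = sum(1 for count in value_counts.values() if count == 1)
underscores = sum(value.count("_") * count for value, count in value_counts.items())

print(f"Total characters: {total_chars}")
print(f"Mean length: {mean_length:.8f}")
print(f"Min / max length: {min(lengths)} / {max(lengths)}")
print(f"Unique values: {n_unique}")
print(f"Underscore count: {underscores}")
```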
Most occurring characters
Value | Count | Frequency (%) |
5 | 41486171 | 37.2% |
b | 13926423 | 12.5% |
a | 13926357 | 12.5% |
4 | 13784538 | 12.4% |
7 | 13781659 | 12.4% |
1 | 13781214 | 12.4% |
c | 286852 | 0.3% |
f | 147423 | 0.1% |
2 | 143648 | 0.1% |
3 | 143426 | 0.1% |
Other values (7) | 9079 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 83125519 | 74.6% |
Lowercase Letter | 28290605 | 25.4% |
Connector Punctuation | 444 | < 0.1% |
Uppercase Letter | 222 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 41486171 | 49.9% |
4 | 13784538 | 16.6% |
7 | 13781659 | 16.6% |
1 | 13781214 | 16.6% |
2 | 143648 | 0.2% |
3 | 143426 | 0.2% |
0 | 2233 | < 0.1% |
8 | 1760 | < 0.1% |
9 | 870 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 13926423 | 49.2% |
a | 13926357 | 49.2% |
c | 286852 | 1.0% |
f | 147423 | 0.5% |
d | 2211 | < 0.1% |
e | 1339 | < 0.1% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 444 | 100.0% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 222 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 83125963 | 74.6% |
Latin | 28290827 | 25.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 41486171 | 49.9% |
4 | 13784538 | 16.6% |
7 | 13781659 | 16.6% |
1 | 13781214 | 16.6% |
2 | 143648 | 0.2% |
3 | 143426 | 0.2% |
0 | 2233 | < 0.1% |
8 | 1760 | < 0.1% |
9 | 870 | < 0.1% |
_ | 444 | < 0.1% |
Latin
Value | Count | Frequency (%) |
b | 13926423 | 49.2% |
a | 13926357 | 49.2% |
c | 286852 | 1.0% |
f | 147423 | 0.5% |
d | 2211 | < 0.1% |
e | 1339 | < 0.1% |
P | 222 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 111416790 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 41486171 | 37.2% |
b | 13926423 | 12.5% |
a | 13926357 | 12.5% |
4 | 13784538 | 12.4% |
7 | 13781659 | 12.4% |
1 | 13781214 | 12.4% |
c | 286852 | 0.3% |
f | 147423 | 0.1% |
2 | 143648 | 0.1% |
3 | 143426 | 0.1% |
Other values (7) | 9079 | < 0.1% |