Dataset statistics
Number of variables | 19 |
---|---|
Number of observations | 17893536 |
Missing cells | 114892314 |
Missing cells (%) | 33.8% |
Total size in memory | 2.5 GiB |
Average record size in memory | 152.0 B |
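The overview figures can be cross-checked against one another. The following is a minimal sketch that uses only the numbers reported above; the variable names are illustrative, and the 8 bytes per stored value is an assumption (it is not stated by the report, but it is consistent with the reported sizes).

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 19
n_observations = 17_893_536
missing_cells = 114_892_314

# Share of missing cells over the full rows x columns grid.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
print(f"Missing cells: {missing_pct:.1f}%")

# Assumption: every column stores one 8-byte value per row
# (a float64, or a pointer for text columns).
bytes_per_value = 8
column_bytes = bytes_per_value * n_observations
record_bytes = bytes_per_value * n_variables
total_bytes = n_variables * column_bytes

print(f"Per-column memory: {column_bytes / 2**20:.1f} MiB")
print(f"Average record size: {record_bytes} B")
print(f"Total size: {total_bytes / 2**30:.1f} GiB")
```

Under that assumption the per-column, per-record, and total sizes all follow from the row and column counts alone.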
Variable types
Numeric | 13 |
---|---|
Text | 6 |
collater_valueofguarantee_1124L has 17591919 (98.3%) missing values | Missing |
collater_valueofguarantee_876L has 17306035 (96.7%) missing values | Missing |
pmts_dpd_1073P has 14018715 (78.3%) missing values | Missing |
pmts_dpd_303P has 11481091 (64.2%) missing values | Missing |
pmts_month_158T has 10312896 (57.6%) missing values | Missing |
pmts_month_706T has 4192740 (23.4%) missing values | Missing |
pmts_overdue_1140A has 14008893 (78.3%) missing values | Missing |
pmts_overdue_1152A has 11474389 (64.1%) missing values | Missing |
pmts_year_1139T has 10312896 (57.6%) missing values | Missing |
pmts_year_507T has 4192740 (23.4%) missing values | Missing |
collater_valueofguarantee_1124L is highly skewed (γ1 = 93.02870407) | Skewed |
collater_valueofguarantee_876L is highly skewed (γ1 = 27.59180471) | Skewed |
pmts_dpd_303P is highly skewed (γ1 = 50.67888568) | Skewed |
pmts_overdue_1140A is highly skewed (γ1 = 206.234823) | Skewed |
pmts_overdue_1152A is highly skewed (γ1 = 224.8312526) | Skewed |
collater_valueofguarantee_1124L has 280131 (1.6%) zeros | Zeros |
collater_valueofguarantee_876L has 516710 (2.9%) zeros | Zeros |
num_group1 has 4888627 (27.3%) zeros | Zeros |
num_group2 has 705366 (3.9%) zeros | Zeros |
pmts_dpd_1073P has 3646455 (20.4%) zeros | Zeros |
pmts_dpd_303P has 5378239 (30.1%) zeros | Zeros |
pmts_overdue_1140A has 3652565 (20.4%) zeros | Zeros |
pmts_overdue_1152A has 5336097 (29.8%) zeros | Zeros |
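The Skewed alerts above are consistent with a simple cut-off on the absolute skewness γ1. The sketch below assumes a threshold of 20, which is not stated anywhere in this report (we believe it matches the ydata-profiling default, but treat it as an assumption). The skewness values are copied from the per-variable sections of this report.

```python
# Assumed cut-off; the report itself does not state the threshold.
SKEWNESS_THRESHOLD = 20

# Skewness (γ1) values as reported in the per-variable sections.
skewness = {
    "case_id": 0.1888454156,
    "collater_valueofguarantee_1124L": 93.02870407,
    "collater_valueofguarantee_876L": 27.59180471,
    "num_group1": 9.524976471,
    "pmts_dpd_1073P": 16.91171771,
    "pmts_dpd_303P": 50.67888568,
    "pmts_overdue_1140A": 206.234823,
    "pmts_overdue_1152A": 224.8312526,
}

# A variable is flagged when |γ1| exceeds the threshold.
flagged = sorted(
    name for name, g1 in skewness.items() if abs(g1) > SKEWNESS_THRESHOLD
)
for name in flagged:
    print(f"{name} is highly skewed (γ1 = {skewness[name]})")
```

With this cut-off, pmts_dpd_1073P (γ1 ≈ 16.9) and num_group1 (γ1 ≈ 9.5) stay unflagged, which matches the alert list.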
Reproduction
Analysis started | 2024-02-13 19:44:47.246727 |
---|---|
Analysis finished | 2024-02-13 19:45:28.041616 |
Duration | 40.79 seconds |
Software version | ydata-profiling v4.6.4 |
Download configuration | config.json |
case_id
Real number (ℝ)
Distinct | 156749 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1323076.851 |
Minimum | 13927 |
---|---|
Maximum | 2593511 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 136.5 MiB |
Quantile statistics
Minimum | 13927 |
---|---|
5-th percentile | 133731 |
Q1 | 727795 |
median | 1399594 |
Q3 | 1427579 |
95-th percentile | 2588352 |
Maximum | 2593511 |
Range | 2579584 |
Interquartile range (IQR) | 699784 |
Descriptive statistics
Standard deviation | 702400.9226 |
---|---|
Coefficient of variation (CV) | 0.5308844473 |
Kurtosis | -0.1877919764 |
Mean | 1323076.851 |
Median Absolute Deviation (MAD) | 33577 |
Skewness | 0.1888454156 |
Sum | 2.367452326 × 10^13 |
Variance | 4.933670561 × 10^11 |
Monotonicity | Increasing |
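Several of the case_id statistics above are derived from the others. A short sketch of those relationships, using only values from the tables (variable names are illustrative):

```python
import math

# Values copied from the case_id tables.
minimum, maximum = 13_927, 2_593_511
q1, q3 = 727_795, 1_427_579
mean, std = 1_323_076.851, 702_400.9226

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance is the squared standard deviation

print(value_range, iqr)
print(f"CV = {cv:.6f}")
print(f"Variance = {variance:.6e}")
```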
Value | Count | Frequency (%) |
1383556 | 4209 | < 0.1% |
1396592 | 3336 | < 0.1% |
140807 | 1752 | < 0.1% |
1404470 | 1704 | < 0.1% |
1425225 | 1680 | < 0.1% |
1390546 | 1524 | < 0.1% |
1424070 | 1488 | < 0.1% |
1395337 | 1440 | < 0.1% |
141012 | 1404 | < 0.1% |
1420355 | 1356 | < 0.1% |
Other values (156739) | 17873643 |
Value | Count | Frequency (%) |
13927 | 36 | < 0.1% |
13994 | 24 | < 0.1% |
14050 | 96 | |
14051 | 36 | < 0.1% |
14053 | 36 | < 0.1% |
Value | Count | Frequency (%) |
2593511 | 132 | |
2593508 | 108 | |
2593507 | 24 | < 0.1% |
2593505 | 240 | |
2593504 | 120 |
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 136.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 17591919 | |
9a0c095e | 212129 | 1.2% |
8fd95e4b | 89356 | 0.5% |
06fb9ba8 | 132 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 53077242 | |
a | 17804180 | 12.4% |
b | 17681539 | 12.4% |
4 | 17681275 | 12.4% |
7 | 17591919 | 12.3% |
1 | 17591919 | 12.3% |
9 | 513746 | 0.4% |
0 | 424390 | 0.3% |
e | 301485 | 0.2% |
c | 212129 | 0.1% |
Other values (4) | 268464 | 0.2% |
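The character table above follows directly from the value counts of this text variable: each character is counted once per occurrence in a value, multiplied by the number of rows holding that value. A minimal sketch that recomputes it (the dictionary simply restates the value-count table; names are illustrative):

```python
from collections import Counter

# Value counts copied from the table for this variable.
value_counts = {
    "a55475b1": 17_591_919,
    "9a0c095e": 212_129,
    "8fd95e4b": 89_356,
    "06fb9ba8": 132,
}

# Weight every character occurrence by the number of rows with that value.
char_counts = Counter()
for value, n_rows in value_counts.items():
    for ch in value:
        char_counts[ch] += n_rows

total_chars = sum(char_counts.values())
for ch, n in char_counts.most_common(5):
    print(ch, n, f"{100 * n / total_chars:.1f}%")
```

Because every value here is 8 characters long, the total is 8 times the number of observations, which is the ASCII block count shown in the report.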
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 106970111 | |
Lowercase Letter | 36178177 | 25.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 53077242 | |
4 | 17681275 | 16.5% |
7 | 17591919 | 16.4% |
1 | 17591919 | 16.4% |
9 | 513746 | 0.5% |
0 | 424390 | 0.4% |
8 | 89488 | 0.1% |
6 | 132 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 17804180 | |
b | 17681539 | |
e | 301485 | 0.8% |
c | 212129 | 0.6% |
f | 89488 | 0.2% |
d | 89356 | 0.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 106970111 | |
Latin | 36178177 | 25.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 53077242 | |
4 | 17681275 | 16.5% |
7 | 17591919 | 16.4% |
1 | 17591919 | 16.4% |
9 | 513746 | 0.5% |
0 | 424390 | 0.4% |
8 | 89488 | 0.1% |
6 | 132 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 17804180 | |
b | 17681539 | |
e | 301485 | 0.8% |
c | 212129 | 0.6% |
f | 89488 | 0.2% |
d | 89356 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 143148288 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 53077242 | |
a | 17804180 | 12.4% |
b | 17681539 | 12.4% |
4 | 17681275 | 12.4% |
7 | 17591919 | 12.3% |
1 | 17591919 | 12.3% |
9 | 513746 | 0.4% |
0 | 424390 | 0.3% |
e | 301485 | 0.2% |
c | 212129 | 0.1% |
Other values (4) | 268464 | 0.2% |
Distinct | 7 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 136.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 17306035 | |
9a0c095e | 331844 | 1.9% |
8fd95e4b | 254546 | 1.4% |
06fb9ba8 | 963 | < 0.1% |
3cbe86ba | 145 | < 0.1% |
c7a5ad39 | 2 | < 0.1% |
f4d8a027 | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 52504497 | |
a | 17638992 | 12.3% |
b | 17562797 | 12.3% |
4 | 17560582 | 12.3% |
7 | 17306038 | 12.1% |
1 | 17306035 | 12.1% |
9 | 919199 | 0.6% |
0 | 664652 | 0.5% |
e | 586535 | 0.4% |
c | 331991 | 0.2% |
Other values (6) | 766970 | 0.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 106517914 | |
Lowercase Letter | 36630374 | 25.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 52504497 | |
4 | 17560582 | 16.5% |
7 | 17306038 | 16.2% |
1 | 17306035 | 16.2% |
9 | 919199 | 0.9% |
0 | 664652 | 0.6% |
8 | 255655 | 0.2% |
6 | 1108 | < 0.1% |
3 | 147 | < 0.1% |
2 | 1 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 17638992 | |
b | 17562797 | |
e | 586535 | 1.6% |
c | 331991 | 0.9% |
f | 255510 | 0.7% |
d | 254549 | 0.7% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 106517914 | |
Latin | 36630374 | 25.6% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 52504497 | |
4 | 17560582 | 16.5% |
7 | 17306038 | 16.2% |
1 | 17306035 | 16.2% |
9 | 919199 | 0.9% |
0 | 664652 | 0.6% |
8 | 255655 | 0.2% |
6 | 1108 | < 0.1% |
3 | 147 | < 0.1% |
2 | 1 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 17638992 | |
b | 17562797 | |
e | 586535 | 1.6% |
c | 331991 | 0.9% |
f | 255510 | 0.7% |
d | 254549 | 0.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 143148288 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 52504497 | |
a | 17638992 | 12.3% |
b | 17562797 | 12.3% |
4 | 17560582 | 12.3% |
7 | 17306038 | 12.1% |
1 | 17306035 | 12.1% |
9 | 919199 | 0.6% |
0 | 664652 | 0.5% |
e | 586535 | 0.4% |
c | 331991 | 0.2% |
Other values (6) | 766970 | 0.5% |
collater_valueofguarantee_1124L
Real number (ℝ)
Alerts: Missing, Skewed, Zeros
Distinct | 14224 |
---|---|
Distinct (%) | 4.7% |
Missing | 17591919 |
Missing (%) | 98.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1129784.99 |
Minimum | 0 |
---|---|
Maximum | 3200000000 |
Zeros | 280131 |
Zeros (%) | 1.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 136.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 3500000 |
Maximum | 3200000000 |
Range | 3200000000 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 24676664.46 |
---|---|
Coefficient of variation (CV) | 21.84191212 |
Kurtosis | 10961.52147 |
Mean | 1129784.99 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 93.02870407 |
Sum | 3.407623592 × 10^11 |
Variance | 6.089377687 × 10^14 |
Monotonicity | Not monotonic |
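The percentages for this variable use different denominators, which is easy to miss. The figures are consistent with Missing (%) and Zeros (%) being taken over all rows and Distinct (%) over non-missing rows only; this is inferred from the numbers, not stated by the report. A sketch using the reported counts:

```python
# Counts copied from the tables for collater_valueofguarantee_1124L.
n_rows = 17_893_536
n_missing = 17_591_919
n_zeros = 280_131
n_distinct = 14_224

n_present = n_rows - n_missing

missing_pct = 100 * n_missing / n_rows       # over all rows
zeros_pct = 100 * n_zeros / n_rows           # over all rows
distinct_pct = 100 * n_distinct / n_present  # over non-missing rows only

# Share of zeros among the values that are actually present.
zero_share_of_present = 100 * n_zeros / n_present

print(f"Missing: {missing_pct:.1f}%  Zeros: {zeros_pct:.1f}%  Distinct: {distinct_pct:.1f}%")
print(f"Zeros among non-missing values: {zero_share_of_present:.1f}%")
```

The share of zeros among non-missing values lies between 75% and 95%, which explains why Q1, the median, and Q3 are all 0 while the 95-th percentile is not.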
Value | Count | Frequency (%) |
0 | 280131 | 1.6% |
3000000 | 151 | < 0.1% |
5000000 | 136 | < 0.1% |
4000000 | 124 | < 0.1% |
2000000 | 121 | < 0.1% |
1 | 115 | < 0.1% |
10000000 | 108 | < 0.1% |
2500000 | 85 | < 0.1% |
3500000 | 79 | < 0.1% |
6000000 | 73 | < 0.1% |
Other values (14214) | 20494 | 0.1% |
(Missing) | 17591919 |
Value | Count | Frequency (%) |
0 | 280131 | |
0.02 | 1 | < 0.1% |
1 | 115 | < 0.1% |
5 | 1 | < 0.1% |
500 | 1 | < 0.1% |
Value | Count | Frequency (%) |
3200000000 | 11 | |
1912618083 | 3 | < 0.1% |
1325758000 | 1 | < 0.1% |
1267140000 | 1 | < 0.1% |
1000000000 | 1 | < 0.1% |
collater_valueofguarantee_876L
Real number (ℝ)
Alerts: Missing, Skewed, Zeros
Distinct | 24044 |
---|---|
Distinct (%) | 4.1% |
Missing | 17306035 |
Missing (%) | 96.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3840788.337 |
Minimum | 0 |
---|---|
Maximum | 4905062000 |
Zeros | 516710 |
Zeros (%) | 2.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 136.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 228000 |
Maximum | 4905062000 |
Range | 4905062000 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 69028116.68 |
---|---|
Coefficient of variation (CV) | 17.97238239 |
Kurtosis | 963.9886887 |
Mean | 3840788.337 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 27.59180471 |
Sum | 2.256466989 × 10^12 |
Variance | 4.764880892 × 10^15 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 516710 | 2.9% |
60000 | 1829 | < 0.1% |
130000 | 1443 | < 0.1% |
100000 | 1349 | < 0.1% |
50000 | 1035 | < 0.1% |
65000 | 888 | < 0.1% |
70000 | 656 | < 0.1% |
80000 | 617 | < 0.1% |
150000 | 580 | < 0.1% |
200000 | 576 | < 0.1% |
Other values (24034) | 61818 | 0.3% |
(Missing) | 17306035 |
Value | Count | Frequency (%) |
0 | 516710 | |
0.02 | 10 | < 0.1% |
0.03 | 12 | < 0.1% |
0.04 | 4 | < 0.1% |
0.06 | 4 | < 0.1% |
Value | Count | Frequency (%) |
4905062000 | 1 | < 0.1% |
3250000000 | 60 | < 0.1% |
3200000000 | 14 | < 0.1% |
2947009633 | 1 | < 0.1% |
2000000000 | 175 |
Distinct | 15 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 136.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 17306035 | |
c7a5ad39 | 433700 | 2.4% |
3cbe86ba | 108817 | 0.6% |
9276e4bb | 16582 | 0.1% |
0e63c0f0 | 8085 | < 0.1% |
168ad9f3 | 4694 | < 0.1% |
5224034a | 3499 | < 0.1% |
7b62420e | 3401 | < 0.1% |
940efad7 | 3178 | < 0.1% |
5994c34a | 1619 | < 0.1% |
Other values (5) | 3926 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 52357543 | |
a | 18297596 | 12.8% |
7 | 17765339 | 12.4% |
b | 17561469 | 12.3% |
4 | 17341213 | 12.1% |
1 | 17314399 | 12.1% |
3 | 560414 | 0.4% |
c | 554366 | 0.4% |
9 | 461392 | 0.3% |
d | 444263 | 0.3% |
Other values (6) | 490294 | 0.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 106129696 | |
Lowercase Letter | 37018592 | 25.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 52357543 | |
7 | 17765339 | 16.7% |
4 | 17341213 | 16.3% |
1 | 17314399 | 16.3% |
3 | 560414 | 0.5% |
9 | 461392 | 0.4% |
6 | 142814 | 0.1% |
8 | 115297 | 0.1% |
0 | 36687 | < 0.1% |
2 | 34598 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 18297596 | |
b | 17561469 | |
c | 554366 | 1.5% |
d | 444263 | 1.2% |
e | 140683 | 0.4% |
f | 20215 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 106129696 | |
Latin | 37018592 | 25.9% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 52357543 | |
7 | 17765339 | 16.7% |
4 | 17341213 | 16.3% |
1 | 17314399 | 16.3% |
3 | 560414 | 0.5% |
9 | 461392 | 0.4% |
6 | 142814 | 0.1% |
8 | 115297 | 0.1% |
0 | 36687 | < 0.1% |
2 | 34598 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 18297596 | |
b | 17561469 | |
c | 554366 | 1.5% |
d | 444263 | 1.2% |
e | 140683 | 0.4% |
f | 20215 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 143148288 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 52357543 | |
a | 18297596 | 12.8% |
7 | 17765339 | 12.4% |
b | 17561469 | 12.3% |
4 | 17341213 | 12.1% |
1 | 17314399 | 12.1% |
3 | 560414 | 0.4% |
c | 554366 | 0.4% |
9 | 461392 | 0.3% |
d | 444263 | 0.3% |
Other values (6) | 490294 | 0.3% |
Distinct | 15 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 136.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 17591919 | |
c7a5ad39 | 273548 | 1.5% |
9276e4bb | 11440 | 0.1% |
0e63c0f0 | 7695 | < 0.1% |
7b62420e | 3074 | < 0.1% |
168ad9f3 | 3057 | < 0.1% |
940efad7 | 853 | < 0.1% |
3cbe86ba | 510 | < 0.1% |
f4d8a027 | 466 | < 0.1% |
46ab00a7 | 333 | < 0.1% |
Other values (5) | 641 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 53049623 | |
a | 18144816 | 12.7% |
7 | 17881717 | 12.5% |
b | 17619305 | 12.3% |
4 | 17608578 | 12.3% |
1 | 17595686 | 12.3% |
9 | 288976 | 0.2% |
3 | 285054 | 0.2% |
c | 282184 | 0.2% |
d | 278242 | 0.2% |
Other values (6) | 114107 | 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 106787383 | |
Lowercase Letter | 36360905 | 25.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 53049623 | |
7 | 17881717 | 16.7% |
4 | 17608578 | 16.5% |
1 | 17595686 | 16.5% |
9 | 288976 | 0.3% |
3 | 285054 | 0.3% |
0 | 28354 | < 0.1% |
6 | 26188 | < 0.1% |
2 | 19100 | < 0.1% |
8 | 4107 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 18144816 | |
b | 17619305 | |
c | 282184 | 0.8% |
d | 278242 | 0.8% |
e | 23646 | 0.1% |
f | 12712 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 106787383 | |
Latin | 36360905 | 25.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 53049623 | |
7 | 17881717 | 16.7% |
4 | 17608578 | 16.5% |
1 | 17595686 | 16.5% |
9 | 288976 | 0.3% |
3 | 285054 | 0.3% |
0 | 28354 | < 0.1% |
6 | 26188 | < 0.1% |
2 | 19100 | < 0.1% |
8 | 4107 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 18144816 | |
b | 17619305 | |
c | 282184 | 0.8% |
d | 278242 | 0.8% |
e | 23646 | 0.1% |
f | 12712 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 143148288 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 53049623 | |
a | 18144816 | 12.7% |
7 | 17881717 | 12.5% |
b | 17619305 | 12.3% |
4 | 17608578 | 12.3% |
1 | 17595686 | 12.3% |
9 | 288976 | 0.2% |
3 | 285054 | 0.2% |
c | 282184 | 0.2% |
d | 278242 | 0.2% |
Other values (6) | 114107 | 0.1% |
num_group1
Real number (ℝ)
Alerts: Zeros
Distinct | 243 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.748604859 |
Minimum | 0 |
---|---|
Maximum | 242 |
Zeros | 4888627 |
Zeros (%) | 27.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 136.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 2 |
Q3 | 5 |
95-th percentile | 13 |
Maximum | 242 |
Range | 242 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 5.886697029 |
---|---|
Coefficient of variation (CV) | 1.570370111 |
Kurtosis | 240.7655965 |
Mean | 3.748604859 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 9.524976471 |
Sum | 67075796 |
Variance | 34.65320191 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 4888627 | |
1 | 3118173 | |
2 | 2077950 | |
3 | 1509896 | 8.4% |
4 | 1171802 | 6.5% |
5 | 948040 | 5.3% |
6 | 772672 | 4.3% |
7 | 629147 | 3.5% |
8 | 511118 | 2.9% |
9 | 414183 | 2.3% |
Other values (233) | 1851928 | 10.3% |
Value | Count | Frequency (%) |
0 | 4888627 | |
1 | 3118173 | |
2 | 2077950 | |
3 | 1509896 | 8.4% |
4 | 1171802 | 6.5% |
Value | Count | Frequency (%) |
242 | 36 | |
241 | 24 | |
240 | 12 | < 0.1% |
239 | 24 | |
238 | 24 |
num_group2
Real number (ℝ)
Alerts: Zeros
Distinct | 36 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.75626578 |
Minimum | 0 |
---|---|
Maximum | 35 |
Zeros | 705366 |
Zeros (%) | 3.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 136.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 6 |
median | 12 |
Q3 | 21 |
95-th percentile | 32 |
Maximum | 35 |
Range | 35 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 9.444509069 |
---|---|
Coefficient of variation (CV) | 0.6865605259 |
Kurtosis | -0.7450781045 |
Mean | 13.75626578 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 0.4526629906 |
Sum | 246148237 |
Variance | 89.19875156 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 705366 | 3.9% |
1 | 705282 | 3.9% |
2 | 705279 | 3.9% |
3 | 705278 | 3.9% |
4 | 705278 | 3.9% |
5 | 705277 | 3.9% |
6 | 705277 | 3.9% |
7 | 705277 | 3.9% |
8 | 705277 | 3.9% |
9 | 705277 | 3.9% |
Other values (26) | 10840668 |
Value | Count | Frequency (%) |
0 | 705366 | |
1 | 705282 | |
2 | 705279 | |
3 | 705278 | |
4 | 705278 |
Value | Count | Frequency (%) |
35 | 240097 | |
34 | 240097 | |
33 | 240097 | |
32 | 240097 | |
31 | 240097 |
pmts_dpd_1073P
Real number (ℝ)
Alerts: Missing, Zeros
Distinct | 3636 |
---|---|
Distinct (%) | 0.1% |
Missing | 14018715 |
Missing (%) | 78.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11.33832944 |
Minimum | 0 |
---|---|
Maximum | 4455 |
Zeros | 3646455 |
Zeros (%) | 20.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 136.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 1 |
Maximum | 4455 |
Range | 4455 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 129.55736 |
---|---|
Coefficient of variation (CV) | 11.4264946 |
Kurtosis | 350.6340232 |
Mean | 11.33832944 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 16.91171771 |
Sum | 43933997 |
Variance | 16785.10954 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 3646455 | 20.4% |
1 | 35584 | 0.2% |
3 | 13830 | 0.1% |
2 | 13671 | 0.1% |
4 | 13102 | 0.1% |
7 | 8038 | < 0.1% |
5 | 7333 | < 0.1% |
6 | 7257 | < 0.1% |
10 | 5482 | < 0.1% |
8 | 5470 | < 0.1% |
Other values (3626) | 118599 | 0.7% |
(Missing) | 14018715 |
Value | Count | Frequency (%) |
0 | 3646455 | |
1 | 35584 | 0.2% |
2 | 13671 | 0.1% |
3 | 13830 | 0.1% |
4 | 13102 | 0.1% |
Value | Count | Frequency (%) |
4455 | 1 | |
4445 | 1 | |
4423 | 1 | |
4391 | 1 | |
4365 | 1 |
pmts_dpd_303P
Real number (ℝ)
Alerts: Missing, Skewed, Zeros
Distinct | 3939 |
---|---|
Distinct (%) | 0.1% |
Missing | 11481091 |
Missing (%) | 64.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 52.53682815 |
Minimum | -11 |
---|---|
Maximum | 117000 |
Zeros | 5378239 |
Zeros (%) | 30.1% |
Negative | 1070 |
Negative (%) | < 0.1% |
Memory size | 136.5 MiB |
Quantile statistics
Minimum | -11 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 316 |
Maximum | 117000 |
Range | 117011 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 271.1978107 |
---|---|
Coefficient of variation (CV) | 5.162051465 |
Kurtosis | 15096.84724 |
Mean | 52.53682815 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 50.67888568 |
Sum | 336889521 |
Variance | 73548.25254 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 5378239 | |
1 | 155702 | 0.9% |
3 | 39667 | 0.2% |
2 | 37077 | 0.2% |
4 | 32617 | 0.2% |
6 | 27457 | 0.2% |
5 | 21786 | 0.1% |
7 | 21024 | 0.1% |
9 | 14802 | 0.1% |
8 | 14588 | 0.1% |
Other values (3929) | 669486 | 3.7% |
(Missing) | 11481091 |
Value | Count | Frequency (%) |
-11 | 1 | < 0.1% |
-10 | 10 | |
-9 | 12 | |
-8 | 6 | |
-7 | 3 | < 0.1% |
Value | Count | Frequency (%) |
117000 | 1 | |
84575 | 1 | |
84560 | 2 | |
84533 | 2 | |
84505 | 1 |
pmts_month_158T
Real number (ℝ)
Alerts: Missing
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 10312896 |
Missing (%) | 57.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 136.5 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.452052757 |
---|---|
Coefficient of variation (CV) | 0.5310850396 |
Kurtosis | -1.216783228 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 49274160 |
Variance | 11.91666824 |
Monotonicity | Not monotonic |
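These statistics are what a perfectly uniform month column produces: each month 1 to 12 occurs 631720 times among the non-missing rows. The sketch below reproduces the mean, sum, and variance from that fact alone. It assumes the reported variance is the sample variance (divisor n − 1), which is an inference from the digits rather than something the report states.

```python
# Each month 1..12 appears equally often among non-missing rows.
months = range(1, 13)
count_per_month = 631_720
n = 12 * count_per_month  # number of non-missing rows

total = count_per_month * sum(months)  # Sum
mean = total / n                       # Mean

# Population variance of the values 1..12, then rescaled to the
# sample variance (assumed divisor n - 1).
pop_var = sum((m - mean) ** 2 for m in months) / 12
sample_var = pop_var * n / (n - 1)

print(total, mean)
print(f"{sample_var:.8f}")
```

The tiny excess of the variance over 143/12 comes from the n / (n − 1) correction, and the skewness of 0 follows from the symmetry of the distribution around 6.5.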
Value | Count | Frequency (%) |
2 | 631720 | 3.5% |
3 | 631720 | 3.5% |
4 | 631720 | 3.5% |
5 | 631720 | 3.5% |
6 | 631720 | 3.5% |
7 | 631720 | 3.5% |
8 | 631720 | 3.5% |
9 | 631720 | 3.5% |
10 | 631720 | 3.5% |
11 | 631720 | 3.5% |
Other values (2) | 1263440 | 7.1% |
(Missing) | 10312896 |
Value | Count | Frequency (%) |
1 | 631720 | |
2 | 631720 | |
3 | 631720 | |
4 | 631720 | |
5 | 631720 |
Value | Count | Frequency (%) |
12 | 631720 | |
11 | 631720 | |
10 | 631720 | |
9 | 631720 | |
8 | 631720 |
pmts_month_706T
Real number (ℝ)
Alerts: Missing
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 4192740 |
Missing (%) | 23.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 136.5 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.452052656 |
---|---|
Coefficient of variation (CV) | 0.5310850239 |
Kurtosis | -1.216783223 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 89055174 |
Variance | 11.91666754 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 1141733 | 6.4% |
3 | 1141733 | 6.4% |
4 | 1141733 | 6.4% |
5 | 1141733 | 6.4% |
6 | 1141733 | 6.4% |
7 | 1141733 | 6.4% |
8 | 1141733 | 6.4% |
9 | 1141733 | 6.4% |
10 | 1141733 | 6.4% |
11 | 1141733 | 6.4% |
Other values (2) | 2283466 | |
(Missing) | 4192740 |
Value | Count | Frequency (%) |
1 | 1141733 | |
2 | 1141733 | |
3 | 1141733 | |
4 | 1141733 | |
5 | 1141733 |
Value | Count | Frequency (%) |
12 | 1141733 | |
11 | 1141733 | |
10 | 1141733 | |
9 | 1141733 | |
8 | 1141733 |
pmts_overdue_1140A
Real number (ℝ)
Alerts: Missing, Skewed, Zeros
Distinct | 168579 |
---|---|
Distinct (%) | 4.3% |
Missing | 14008893 |
Missing (%) | 78.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1683.38384 |
Minimum | 0 |
---|---|
Maximum | 23891848 |
Zeros | 3652565 |
Zeros (%) | 20.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 136.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 728.335 |
Maximum | 23891848 |
Range | 23891848 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 78014.54689 |
---|---|
Coefficient of variation (CV) | 46.34388488 |
Kurtosis | 52939.56564 |
Mean | 1683.38384 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 206.234823 |
Sum | 6539345250 |
Variance | 6086269527 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 3652565 | 20.4% |
1000 | 441 | < 0.1% |
2000 | 306 | < 0.1% |
400 | 234 | < 0.1% |
10 | 232 | < 0.1% |
3000 | 230 | < 0.1% |
0.4 | 194 | < 0.1% |
2 | 193 | < 0.1% |
4 | 190 | < 0.1% |
1.6 | 186 | < 0.1% |
Other values (168569) | 229872 | 1.3% |
(Missing) | 14008893 |
Value | Count | Frequency (%) |
0 | 3652565 | |
0.002 | 15 | < 0.1% |
0.004 | 7 | < 0.1% |
0.006 | 13 | < 0.1% |
0.008 | 17 | < 0.1% |
Value | Count | Frequency (%) |
23891848 | 17 | |
17402200 | 7 | |
15768560 | 17 | |
12144611 | 10 | |
9278913 | 7 |
pmts_overdue_1152A
Real number (ℝ)
Alerts: Missing, Skewed, Zeros
Distinct | 470005 |
---|---|
Distinct (%) | 7.3% |
Missing | 11474389 |
Missing (%) | 64.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3933.6804 |
Minimum | 0 |
---|---|
Maximum | 38038588 |
Zeros | 5336097 |
Zeros (%) | 29.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 136.5 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 15315.4308 |
Maximum | 38038588 |
Range | 38038588 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 71135.53397 |
---|---|
Coefficient of variation (CV) | 18.08370959 |
Kurtosis | 72421.98985 |
Mean | 3933.6804 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 224.8312526 |
Sum | 2.525087274 × 10^10 |
Variance | 5060264194 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 5336097 | |
0.2 | 3645 | < 0.1% |
1000 | 2069 | < 0.1% |
0.4 | 1428 | < 0.1% |
2000 | 1319 | < 0.1% |
0.8 | 1121 | < 0.1% |
3000 | 1107 | < 0.1% |
1 | 1015 | < 0.1% |
2 | 1013 | < 0.1% |
1.6 | 1003 | < 0.1% |
Other values (469995) | 1069330 | 6.0% |
(Missing) | 11474389 |
Value | Count | Frequency (%) |
0 | 5336097 | |
0.002 | 97 | < 0.1% |
0.004 | 42 | < 0.1% |
0.006 | 25 | < 0.1% |
0.008 | 35 | < 0.1% |
Value | Count | Frequency (%) |
38038588 | 1 | < 0.1% |
32000000 | 2 | < 0.1% |
24400000 | 1 | < 0.1% |
24000000 | 7 | |
21444070 | 2 | < 0.1% |
pmts_year_1139T
Real number (ℝ)
Alerts: Missing
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 10312896 |
Missing (%) | 57.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2018.374944 |
Minimum | 2015 |
---|---|
Maximum | 2020 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 136.5 MiB |
Quantile statistics
Minimum | 2015 |
---|---|
5-th percentile | 2017 |
Q1 | 2018 |
median | 2018 |
Q3 | 2019 |
95-th percentile | 2019 |
Maximum | 2020 |
Range | 5 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 0.7944665286 |
---|---|
Coefficient of variation (CV) | 0.00039361692 |
Kurtosis | -0.6926642171 |
Mean | 2018.374944 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.3098220355 |
Sum | 1.530057383 × 10^10 |
Variance | 0.6311770651 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2019 | 3463369 | 19.4% |
2018 | 2614271 | 14.6% |
2017 | 1208349 | 6.8% |
2020 | 294154 | 1.6% |
2016 | 475 | < 0.1% |
2015 | 22 | < 0.1% |
(Missing) | 10312896 |
Value | Count | Frequency (%) |
2015 | 22 | < 0.1% |
2016 | 475 | < 0.1% |
2017 | 1208349 | 6.8% |
2018 | 2614271 | |
2019 | 3463369 |
Value | Count | Frequency (%) |
2020 | 294154 | 1.6% |
2019 | 3463369 | |
2018 | 2614271 | |
2017 | 1208349 | 6.8% |
2016 | 475 | < 0.1% |
pmts_year_507T
Real number (ℝ)
Alerts: Missing
Distinct | 20 |
---|---|
Distinct (%) | < 0.1% |
Missing | 4192740 |
Missing (%) | 23.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2013.929632 |
Minimum | 2001 |
---|---|
Maximum | 2020 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 136.5 MiB |
Quantile statistics
Minimum | 2001 |
---|---|
5-th percentile | 2007 |
Q1 | 2011 |
median | 2015 |
Q3 | 2017 |
95-th percentile | 2019 |
Maximum | 2020 |
Range | 19 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 3.849804474 |
---|---|
Coefficient of variation (CV) | 0.001911588376 |
Kurtosis | -0.670652374 |
Mean | 2013.929632 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.6230673825 |
Sum | 2.759243904 × 10^10 |
Variance | 14.82099449 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2018 | 1915284 | |
2017 | 1795630 | |
2016 | 1368263 | 7.6% |
2015 | 1219915 | 6.8% |
2014 | 1132030 | 6.3% |
2013 | 977085 | 5.5% |
2019 | 828225 | 4.6% |
2012 | 803078 | 4.5% |
2011 | 694279 | 3.9% |
2007 | 583142 | 3.3% |
Other values (10) | 2383865 | |
(Missing) | 4192740 |
Value | Count | Frequency (%) |
2001 | 44 | < 0.1% |
2002 | 422 | < 0.1% |
2003 | 1314 | < 0.1% |
2004 | 35173 | 0.2% |
2005 | 167593 |
Value | Count | Frequency (%) |
2020 | 60727 | 0.3% |
2019 | 828225 | |
2018 | 1915284 | |
2017 | 1795630 | |
2016 | 1368263 |
Distinct | 9 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 136.5 MiB |
Value | Count | Frequency (%) |
a55475b1 | 17312772 | |
ab3c25cf | 570195 | 3.2% |
15f04f45 | 5649 | < 0.1% |
be4fd70b | 2971 | < 0.1% |
daf49a8a | 1918 | < 0.1% |
71ddaa88 | 13 | < 0.1% |
0c42a10e | 11 | < 0.1% |
1d94eac1 | 6 | < 0.1% |
9ba4314a | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 52519809 | |
b | 17888910 | 12.5% |
a | 17888766 | 12.5% |
4 | 17328978 | 12.1% |
1 | 17318458 | 12.1% |
7 | 17315756 | 12.1% |
c | 1140407 | 0.8% |
f | 586382 | 0.4% |
2 | 570206 | 0.4% |
3 | 570196 | 0.4% |
Other values (5) | 20420 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 105635914 | |
Lowercase Letter | 37512374 | 26.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 52519809 | |
4 | 17328978 | 16.4% |
1 | 17318458 | 16.4% |
7 | 17315756 | 16.4% |
2 | 570206 | 0.5% |
3 | 570196 | 0.5% |
0 | 8642 | < 0.1% |
8 | 1944 | < 0.1% |
9 | 1925 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 17888910 | |
a | 17888766 | |
c | 1140407 | 3.0% |
f | 586382 | 1.6% |
d | 4921 | < 0.1% |
e | 2988 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 105635914 | |
Latin | 37512374 | 26.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 52519809 | |
4 | 17328978 | 16.4% |
1 | 17318458 | 16.4% |
7 | 17315756 | 16.4% |
2 | 570206 | 0.5% |
3 | 570196 | 0.5% |
0 | 8642 | < 0.1% |
8 | 1944 | < 0.1% |
9 | 1925 | < 0.1% |
Latin
Value | Count | Frequency (%) |
b | 17888910 | |
a | 17888766 | |
c | 1140407 | 3.0% |
f | 586382 | 1.6% |
d | 4921 | < 0.1% |
e | 2988 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 143148288 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 52519809 | |
b | 17888910 | 12.5% |
a | 17888766 | 12.5% |
4 | 17328978 | 12.1% |
1 | 17318458 | 12.1% |
7 | 17315756 | 12.1% |
c | 1140407 | 0.8% |
f | 586382 | 0.4% |
2 | 570206 | 0.4% |
3 | 570196 | 0.4% |
Other values (5) | 20420 | < 0.1% |
Distinct | 8 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 136.5 MiB |
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 8.000028111 |
Min length | 8 |
Characters and Unicode
Total characters | 143148791 |
---|---|
Distinct characters | 17 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | ab3c25cf |
---|---|
2nd row | a55475b1 |
3rd row | a55475b1 |
4th row | a55475b1 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 17597382 | |
ab3c25cf | 288736 | 1.6% |
be4fd70b | 2861 | < 0.1% |
15f04f45 | 2055 | < 0.1% |
daf49a8a | 1993 | < 0.1% |
p28_48_88 | 503 | < 0.1% |
71ddaa88 | 5 | < 0.1% |
0c42a10e | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 53084992 | |
a | 17892108 | 12.5% |
b | 17891840 | 12.5% |
4 | 17606850 | 12.3% |
7 | 17600248 | 12.3% |
1 | 17599443 | 12.3% |
c | 577473 | 0.4% |
f | 297700 | 0.2% |
2 | 289240 | 0.2% |
3 | 288736 | 0.2% |
Other values (7) | 20161 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 106480435 | |
Lowercase Letter | 36666847 | 25.6% |
Connector Punctuation | 1006 | < 0.1% |
Uppercase Letter | 503 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 53084992 | |
4 | 17606850 | 16.5% |
7 | 17600248 | 16.5% |
1 | 17599443 | 16.5% |
2 | 289240 | 0.3% |
3 | 288736 | 0.3% |
0 | 4918 | < 0.1% |
8 | 4015 | < 0.1% |
9 | 1993 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 17892108 | |
b | 17891840 | |
c | 577473 | 1.6% |
f | 297700 | 0.8% |
d | 4864 | < 0.1% |
e | 2862 | < 0.1% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1006 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 503 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 106481441 | |
Latin | 36667350 | 25.6% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 53084992 | |
4 | 17606850 | 16.5% |
7 | 17600248 | 16.5% |
1 | 17599443 | 16.5% |
2 | 289240 | 0.3% |
3 | 288736 | 0.3% |
0 | 4918 | < 0.1% |
8 | 4015 | < 0.1% |
9 | 1993 | < 0.1% |
_ | 1006 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 17892108 | |
b | 17891840 | |
c | 577473 | 1.6% |
f | 297700 | 0.8% |
d | 4864 | < 0.1% |
e | 2862 | < 0.1% |
P | 503 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 143148791 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 53084992 | |
a | 17892108 | 12.5% |
b | 17891840 | 12.5% |
4 | 17606850 | 12.3% |
7 | 17600248 | 12.3% |
1 | 17599443 | 12.3% |
c | 577473 | 0.4% |
f | 297700 | 0.2% |
2 | 289240 | 0.2% |
3 | 288736 | 0.2% |
Other values (7) | 20161 | < 0.1% |