Dataset statistics
Number of variables | 19 |
---|---|
Number of observations | 25511332 |
Missing cells | 161328437 |
Missing cells (%) | 33.3% |
Total size in memory | 3.6 GiB |
Average record size in memory | 152.0 B |
Variable types
Numeric | 13 |
---|---|
Text | 6 |
collater_valueofguarantee_1124L has 25182623 (98.7%) missing values | Missing |
collater_valueofguarantee_876L has 24501614 (96.0%) missing values | Missing |
pmts_dpd_1073P has 21490632 (84.2%) missing values | Missing |
pmts_dpd_303P has 14872146 (58.3%) missing values | Missing |
pmts_month_158T has 16723732 (65.6%) missing values | Missing |
pmts_month_706T has 2754316 (10.8%) missing values | Missing |
pmts_overdue_1140A has 21467596 (84.1%) missing values | Missing |
pmts_overdue_1152A has 14857730 (58.2%) missing values | Missing |
pmts_year_1139T has 16723732 (65.6%) missing values | Missing |
pmts_year_507T has 2754316 (10.8%) missing values | Missing |
collater_valueofguarantee_1124L is highly skewed (γ1 = 222.4815149) | Skewed |
collater_valueofguarantee_876L is highly skewed (γ1 = 554.2424378) | Skewed |
pmts_dpd_303P is highly skewed (γ1 = 31.47683138) | Skewed |
pmts_overdue_1140A is highly skewed (γ1 = 187.1643167) | Skewed |
pmts_overdue_1152A is highly skewed (γ1 = 2054.522814) | Skewed |
collater_valueofguarantee_1124L has 301773 (1.2%) zeros | Zeros |
collater_valueofguarantee_876L has 908887 (3.6%) zeros | Zeros |
num_group1 has 4889854 (19.2%) zeros | Zeros |
num_group2 has 1030268 (4.0%) zeros | Zeros |
pmts_dpd_1073P has 3756543 (14.7%) zeros | Zeros |
pmts_dpd_303P has 8873659 (34.8%) zeros | Zeros |
pmts_overdue_1140A has 3775346 (14.8%) zeros | Zeros |
pmts_overdue_1152A has 8809850 (34.5%) zeros | Zeros |
Reproduction
Analysis started | 2024-02-13 19:50:08.714823 |
---|---|
Analysis finished | 2024-02-13 19:50:53.306508 |
Duration | 44.59 seconds |
Software version | ydata-profiling vv4.6.4 |
Download configuration | config.json |
case_id
Real number (ℝ)
Distinct | 150426 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1442419.953 |
Minimum | 42865 |
---|---|
Maximum | 2677343 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 194.6 MiB |
Quantile statistics
Minimum | 42865 |
---|---|
5-th percentile | 196742 |
Q1 | 943104 |
median | 1770308 |
Q3 | 1804494 |
95-th percentile | 2670785 |
Maximum | 2677343 |
Range | 2634478 |
Interquartile range (IQR) | 861390 |
Descriptive statistics
Standard deviation | 818041.3973 |
---|---|
Coefficient of variation (CV) | 0.5671312265 |
Kurtosis | -0.9485639771 |
Mean | 1442419.953 |
Median Absolute Deviation (MAD) | 47817 |
Skewness | -0.3442243149 |
Sum | 3.679805431 × 1013 |
Variance | 6.691917277 × 1011 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
214097 | 9404 | < 0.1% |
941237 | 4968 | < 0.1% |
1782211 | 4008 | < 0.1% |
1805200 | 2868 | < 0.1% |
216858 | 2448 | < 0.1% |
1797208 | 2316 | < 0.1% |
212614 | 2292 | < 0.1% |
969281 | 2184 | < 0.1% |
961696 | 2040 | < 0.1% |
1805398 | 2004 | < 0.1% |
Other values (150416) | 25476800 |
Value | Count | Frequency (%) |
42865 | 24 | < 0.1% |
42911 | 108 | |
43010 | 72 | |
43055 | 168 | |
43068 | 48 | < 0.1% |
Value | Count | Frequency (%) |
2677343 | 24 | < 0.1% |
2677342 | 96 | |
2677341 | 84 | |
2677340 | 132 | |
2677339 | 48 | < 0.1% |
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 194.6 MiB |
Value | Count | Frequency (%) |
a55475b1 | 25182623 | |
9a0c095e | 214647 | 0.8% |
8fd95e4b | 113868 | 0.4% |
06fb9ba8 | 194 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 75876384 | |
a | 25397464 | 12.4% |
b | 25296879 | 12.4% |
4 | 25296491 | 12.4% |
7 | 25182623 | 12.3% |
1 | 25182623 | 12.3% |
9 | 543356 | 0.3% |
0 | 429488 | 0.2% |
e | 328515 | 0.2% |
c | 214647 | 0.1% |
Other values (4) | 342186 | 0.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 152625221 | |
Lowercase Letter | 51465435 | 25.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 75876384 | |
4 | 25296491 | 16.6% |
7 | 25182623 | 16.5% |
1 | 25182623 | 16.5% |
9 | 543356 | 0.4% |
0 | 429488 | 0.3% |
8 | 114062 | 0.1% |
6 | 194 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 25397464 | |
b | 25296879 | |
e | 328515 | 0.6% |
c | 214647 | 0.4% |
f | 114062 | 0.2% |
d | 113868 | 0.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 152625221 | |
Latin | 51465435 | 25.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 75876384 | |
4 | 25296491 | 16.6% |
7 | 25182623 | 16.5% |
1 | 25182623 | 16.5% |
9 | 543356 | 0.4% |
0 | 429488 | 0.3% |
8 | 114062 | 0.1% |
6 | 194 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 25397464 | |
b | 25296879 | |
e | 328515 | 0.6% |
c | 214647 | 0.4% |
f | 114062 | 0.2% |
d | 113868 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 204090656 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 75876384 | |
a | 25397464 | 12.4% |
b | 25296879 | 12.4% |
4 | 25296491 | 12.4% |
7 | 25182623 | 12.3% |
1 | 25182623 | 12.3% |
9 | 543356 | 0.3% |
0 | 429488 | 0.2% |
e | 328515 | 0.2% |
c | 214647 | 0.1% |
Other values (4) | 342186 | 0.2% |
Distinct | 7 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 194.6 MiB |
Value | Count | Frequency (%) |
a55475b1 | 24501614 | |
9a0c095e | 537869 | 2.1% |
8fd95e4b | 470331 | 1.8% |
06fb9ba8 | 1308 | < 0.1% |
3cbe86ba | 201 | < 0.1% |
9276e4bb | 5 | < 0.1% |
c7a5ad39 | 4 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 74513046 | |
a | 25041000 | 12.3% |
b | 24974973 | 12.2% |
4 | 24971950 | 12.2% |
7 | 24501623 | 12.0% |
1 | 24501614 | 12.0% |
9 | 1547386 | 0.8% |
0 | 1077046 | 0.5% |
e | 1008406 | 0.5% |
c | 538074 | 0.3% |
Other values (6) | 1415538 | 0.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 151586229 | |
Lowercase Letter | 52504427 | 25.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 74513046 | |
4 | 24971950 | 16.5% |
7 | 24501623 | 16.2% |
1 | 24501614 | 16.2% |
9 | 1547386 | 1.0% |
0 | 1077046 | 0.7% |
8 | 471840 | 0.3% |
6 | 1514 | < 0.1% |
3 | 205 | < 0.1% |
2 | 5 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 25041000 | |
b | 24974973 | |
e | 1008406 | 1.9% |
c | 538074 | 1.0% |
f | 471639 | 0.9% |
d | 470335 | 0.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 151586229 | |
Latin | 52504427 | 25.7% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 74513046 | |
4 | 24971950 | 16.5% |
7 | 24501623 | 16.2% |
1 | 24501614 | 16.2% |
9 | 1547386 | 1.0% |
0 | 1077046 | 0.7% |
8 | 471840 | 0.3% |
6 | 1514 | < 0.1% |
3 | 205 | < 0.1% |
2 | 5 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 25041000 | |
b | 24974973 | |
e | 1008406 | 1.9% |
c | 538074 | 1.0% |
f | 471639 | 0.9% |
d | 470335 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 204090656 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 74513046 | |
a | 25041000 | 12.3% |
b | 24974973 | 12.2% |
4 | 24971950 | 12.2% |
7 | 24501623 | 12.0% |
1 | 24501614 | 12.0% |
9 | 1547386 | 0.8% |
0 | 1077046 | 0.5% |
e | 1008406 | 0.5% |
c | 538074 | 0.3% |
Other values (6) | 1415538 | 0.7% |
collater_valueofguarantee_1124L
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 14600 |
---|---|
Distinct (%) | 4.4% |
Missing | 25182623 |
Missing (%) | 98.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6965820.356 |
Minimum | 0 |
---|---|
Maximum | 9.878 × 1010 |
Zeros | 301773 |
Zeros (%) | 1.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 194.6 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 5199840 |
Maximum | 9.878 × 1010 |
Range | 9.878 × 1010 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 386553639.2 |
---|---|
Coefficient of variation (CV) | 55.49290959 |
Kurtosis | 54096.52349 |
Mean | 6965820.356 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 222.4815149 |
Sum | 2.289727843 × 1012 |
Variance | 1.49423716 × 1017 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 301773 | 1.2% |
1300000000 | 651 | < 0.1% |
3000000 | 156 | < 0.1% |
5000000 | 155 | < 0.1% |
2000000 | 141 | < 0.1% |
4000000 | 138 | < 0.1% |
10000000 | 112 | < 0.1% |
20000000 | 109 | < 0.1% |
6000000 | 107 | < 0.1% |
833333.35 | 97 | < 0.1% |
Other values (14590) | 25270 | 0.1% |
(Missing) | 25182623 |
Value | Count | Frequency (%) |
0 | 301773 | |
1 | 77 | < 0.1% |
1866 | 1 | < 0.1% |
2383.71 | 1 | < 0.1% |
2445 | 1 | < 0.1% |
Value | Count | Frequency (%) |
9.878 × 1010 | 4 | |
4.49 × 1010 | 4 | |
1.194099351 × 1010 | 3 | |
5593274000 | 2 | |
3719067464 | 1 | < 0.1% |
collater_valueofguarantee_876L
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 32485 |
---|---|
Distinct (%) | 3.2% |
Missing | 24501614 |
Missing (%) | 96.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2941478.947 |
Minimum | 0 |
---|---|
Maximum | 9.878 × 1010 |
Zeros | 908887 |
Zeros (%) | 3.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 194.6 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 166511.45 |
Maximum | 9.878 × 1010 |
Range | 9.878 × 1010 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 123789868.3 |
---|---|
Coefficient of variation (CV) | 42.0842272 |
Kurtosis | 418707.4787 |
Mean | 2941478.947 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 554.2424378 |
Sum | 2.970064239 × 1012 |
Variance | 1.532393149 × 1016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 908887 | 3.6% |
60000 | 2689 | < 0.1% |
130000 | 2138 | < 0.1% |
100000 | 1863 | < 0.1% |
65000 | 1329 | < 0.1% |
50000 | 1265 | < 0.1% |
70000 | 920 | < 0.1% |
150000 | 886 | < 0.1% |
300000 | 877 | < 0.1% |
200000 | 857 | < 0.1% |
Other values (32475) | 88007 | 0.3% |
(Missing) | 24501614 |
Value | Count | Frequency (%) |
0 | 908887 | |
0.1 | 1 | < 0.1% |
0.14 | 1 | < 0.1% |
0.24 | 1 | < 0.1% |
0.48 | 1 | < 0.1% |
Value | Count | Frequency (%) |
9.878 × 1010 | 1 | < 0.1% |
4.49 × 1010 | 1 | < 0.1% |
3250000000 | 60 | |
3200000000 | 17 | < 0.1% |
2921303961 | 4 | < 0.1% |
Distinct | 15 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 194.6 MiB |
Value | Count | Frequency (%) |
a55475b1 | 24501614 | |
c7a5ad39 | 786090 | 3.1% |
3cbe86ba | 148053 | 0.6% |
9276e4bb | 25228 | 0.1% |
0e63c0f0 | 17717 | 0.1% |
168ad9f3 | 7815 | < 0.1% |
5224034a | 5628 | < 0.1% |
7b62420e | 5296 | < 0.1% |
940efad7 | 4516 | < 0.1% |
2fd21cf1 | 3444 | < 0.1% |
Other values (5) | 5931 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 74300036 | |
a | 26245295 | 12.9% |
7 | 25326667 | 12.4% |
b | 24855889 | 12.2% |
4 | 24554535 | 12.0% |
1 | 24517708 | 12.0% |
3 | 967388 | 0.5% |
c | 958780 | 0.5% |
9 | 827819 | 0.4% |
d | 803294 | 0.4% |
Other values (6) | 733245 | 0.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 150986755 | |
Lowercase Letter | 53103901 | 26.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 74300036 | |
7 | 25326667 | 16.8% |
4 | 24554535 | 16.3% |
1 | 24517708 | 16.2% |
3 | 967388 | 0.6% |
9 | 827819 | 0.5% |
6 | 206526 | 0.1% |
8 | 158688 | 0.1% |
0 | 71995 | < 0.1% |
2 | 55393 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 26245295 | |
b | 24855889 | |
c | 958780 | 1.8% |
d | 803294 | 1.5% |
e | 202201 | 0.4% |
f | 38442 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 150986755 | |
Latin | 53103901 | 26.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 74300036 | |
7 | 25326667 | 16.8% |
4 | 24554535 | 16.3% |
1 | 24517708 | 16.2% |
3 | 967388 | 0.6% |
9 | 827819 | 0.5% |
6 | 206526 | 0.1% |
8 | 158688 | 0.1% |
0 | 71995 | < 0.1% |
2 | 55393 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 26245295 | |
b | 24855889 | |
c | 958780 | 1.8% |
d | 803294 | 1.5% |
e | 202201 | 0.4% |
f | 38442 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 204090656 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 74300036 | |
a | 26245295 | 12.9% |
7 | 25326667 | 12.4% |
b | 24855889 | 12.2% |
4 | 24554535 | 12.0% |
1 | 24517708 | 12.0% |
3 | 967388 | 0.5% |
c | 958780 | 0.5% |
9 | 827819 | 0.4% |
d | 803294 | 0.4% |
Other values (6) | 733245 | 0.4% |
Distinct | 15 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 194.6 MiB |
Value | Count | Frequency (%) |
a55475b1 | 25182623 | |
c7a5ad39 | 287925 | 1.1% |
9276e4bb | 13475 | 0.1% |
0e63c0f0 | 12582 | < 0.1% |
168ad9f3 | 6524 | < 0.1% |
7b62420e | 3842 | < 0.1% |
3cbe86ba | 1937 | < 0.1% |
940efad7 | 657 | < 0.1% |
f4d8a027 | 639 | < 0.1% |
2fd21cf1 | 499 | < 0.1% |
Other values (5) | 629 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 75836407 | |
a | 25768689 | 12.6% |
7 | 25489363 | 12.5% |
b | 25217486 | 12.4% |
4 | 25202116 | 12.3% |
1 | 25190326 | 12.3% |
3 | 309400 | 0.2% |
9 | 308747 | 0.2% |
c | 303207 | 0.1% |
d | 296244 | 0.1% |
Other values (6) | 168671 | 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 152450951 | |
Lowercase Letter | 51639705 | 25.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 75836407 | |
7 | 25489363 | 16.7% |
4 | 25202116 | 16.5% |
1 | 25190326 | 16.5% |
3 | 309400 | 0.2% |
9 | 308747 | 0.2% |
0 | 43260 | < 0.1% |
6 | 38557 | < 0.1% |
2 | 23494 | < 0.1% |
8 | 9281 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 25768689 | |
b | 25217486 | |
c | 303207 | 0.6% |
d | 296244 | 0.6% |
e | 32674 | 0.1% |
f | 21405 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 152450951 | |
Latin | 51639705 | 25.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 75836407 | |
7 | 25489363 | 16.7% |
4 | 25202116 | 16.5% |
1 | 25190326 | 16.5% |
3 | 309400 | 0.2% |
9 | 308747 | 0.2% |
0 | 43260 | < 0.1% |
6 | 38557 | < 0.1% |
2 | 23494 | < 0.1% |
8 | 9281 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 25768689 | |
b | 25217486 | |
c | 303207 | 0.6% |
d | 296244 | 0.6% |
e | 32674 | 0.1% |
f | 21405 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 204090656 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 75836407 | |
a | 25768689 | 12.6% |
7 | 25489363 | 12.5% |
b | 25217486 | 12.4% |
4 | 25202116 | 12.3% |
1 | 25190326 | 12.3% |
3 | 309400 | 0.2% |
9 | 308747 | 0.2% |
c | 303207 | 0.1% |
d | 296244 | 0.1% |
Other values (6) | 168671 | 0.1% |
num_group1
Real number (ℝ)
ZEROS
 
Distinct | 333 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.199547558 |
Minimum | 0 |
---|---|
Maximum | 332 |
Zeros | 4889854 |
Zeros (%) | 19.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 194.6 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 3 |
Q3 | 7 |
95-th percentile | 17 |
Maximum | 332 |
Range | 332 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 7.81430312 |
---|---|
Coefficient of variation (CV) | 1.502881363 |
Kurtosis | 188.0776265 |
Mean | 5.199547558 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 8.552489082 |
Sum | 132647384 |
Variance | 61.06333325 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 4889854 | |
1 | 3764450 | |
2 | 2904683 | |
3 | 2309598 | |
4 | 1880065 | 7.4% |
5 | 1565930 | 6.1% |
6 | 1309001 | 5.1% |
7 | 1097447 | 4.3% |
8 | 917453 | 3.6% |
9 | 765332 | 3.0% |
Other values (323) | 4107519 |
Value | Count | Frequency (%) |
0 | 4889854 | |
1 | 3764450 | |
2 | 2904683 | |
3 | 2309598 | |
4 | 1880065 | 7.4% |
Value | Count | Frequency (%) |
332 | 12 | |
331 | 12 | |
330 | 12 | |
329 | 12 | |
328 | 12 |
num_group2
Real number (ℝ)
ZEROS
 
Distinct | 101 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.55214197 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 1030268 |
Zeros (%) | 4.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 194.6 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 6 |
median | 12 |
Q3 | 20 |
95-th percentile | 32 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 14 |
Descriptive statistics
Standard deviation | 9.435353914 |
---|---|
Coefficient of variation (CV) | 0.6962260237 |
Kurtosis | -0.3341419037 |
Mean | 13.55214197 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 0.5287536011 |
Sum | 345733193 |
Variance | 89.02590348 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1030268 | 4.0% |
7 | 1030177 | 4.0% |
11 | 1030177 | 4.0% |
10 | 1030177 | 4.0% |
9 | 1030177 | 4.0% |
8 | 1030177 | 4.0% |
1 | 1030177 | 4.0% |
6 | 1030177 | 4.0% |
5 | 1030177 | 4.0% |
4 | 1030177 | 4.0% |
Other values (91) | 15209471 |
Value | Count | Frequency (%) |
0 | 1030268 | |
1 | 1030177 | |
2 | 1030177 | |
3 | 1030177 | |
4 | 1030177 |
Value | Count | Frequency (%) |
100 | 1 | < 0.1% |
99 | 1 | < 0.1% |
98 | 93 | |
97 | 94 | |
96 | 95 |
pmts_dpd_1073P
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 3814 |
---|---|
Distinct (%) | 0.1% |
Missing | 21490632 |
Missing (%) | 84.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.55760813 |
Minimum | 0 |
---|---|
Maximum | 4565 |
Zeros | 3756543 |
Zeros (%) | 14.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 194.6 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 3 |
Maximum | 4565 |
Range | 4565 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 142.5610536 |
---|---|
Coefficient of variation (CV) | 10.51520683 |
Kurtosis | 314.7134632 |
Mean | 13.55760813 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 16.02003956 |
Sum | 54511075 |
Variance | 20323.65399 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 3756543 | 14.7% |
1 | 40686 | 0.2% |
2 | 15994 | 0.1% |
3 | 14561 | 0.1% |
4 | 13014 | 0.1% |
7 | 8012 | < 0.1% |
5 | 7668 | < 0.1% |
6 | 6989 | < 0.1% |
8 | 6008 | < 0.1% |
9 | 5975 | < 0.1% |
Other values (3804) | 145250 | 0.6% |
(Missing) | 21490632 |
Value | Count | Frequency (%) |
0 | 3756543 | |
1 | 40686 | 0.2% |
2 | 15994 | 0.1% |
3 | 14561 | 0.1% |
4 | 13014 | 0.1% |
Value | Count | Frequency (%) |
4565 | 1 | |
4561 | 1 | |
4552 | 1 | |
4536 | 2 | |
4503 | 1 |
pmts_dpd_303P
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 4273 |
---|---|
Distinct (%) | < 0.1% |
Missing | 14872146 |
Missing (%) | 58.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 58.91003541 |
Minimum | -18 |
---|---|
Maximum | 117000 |
Zeros | 8873659 |
Zeros (%) | 34.8% |
Negative | 1621 |
Negative (%) | < 0.1% |
Memory size | 194.6 MiB |
Quantile statistics
Minimum | -18 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 377 |
Maximum | 117000 |
Range | 117018 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 285.3191101 |
---|---|
Coefficient of variation (CV) | 4.843302303 |
Kurtosis | 8206.222245 |
Mean | 58.91003541 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 31.47683138 |
Sum | 626754824 |
Variance | 81406.99462 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 8873659 | |
1 | 243256 | 1.0% |
3 | 64361 | 0.3% |
2 | 61391 | 0.2% |
4 | 53185 | 0.2% |
6 | 43772 | 0.2% |
5 | 35500 | 0.1% |
7 | 34653 | 0.1% |
9 | 25102 | 0.1% |
8 | 24613 | 0.1% |
Other values (4263) | 1179694 | 4.6% |
(Missing) | 14872146 |
Value | Count | Frequency (%) |
-18 | 11 | |
-16 | 2 | < 0.1% |
-15 | 2 | < 0.1% |
-14 | 2 | < 0.1% |
-12 | 1 | < 0.1% |
Value | Count | Frequency (%) |
117000 | 1 | |
84575 | 2 | |
84574 | 1 | |
84573 | 1 | |
84560 | 2 |
pmts_month_158T
Real number (ℝ)
MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 16723732 |
Missing (%) | 65.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 194.6 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.452052726 |
---|---|
Coefficient of variation (CV) | 0.5310850348 |
Kurtosis | -1.216783226 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 57119400 |
Variance | 11.91666802 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 732300 | 2.9% |
3 | 732300 | 2.9% |
4 | 732300 | 2.9% |
5 | 732300 | 2.9% |
6 | 732300 | 2.9% |
7 | 732300 | 2.9% |
8 | 732300 | 2.9% |
9 | 732300 | 2.9% |
10 | 732300 | 2.9% |
11 | 732300 | 2.9% |
Other values (2) | 1464600 | 5.7% |
(Missing) | 16723732 |
Value | Count | Frequency (%) |
1 | 732300 | |
2 | 732300 | |
3 | 732300 | |
4 | 732300 | |
5 | 732300 |
Value | Count | Frequency (%) |
12 | 732300 | |
11 | 732300 | |
10 | 732300 | |
9 | 732300 | |
8 | 732300 |
pmts_month_706T
Real number (ℝ)
MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 2754316 |
Missing (%) | 10.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 194.6 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.452052605 |
---|---|
Coefficient of variation (CV) | 0.5310850162 |
Kurtosis | -1.21678322 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 147920604 |
Variance | 11.91666719 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 1896418 | |
3 | 1896418 | |
4 | 1896418 | |
5 | 1896418 | |
6 | 1896418 | |
7 | 1896418 | |
8 | 1896418 | |
9 | 1896418 | |
10 | 1896418 | |
11 | 1896418 | |
Other values (2) | 3792836 | |
(Missing) | 2754316 |
Value | Count | Frequency (%) |
1 | 1896418 | |
2 | 1896418 | |
3 | 1896418 | |
4 | 1896418 | |
5 | 1896418 |
Value | Count | Frequency (%) |
12 | 1896418 | |
11 | 1896418 | |
10 | 1896418 | |
9 | 1896418 | |
8 | 1896418 |
pmts_overdue_1140A
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 188405 |
---|---|
Distinct (%) | 4.7% |
Missing | 21467596 |
Missing (%) | 84.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2147.679317 |
Minimum | 0 |
---|---|
Maximum | 28325886 |
Zeros | 3775346 |
Zeros (%) | 14.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 194.6 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 1774.0335 |
Maximum | 28325886 |
Range | 28325886 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 106264.0546 |
---|---|
Coefficient of variation (CV) | 49.47854821 |
Kurtosis | 41865.39734 |
Mean | 2147.679317 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 187.1643167 |
Sum | 8684648170 |
Variance | 1.12920493 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 3775346 | 14.8% |
10 | 386 | < 0.1% |
400 | 298 | < 0.1% |
1000 | 292 | < 0.1% |
14 | 248 | < 0.1% |
35000 | 222 | < 0.1% |
4228 | 204 | < 0.1% |
3500 | 188 | < 0.1% |
2000 | 178 | < 0.1% |
2 | 176 | < 0.1% |
Other values (188395) | 266198 | 1.0% |
(Missing) | 21467596 |
Value | Count | Frequency (%) |
0 | 3775346 | |
0.002 | 21 | < 0.1% |
0.004 | 12 | < 0.1% |
0.006 | 9 | < 0.1% |
0.008 | 38 | < 0.1% |
Value | Count | Frequency (%) |
28325886 | 23 | |
23891848 | 10 | |
17402200 | 14 | |
15768560 | 10 | |
15237162 | 2 | < 0.1% |
pmts_overdue_1152A
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 759663 |
---|---|
Distinct (%) | 7.1% |
Missing | 14857730 |
Missing (%) | 58.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4486.441981 |
Minimum | 0 |
---|---|
Maximum | 593465000 |
Zeros | 8809850 |
Zeros (%) | 34.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 194.6 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 17498.943 |
Maximum | 593465000 |
Range | 593465000 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 215393.7122 |
---|---|
Coefficient of variation (CV) | 48.00991813 |
Kurtosis | 5451623.858 |
Mean | 4486.441981 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2054.522814 |
Sum | 4.779676726 × 1010 |
Variance | 4.639445125 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 8809850 | |
0.2 | 5294 | < 0.1% |
1000 | 3491 | < 0.1% |
2000 | 2400 | < 0.1% |
0.4 | 2210 | < 0.1% |
3000 | 2032 | < 0.1% |
0.8 | 1675 | < 0.1% |
2 | 1669 | < 0.1% |
0.6 | 1664 | < 0.1% |
1.2 | 1592 | < 0.1% |
Other values (759653) | 1821725 | 7.1% |
(Missing) | 14857730 |
Value | Count | Frequency (%) |
0 | 8809850 | |
0.002 | 149 | < 0.1% |
0.004 | 72 | < 0.1% |
0.006 | 52 | < 0.1% |
0.008 | 76 | < 0.1% |
Value | Count | Frequency (%) |
593465000 | 1 | < 0.1% |
107822220 | 7 | |
52067108 | 1 | < 0.1% |
46745056 | 1 | < 0.1% |
38199708 | 1 | < 0.1% |
pmts_year_1139T
Real number (ℝ)
MISSING
 
Distinct | 7 |
---|---|
Distinct (%) | < 0.1% |
Missing | 16723732 |
Missing (%) | 65.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2019.308068 |
Minimum | 2015 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 194.6 MiB |
Quantile statistics
Minimum | 2015 |
---|---|
5-th percentile | 2018 |
Q1 | 2019 |
median | 2019 |
Q3 | 2020 |
95-th percentile | 2020 |
Maximum | 2021 |
Range | 6 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 0.7898625601 |
---|---|
Coefficient of variation (CV) | 0.0003911550559 |
Kurtosis | -0.7153131616 |
Mean | 2019.308068 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.1929942781 |
Sum | 1.774487158 × 1010 |
Variance | 0.6238828638 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2020 | 3605350 | 14.1% |
2019 | 3384736 | 13.3% |
2018 | 1493543 | 5.9% |
2021 | 300908 | 1.2% |
2017 | 2764 | < 0.1% |
2016 | 277 | < 0.1% |
2015 | 22 | < 0.1% |
(Missing) | 16723732 |
Value | Count | Frequency (%) |
2015 | 22 | < 0.1% |
2016 | 277 | < 0.1% |
2017 | 2764 | < 0.1% |
2018 | 1493543 | |
2019 | 3384736 |
Value | Count | Frequency (%) |
2021 | 300908 | 1.2% |
2020 | 3605350 | |
2019 | 3384736 | |
2018 | 1493543 | |
2017 | 2764 | < 0.1% |
pmts_year_507T
Real number (ℝ)
MISSING
 
Distinct | 22 |
---|---|
Distinct (%) | < 0.1% |
Missing | 2754316 |
Missing (%) | 10.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2014.693585 |
Minimum | 2000 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 194.6 MiB |
Quantile statistics
Minimum | 2000 |
---|---|
5-th percentile | 2007 |
Q1 | 2012 |
median | 2016 |
Q3 | 2018 |
95-th percentile | 2019 |
Maximum | 2021 |
Range | 21 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 3.94224462 |
---|---|
Coefficient of variation (CV) | 0.001956746499 |
Kurtosis | -0.458399037 |
Mean | 2014.693585 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.766296528 |
Sum | 4.584841416 × 1010 |
Variance | 15.54129265 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2018 | 3565601 | |
2019 | 3053031 | |
2017 | 2780225 | |
2016 | 1980636 | |
2015 | 1738516 | |
2014 | 1595306 | |
2013 | 1375124 | 5.4% |
2012 | 1116822 | 4.4% |
2011 | 963495 | 3.8% |
2007 | 813761 | 3.2% |
Other values (12) | 3774499 | |
(Missing) | 2754316 |
Value | Count | Frequency (%) |
2000 | 11 | < 0.1% |
2001 | 111 | < 0.1% |
2002 | 571 | < 0.1% |
2003 | 1591 | < 0.1% |
2004 | 48122 |
Value | Count | Frequency (%) |
2021 | 26130 | 0.1% |
2020 | 537474 | 2.1% |
2019 | 3053031 | |
2018 | 3565601 | |
2017 | 2780225 |
Distinct | 9 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 194.6 MiB |
Value | Count | Frequency (%) |
a55475b1 | 24511526 | |
ab3c25cf | 981325 | 3.8% |
15f04f45 | 10252 | < 0.1% |
be4fd70b | 5316 | < 0.1% |
daf49a8a | 2853 | < 0.1% |
71ddaa88 | 21 | < 0.1% |
0c42a10e | 18 | < 0.1% |
1d94eac1 | 16 | < 0.1% |
9ba4314a | 5 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 74536407 | |
b | 25503488 | 12.5% |
a | 25501496 | 12.5% |
4 | 24540243 | 12.0% |
1 | 24521854 | 12.0% |
7 | 24516863 | 12.0% |
c | 1962684 | 1.0% |
f | 1009998 | 0.5% |
2 | 981343 | 0.5% |
3 | 981330 | 0.5% |
Other values (5) | 34950 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 150099413 | |
Lowercase Letter | 53991243 | 26.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 74536407 | |
4 | 24540243 | 16.3% |
1 | 24521854 | 16.3% |
7 | 24516863 | 16.3% |
2 | 981343 | 0.7% |
3 | 981330 | 0.7% |
0 | 15604 | < 0.1% |
8 | 2895 | < 0.1% |
9 | 2874 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 25503488 | |
a | 25501496 | |
c | 1962684 | 3.6% |
f | 1009998 | 1.9% |
d | 8227 | < 0.1% |
e | 5350 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 150099413 | |
Latin | 53991243 | 26.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 74536407 | |
4 | 24540243 | 16.3% |
1 | 24521854 | 16.3% |
7 | 24516863 | 16.3% |
2 | 981343 | 0.7% |
3 | 981330 | 0.7% |
0 | 15604 | < 0.1% |
8 | 2895 | < 0.1% |
9 | 2874 | < 0.1% |
Latin
Value | Count | Frequency (%) |
b | 25503488 | |
a | 25501496 | |
c | 1962684 | 3.6% |
f | 1009998 | 1.9% |
d | 8227 | < 0.1% |
e | 5350 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 204090656 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 74536407 | |
b | 25503488 | 12.5% |
a | 25501496 | 12.5% |
4 | 24540243 | 12.0% |
1 | 24521854 | 12.0% |
7 | 24516863 | 12.0% |
c | 1962684 | 1.0% |
f | 1009998 | 0.5% |
2 | 981343 | 0.5% |
3 | 981330 | 0.5% |
Other values (5) | 34950 | < 0.1% |
Distinct | 8 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 194.6 MiB |
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 8.000014111 |
Min length | 8 |
Characters and Unicode
Total characters | 204091016 |
---|---|
Distinct characters | 17 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | ab3c25cf |
---|---|
2nd row | a55475b1 |
3rd row | a55475b1 |
4th row | a55475b1 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 25197016 | |
ab3c25cf | 306490 | 1.2% |
be4fd70b | 3129 | < 0.1% |
15f04f45 | 2501 | < 0.1% |
daf49a8a | 1831 | < 0.1% |
p28_48_88 | 360 | < 0.1% |
71ddaa88 | 3 | < 0.1% |
0c42a10e | 2 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 75902540 | |
b | 25509764 | 12.5% |
a | 25509007 | 12.5% |
4 | 25207340 | 12.4% |
7 | 25200148 | 12.3% |
1 | 25199522 | 12.3% |
c | 612982 | 0.3% |
f | 316452 | 0.2% |
2 | 306852 | 0.2% |
3 | 306490 | 0.2% |
Other values (7) | 19919 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 152133634 | |
Lowercase Letter | 51956302 | 25.5% |
Connector Punctuation | 720 | < 0.1% |
Uppercase Letter | 360 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 75902540 | |
4 | 25207340 | 16.6% |
7 | 25200148 | 16.6% |
1 | 25199522 | 16.6% |
2 | 306852 | 0.2% |
3 | 306490 | 0.2% |
0 | 5634 | < 0.1% |
8 | 3277 | < 0.1% |
9 | 1831 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 25509764 | |
a | 25509007 | |
c | 612982 | 1.2% |
f | 316452 | 0.6% |
d | 4966 | < 0.1% |
e | 3131 | < 0.1% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 720 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 360 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 152134354 | |
Latin | 51956662 | 25.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 75902540 | |
4 | 25207340 | 16.6% |
7 | 25200148 | 16.6% |
1 | 25199522 | 16.6% |
2 | 306852 | 0.2% |
3 | 306490 | 0.2% |
0 | 5634 | < 0.1% |
8 | 3277 | < 0.1% |
9 | 1831 | < 0.1% |
_ | 720 | < 0.1% |
Latin
Value | Count | Frequency (%) |
b | 25509764 | |
a | 25509007 | |
c | 612982 | 1.2% |
f | 316452 | 0.6% |
d | 4966 | < 0.1% |
e | 3131 | < 0.1% |
P | 360 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 204091016 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 75902540 | |
b | 25509764 | 12.5% |
a | 25509007 | 12.5% |
4 | 25207340 | 12.4% |
7 | 25200148 | 12.3% |
1 | 25199522 | 12.3% |
c | 612982 | 0.3% |
f | 316452 | 0.2% |
2 | 306852 | 0.2% |
3 | 306490 | 0.2% |
Other values (7) | 19919 | < 0.1% |