Dataset statistics
Number of variables | 19 |
---|---|
Number of observations | 5296031 |
Missing cells | 37184092 |
Missing cells (%) | 37.0% |
Total size in memory | 767.7 MiB |
Average record size in memory | 152.0 B |
Variable types
Numeric | 8 |
---|---|
Text | 6 |
Unsupported | 5 |
collater_typofvalofguarant_407M has constant value "" | Constant |
collaterals_typeofguarante_359M has constant value "" | Constant |
subjectroles_name_541M has constant value "" | Constant |
collater_valueofguarantee_1124L has 5093116 (96.2%) missing values | Missing |
collater_valueofguarantee_876L has 5296031 (100.0%) missing values | Missing |
pmts_dpd_1073P has 2806418 (53.0%) missing values | Missing |
pmts_dpd_303P has 5296031 (100.0%) missing values | Missing |
pmts_month_706T has 5296031 (100.0%) missing values | Missing |
pmts_overdue_1140A has 2804357 (53.0%) missing values | Missing |
pmts_overdue_1152A has 5296031 (100.0%) missing values | Missing |
pmts_year_507T has 5296031 (100.0%) missing values | Missing |
collater_valueofguarantee_1124L is highly skewed (γ1 = 53.73476843) | Skewed |
pmts_overdue_1140A is highly skewed (γ1 = 202.1253289) | Skewed |
collater_valueofguarantee_876L is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
pmts_dpd_303P is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
pmts_month_706T is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
pmts_overdue_1152A is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
pmts_year_507T is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
collater_valueofguarantee_1124L has 188430 (3.6%) zeros | Zeros |
num_group1 has 2915256 (55.0%) zeros | Zeros |
num_group2 has 197095 (3.7%) zeros | Zeros |
pmts_dpd_1073P has 2335900 (44.1%) zeros | Zeros |
pmts_overdue_1140A has 2335817 (44.1%) zeros | Zeros |
Reproduction
Analysis started | 2024-02-13 19:43:25.185582 |
---|---|
Analysis finished | 2024-02-13 19:43:40.132818 |
Duration | 14.95 seconds |
Software version | ydata-profiling vv4.6.4 |
Download configuration | config.json |
case_id
Real number (ℝ)
Distinct | 98303 |
---|---|
Distinct (%) | 1.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1063781.521 |
Minimum | 388 |
---|---|
Maximum | 2548729 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 40.4 MiB |
Quantile statistics
Minimum | 388 |
---|---|
5-th percentile | 105340 |
Q1 | 622237 |
median | 1259890 |
Q3 | 1284811 |
95-th percentile | 2543489 |
Maximum | 2548729 |
Range | 2548341 |
Interquartile range (IQR) | 662574 |
Descriptive statistics
Standard deviation | 662226.6264 |
---|---|
Coefficient of variation (CV) | 0.6225212726 |
Kurtosis | 0.31816212 |
Mean | 1063781.521 |
Median Absolute Deviation (MAD) | 38518 |
Skewness | 0.557298106 |
Sum | 5.633819914 × 1012 |
Variance | 4.385441047 × 1011 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
617817 | 1131 | < 0.1% |
624267 | 732 | < 0.1% |
641873 | 552 | < 0.1% |
1280884 | 516 | < 0.1% |
1286727 | 348 | < 0.1% |
1295729 | 336 | < 0.1% |
1293107 | 324 | < 0.1% |
1293089 | 312 | < 0.1% |
1290178 | 300 | < 0.1% |
638376 | 300 | < 0.1% |
Other values (98293) | 5291180 |
Value | Count | Frequency (%) |
388 | 36 | < 0.1% |
405 | 60 | |
409 | 48 | < 0.1% |
410 | 24 | < 0.1% |
411 | 120 |
Value | Count | Frequency (%) |
2548729 | 108 | |
2548728 | 24 | < 0.1% |
2548727 | 36 | < 0.1% |
2548726 | 96 | |
2548725 | 36 | < 0.1% |
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 40.4 MiB |
Value | Count | Frequency (%) |
a55475b1 | 5093116 | |
9a0c095e | 138021 | 2.6% |
8fd95e4b | 64740 | 1.2% |
06fb9ba8 | 154 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 15482109 | |
a | 5231291 | 12.3% |
b | 5158164 | 12.2% |
4 | 5157856 | 12.2% |
7 | 5093116 | 12.0% |
1 | 5093116 | 12.0% |
9 | 340936 | 0.8% |
0 | 276196 | 0.7% |
e | 202761 | 0.5% |
c | 138021 | 0.3% |
Other values (4) | 194682 | 0.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 31508377 | |
Lowercase Letter | 10859871 | 25.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 15482109 | |
4 | 5157856 | 16.4% |
7 | 5093116 | 16.2% |
1 | 5093116 | 16.2% |
9 | 340936 | 1.1% |
0 | 276196 | 0.9% |
8 | 64894 | 0.2% |
6 | 154 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 5231291 | |
b | 5158164 | |
e | 202761 | 1.9% |
c | 138021 | 1.3% |
f | 64894 | 0.6% |
d | 64740 | 0.6% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 31508377 | |
Latin | 10859871 | 25.6% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 15482109 | |
4 | 5157856 | 16.4% |
7 | 5093116 | 16.2% |
1 | 5093116 | 16.2% |
9 | 340936 | 1.1% |
0 | 276196 | 0.9% |
8 | 64894 | 0.2% |
6 | 154 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 5231291 | |
b | 5158164 | |
e | 202761 | 1.9% |
c | 138021 | 1.3% |
f | 64894 | 0.6% |
d | 64740 | 0.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 42368248 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 15482109 | |
a | 5231291 | 12.3% |
b | 5158164 | 12.2% |
4 | 5157856 | 12.2% |
7 | 5093116 | 12.0% |
1 | 5093116 | 12.0% |
9 | 340936 | 0.8% |
0 | 276196 | 0.7% |
e | 202761 | 0.5% |
c | 138021 | 0.3% |
Other values (4) | 194682 | 0.5% |
collater_typofvalofguarant_407M
Text
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 40.4 MiB |
Value | Count | Frequency (%) |
a55475b1 | 5296031 |
Most occurring characters
Value | Count | Frequency (%) |
5 | 15888093 | |
a | 5296031 | 12.5% |
4 | 5296031 | 12.5% |
7 | 5296031 | 12.5% |
b | 5296031 | 12.5% |
1 | 5296031 | 12.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 31776186 | |
Lowercase Letter | 10592062 | 25.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 15888093 | |
4 | 5296031 | 16.7% |
7 | 5296031 | 16.7% |
1 | 5296031 | 16.7% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 5296031 | |
b | 5296031 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 31776186 | |
Latin | 10592062 | 25.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 15888093 | |
4 | 5296031 | 16.7% |
7 | 5296031 | 16.7% |
1 | 5296031 | 16.7% |
Latin
Value | Count | Frequency (%) |
a | 5296031 | |
b | 5296031 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 42368248 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 15888093 | |
a | 5296031 | 12.5% |
4 | 5296031 | 12.5% |
7 | 5296031 | 12.5% |
b | 5296031 | 12.5% |
1 | 5296031 | 12.5% |
collater_valueofguarantee_1124L
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 9526 |
---|---|
Distinct (%) | 4.7% |
Missing | 5093116 |
Missing (%) | 96.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1863340.599 |
Minimum | 0 |
---|---|
Maximum | 3200000000 |
Zeros | 188430 |
Zeros (%) | 3.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 40.4 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 3712142.8 |
Maximum | 3200000000 |
Range | 3200000000 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 33559092.64 |
---|---|
Coefficient of variation (CV) | 18.01017627 |
Kurtosis | 3996.973233 |
Mean | 1863340.599 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 53.73476843 |
Sum | 3.780997577 × 1011 |
Variance | 1.126212699 × 1015 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 188430 | 3.6% |
100000000 | 163 | < 0.1% |
200000000 | 138 | < 0.1% |
800000000 | 132 | < 0.1% |
3000000 | 98 | < 0.1% |
2000000 | 84 | < 0.1% |
4000000 | 82 | < 0.1% |
1 | 75 | < 0.1% |
5000000 | 63 | < 0.1% |
6000000 | 58 | < 0.1% |
Other values (9516) | 13592 | 0.3% |
(Missing) | 5093116 |
Value | Count | Frequency (%) |
0 | 188430 | |
0.9 | 2 | < 0.1% |
1 | 75 | < 0.1% |
100 | 3 | < 0.1% |
230 | 1 | < 0.1% |
Value | Count | Frequency (%) |
3200000000 | 6 | |
2500000000 | 8 | |
2100690411 | 1 | < 0.1% |
1908326000 | 1 | < 0.1% |
932264044.2 | 1 | < 0.1% |
collater_valueofguarantee_876L
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 5296031 |
---|---|
Missing (%) | 100.0% |
Memory size | 40.4 MiB |
collaterals_typeofguarante_359M
Text
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 40.4 MiB |
Value | Count | Frequency (%) |
a55475b1 | 5296031 |
Most occurring characters
Value | Count | Frequency (%) |
5 | 15888093 | |
a | 5296031 | 12.5% |
4 | 5296031 | 12.5% |
7 | 5296031 | 12.5% |
b | 5296031 | 12.5% |
1 | 5296031 | 12.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 31776186 | |
Lowercase Letter | 10592062 | 25.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 15888093 | |
4 | 5296031 | 16.7% |
7 | 5296031 | 16.7% |
1 | 5296031 | 16.7% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 5296031 | |
b | 5296031 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 31776186 | |
Latin | 10592062 | 25.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 15888093 | |
4 | 5296031 | 16.7% |
7 | 5296031 | 16.7% |
1 | 5296031 | 16.7% |
Latin
Value | Count | Frequency (%) |
a | 5296031 | |
b | 5296031 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 42368248 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 15888093 | |
a | 5296031 | 12.5% |
4 | 5296031 | 12.5% |
7 | 5296031 | 12.5% |
b | 5296031 | 12.5% |
1 | 5296031 | 12.5% |
Distinct | 15 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 40.4 MiB |
Value | Count | Frequency (%) |
a55475b1 | 5093116 | |
c7a5ad39 | 181173 | 3.4% |
9276e4bb | 8128 | 0.2% |
0e63c0f0 | 7920 | 0.1% |
168ad9f3 | 1877 | < 0.1% |
7b62420e | 1839 | < 0.1% |
940efad7 | 539 | < 0.1% |
f4d8a027 | 512 | < 0.1% |
3cbe86ba | 343 | < 0.1% |
2fd21cf1 | 215 | < 0.1% |
Other values (5) | 369 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 15460689 | |
a | 5459276 | 12.9% |
7 | 5285535 | 12.5% |
b | 5112124 | 12.1% |
4 | 5104619 | 12.0% |
1 | 5095449 | 12.0% |
9 | 191751 | 0.5% |
3 | 191455 | 0.5% |
c | 189694 | 0.4% |
d | 184316 | 0.4% |
Other values (6) | 93340 | 0.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 31392764 | |
Lowercase Letter | 10975484 | 25.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 15460689 | |
7 | 5285535 | 16.8% |
4 | 5104619 | 16.3% |
1 | 5095449 | 16.2% |
9 | 191751 | 0.6% |
3 | 191455 | 0.6% |
0 | 27176 | 0.1% |
6 | 20334 | 0.1% |
2 | 12998 | < 0.1% |
8 | 2758 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 5459276 | |
b | 5112124 | |
c | 189694 | 1.7% |
d | 184316 | 1.7% |
e | 18795 | 0.2% |
f | 11279 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 31392764 | |
Latin | 10975484 | 25.9% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 15460689 | |
7 | 5285535 | 16.8% |
4 | 5104619 | 16.3% |
1 | 5095449 | 16.2% |
9 | 191751 | 0.6% |
3 | 191455 | 0.6% |
0 | 27176 | 0.1% |
6 | 20334 | 0.1% |
2 | 12998 | < 0.1% |
8 | 2758 | < 0.1% |
Latin
Value | Count | Frequency (%) |
a | 5459276 | |
b | 5112124 | |
c | 189694 | 1.7% |
d | 184316 | 1.7% |
e | 18795 | 0.2% |
f | 11279 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 42368248 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 15460689 | |
a | 5459276 | 12.9% |
7 | 5285535 | 12.5% |
b | 5112124 | 12.1% |
4 | 5104619 | 12.0% |
1 | 5095449 | 12.0% |
9 | 191751 | 0.5% |
3 | 191455 | 0.5% |
c | 189694 | 0.4% |
d | 184316 | 0.4% |
Other values (6) | 93340 | 0.2% |
num_group1
Real number (ℝ)
ZEROS
 
Distinct | 47 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.7214553691 |
Minimum | 0 |
---|---|
Maximum | 46 |
Zeros | 2915256 |
Zeros (%) | 55.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 40.4 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 3 |
Maximum | 46 |
Range | 46 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 1.102630299 |
---|---|
Coefficient of variation (CV) | 1.528341663 |
Kurtosis | 144.7256567 |
Mean | 0.7214553691 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 6.070452508 |
Sum | 3820850 |
Variance | 1.215793576 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 2915256 | |
1 | 1459928 | |
2 | 600132 | 11.3% |
3 | 214200 | 4.0% |
4 | 68772 | 1.3% |
5 | 22980 | 0.4% |
6 | 8016 | 0.2% |
7 | 2784 | 0.1% |
8 | 1272 | < 0.1% |
9 | 552 | < 0.1% |
Other values (37) | 2139 | < 0.1% |
Value | Count | Frequency (%) |
0 | 2915256 | |
1 | 1459928 | |
2 | 600132 | 11.3% |
3 | 214200 | 4.0% |
4 | 68772 | 1.3% |
Value | Count | Frequency (%) |
46 | 15 | |
45 | 24 | |
44 | 24 | |
43 | 24 | |
42 | 24 |
num_group2
Real number (ℝ)
ZEROS
 
Distinct | 36 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14.02342263 |
Minimum | 0 |
---|---|
Maximum | 35 |
Zeros | 197095 |
Zeros (%) | 3.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 40.4 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 6 |
median | 13 |
Q3 | 21 |
95-th percentile | 32 |
Maximum | 35 |
Range | 35 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 9.312802598 |
---|---|
Coefficient of variation (CV) | 0.6640891343 |
Kurtosis | -0.7376050779 |
Mean | 14.02342263 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 0.3982469587 |
Sum | 74268481 |
Variance | 86.72829223 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 197095 | 3.7% |
6 | 197095 | 3.7% |
11 | 197095 | 3.7% |
10 | 197095 | 3.7% |
8 | 197095 | 3.7% |
7 | 197095 | 3.7% |
9 | 197095 | 3.7% |
5 | 197095 | 3.7% |
4 | 197095 | 3.7% |
3 | 197095 | 3.7% |
Other values (26) | 3325081 |
Value | Count | Frequency (%) |
0 | 197095 | |
1 | 197095 | |
2 | 197095 | |
3 | 197095 | |
4 | 197095 |
Value | Count | Frequency (%) |
35 | 69233 | |
34 | 69233 | |
33 | 69233 | |
32 | 69233 | |
31 | 69234 |
pmts_dpd_1073P
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 3358 |
---|---|
Distinct (%) | 0.1% |
Missing | 2806418 |
Missing (%) | 53.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.1621742 |
Minimum | 0 |
---|---|
Maximum | 4877 |
Zeros | 2335900 |
Zeros (%) | 44.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 40.4 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 2 |
Maximum | 4877 |
Range | 4877 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 138.7310755 |
---|---|
Coefficient of variation (CV) | 10.54013367 |
Kurtosis | 295.5590194 |
Mean | 13.1621742 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 15.45964878 |
Sum | 32768720 |
Variance | 19246.31131 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 2335900 | |
1 | 20888 | 0.4% |
3 | 10573 | 0.2% |
2 | 9554 | 0.2% |
4 | 9295 | 0.2% |
7 | 5001 | 0.1% |
5 | 4931 | 0.1% |
6 | 4842 | 0.1% |
8 | 3676 | 0.1% |
9 | 3575 | 0.1% |
Other values (3348) | 81378 | 1.5% |
(Missing) | 2806418 |
Value | Count | Frequency (%) |
0 | 2335900 | |
1 | 20888 | 0.4% |
2 | 9554 | 0.2% |
3 | 10573 | 0.2% |
4 | 9295 | 0.2% |
Value | Count | Frequency (%) |
4877 | 1 | |
4850 | 1 | |
4823 | 1 | |
4806 | 1 | |
4773 | 1 |
pmts_dpd_303P
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 5296031 |
---|---|
Missing (%) | 100.0% |
Memory size | 40.4 MiB |
pmts_month_158T
Real number (ℝ)
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 23 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 40.4 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.452052855 |
---|---|
Coefficient of variation (CV) | 0.5310850547 |
Kurtosis | -1.216783233 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 34424052 |
Variance | 11.91666892 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 441334 | |
3 | 441334 | |
4 | 441334 | |
5 | 441334 | |
6 | 441334 | |
7 | 441334 | |
8 | 441334 | |
9 | 441334 | |
10 | 441334 | |
11 | 441334 | |
Other values (2) | 882668 |
Value | Count | Frequency (%) |
1 | 441334 | |
2 | 441334 | |
3 | 441334 | |
4 | 441334 | |
5 | 441334 |
Value | Count | Frequency (%) |
12 | 441334 | |
11 | 441334 | |
10 | 441334 | |
9 | 441334 | |
8 | 441334 |
pmts_month_706T
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 5296031 |
---|---|
Missing (%) | 100.0% |
Memory size | 40.4 MiB |
pmts_overdue_1140A
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 112241 |
---|---|
Distinct (%) | 4.5% |
Missing | 2804357 |
Missing (%) | 53.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1438.95848 |
Minimum | 0 |
---|---|
Maximum | 15237162 |
Zeros | 2335817 |
Zeros (%) | 44.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 40.4 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 1012.3793 |
Maximum | 15237162 |
Range | 15237162 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 39508.26943 |
---|---|
Coefficient of variation (CV) | 27.45615664 |
Kurtosis | 65542.76491 |
Mean | 1438.95848 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 202.1253289 |
Sum | 3585415432 |
Variance | 1560903353 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 2335817 | |
1000 | 334 | < 0.1% |
2000 | 232 | < 0.1% |
3000 | 209 | < 0.1% |
400 | 189 | < 0.1% |
2600 | 164 | < 0.1% |
0.8 | 144 | < 0.1% |
2 | 140 | < 0.1% |
800 | 128 | < 0.1% |
0.4 | 124 | < 0.1% |
Other values (112231) | 154193 | 2.9% |
(Missing) | 2804357 |
Value | Count | Frequency (%) |
0 | 2335817 | |
0.002 | 22 | < 0.1% |
0.004 | 3 | < 0.1% |
0.006 | 8 | < 0.1% |
0.008 | 13 | < 0.1% |
Value | Count | Frequency (%) |
15237162 | 2 | |
15233466 | 4 | |
14662000 | 1 | < 0.1% |
7399121.5 | 1 | < 0.1% |
7008248 | 1 | < 0.1% |
pmts_overdue_1152A
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 5296031 |
---|---|
Missing (%) | 100.0% |
Memory size | 40.4 MiB |
pmts_year_1139T
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 23 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2018.264301 |
Minimum | 2015 |
---|---|
Maximum | 2020 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 40.4 MiB |
Quantile statistics
Minimum | 2015 |
---|---|
5-th percentile | 2017 |
Q1 | 2018 |
median | 2018 |
Q3 | 2019 |
95-th percentile | 2019 |
Maximum | 2020 |
Range | 5 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 0.7832686924 |
---|---|
Coefficient of variation (CV) | 0.0003880902477 |
Kurtosis | -0.7007227773 |
Mean | 2018.264301 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.1184982258 |
Sum | 1.068874388 × 1010 |
Variance | 0.6135098445 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2018 | 2181653 | |
2019 | 2010214 | |
2017 | 936478 | |
2020 | 165420 | 3.1% |
2016 | 1891 | < 0.1% |
2015 | 352 | < 0.1% |
(Missing) | 23 | < 0.1% |
Value | Count | Frequency (%) |
2015 | 352 | < 0.1% |
2016 | 1891 | < 0.1% |
2017 | 936478 | |
2018 | 2181653 | |
2019 | 2010214 |
Value | Count | Frequency (%) |
2020 | 165420 | 3.1% |
2019 | 2010214 | |
2018 | 2181653 | |
2017 | 936478 | |
2016 | 1891 | < 0.1% |
pmts_year_507T
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 5296031 |
---|---|
Missing (%) | 100.0% |
Memory size | 40.4 MiB |
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 40.4 MiB |
Value | Count | Frequency (%) |
a55475b1 | 5296031 |
Most occurring characters
Value | Count | Frequency (%) |
5 | 15888093 | |
a | 5296031 | 12.5% |
4 | 5296031 | 12.5% |
7 | 5296031 | 12.5% |
b | 5296031 | 12.5% |
1 | 5296031 | 12.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 31776186 | |
Lowercase Letter | 10592062 | 25.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 15888093 | |
4 | 5296031 | 16.7% |
7 | 5296031 | 16.7% |
1 | 5296031 | 16.7% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 5296031 | |
b | 5296031 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 31776186 | |
Latin | 10592062 | 25.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 15888093 | |
4 | 5296031 | 16.7% |
7 | 5296031 | 16.7% |
1 | 5296031 | 16.7% |
Latin
Value | Count | Frequency (%) |
a | 5296031 | |
b | 5296031 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 42368248 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 15888093 | |
a | 5296031 | 12.5% |
4 | 5296031 | 12.5% |
7 | 5296031 | 12.5% |
b | 5296031 | 12.5% |
1 | 5296031 | 12.5% |
Distinct | 7 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 40.4 MiB |
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 8.000058157 |
Min length | 8 |
Characters and Unicode
Total characters | 42368556 |
---|---|
Distinct characters | 17 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | ab3c25cf |
---|---|
2nd row | a55475b1 |
3rd row | a55475b1 |
4th row | a55475b1 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 5097671 | |
ab3c25cf | 190737 | 3.6% |
be4fd70b | 4749 | 0.1% |
15f04f45 | 1313 | < 0.1% |
daf49a8a | 1251 | < 0.1% |
p28_48_88 | 308 | < 0.1% |
71ddaa88 | 2 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 15486376 | |
b | 5297906 | 12.5% |
a | 5292165 | 12.5% |
4 | 5106605 | 12.1% |
7 | 5102422 | 12.0% |
1 | 5098986 | 12.0% |
c | 381474 | 0.9% |
f | 199363 | 0.5% |
2 | 191045 | 0.5% |
3 | 190737 | 0.5% |
Other values (7) | 21477 | 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 31185971 | |
Lowercase Letter | 11181661 | 26.4% |
Connector Punctuation | 616 | < 0.1% |
Uppercase Letter | 308 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 15486376 | |
4 | 5106605 | 16.4% |
7 | 5102422 | 16.4% |
1 | 5098986 | 16.4% |
2 | 191045 | 0.6% |
3 | 190737 | 0.6% |
0 | 6062 | < 0.1% |
8 | 2487 | < 0.1% |
9 | 1251 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 5297906 | |
a | 5292165 | |
c | 381474 | 3.4% |
f | 199363 | 1.8% |
d | 6004 | 0.1% |
e | 4749 | < 0.1% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 616 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 308 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 31186587 | |
Latin | 11181969 | 26.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 15486376 | |
4 | 5106605 | 16.4% |
7 | 5102422 | 16.4% |
1 | 5098986 | 16.3% |
2 | 191045 | 0.6% |
3 | 190737 | 0.6% |
0 | 6062 | < 0.1% |
8 | 2487 | < 0.1% |
9 | 1251 | < 0.1% |
_ | 616 | < 0.1% |
Latin
Value | Count | Frequency (%) |
b | 5297906 | |
a | 5292165 | |
c | 381474 | 3.4% |
f | 199363 | 1.8% |
d | 6004 | 0.1% |
e | 4749 | < 0.1% |
P | 308 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 42368556 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 15486376 | |
b | 5297906 | 12.5% |
a | 5292165 | 12.5% |
4 | 5106605 | 12.1% |
7 | 5102422 | 12.0% |
1 | 5098986 | 12.0% |
c | 381474 | 0.9% |
f | 199363 | 0.5% |
2 | 191045 | 0.5% |
3 | 190737 | 0.5% |
Other values (7) | 21477 | 0.1% |