Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 4386062 |
| Missing cells | 28029984 |
| Missing cells (%) | 33.6% |
| Total size in memory | 635.8 MiB |
| Average record size in memory | 152.0 B |
Variable types
| Numeric | 13 |
|---|---|
| Text | 6 |
collater_valueofguarantee_1124L has 4340725 (99.0%) missing values | Missing |
collater_valueofguarantee_876L has 4209154 (96.0%) missing values | Missing |
pmts_dpd_1073P has 3729399 (85.0%) missing values | Missing |
pmts_dpd_303P has 2445060 (55.7%) missing values | Missing |
pmts_month_158T has 3280058 (74.8%) missing values | Missing |
pmts_month_706T has 288770 (6.6%) missing values | Missing |
pmts_overdue_1140A has 3725455 (84.9%) missing values | Missing |
pmts_overdue_1152A has 2442535 (55.7%) missing values | Missing |
pmts_year_1139T has 3280058 (74.8%) missing values | Missing |
pmts_year_507T has 288770 (6.6%) missing values | Missing |
collater_valueofguarantee_1124L is highly skewed (γ1 = 81.69064668) | Skewed |
collater_valueofguarantee_876L is highly skewed (γ1 = 80.87769684) | Skewed |
pmts_dpd_1073P is highly skewed (γ1 = 26.9832622) | Skewed |
pmts_dpd_303P is highly skewed (γ1 = 92.41077498) | Skewed |
pmts_overdue_1140A is highly skewed (γ1 = 105.9926051) | Skewed |
pmts_overdue_1152A is highly skewed (γ1 = 186.8551368) | Skewed |
collater_valueofguarantee_876L has 161055 (3.7%) zeros | Zeros |
num_group1 has 755928 (17.2%) zeros | Zeros |
num_group2 has 178326 (4.1%) zeros | Zeros |
pmts_dpd_1073P has 624821 (14.2%) zeros | Zeros |
pmts_dpd_303P has 1622067 (37.0%) zeros | Zeros |
pmts_overdue_1140A has 628168 (14.3%) zeros | Zeros |
pmts_overdue_1152A has 1612253 (36.8%) zeros | Zeros |
Reproduction
| Analysis started | 2024-02-13 19:44:21.892070 |
|---|---|
| Analysis finished | 2024-02-13 19:44:33.608998 |
| Duration | 11.72 seconds |
| Software version | ydata-profiling vv4.6.4 |
| Download configuration | config.json |
case_id
Real number (ℝ)
| Distinct | 23734 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1570147.672 |
| Minimum | 56408 |
|---|---|
| Maximum | 2703454 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 MiB |
Quantile statistics
| Minimum | 56408 |
|---|---|
| 5-th percentile | 256297 |
| Q1 | 1023878 |
| median | 1939098 |
| Q3 | 1944519 |
| 95-th percentile | 2702346 |
| Maximum | 2703454 |
| Range | 2647046 |
| Interquartile range (IQR) | 920641 |
Descriptive statistics
| Standard deviation | 819484.4816 |
|---|---|
| Coefficient of variation (CV) | 0.5219155474 |
| Kurtosis | -0.9099583482 |
| Mean | 1570147.672 |
| Median Absolute Deviation (MAD) | 6973 |
| Skewness | -0.5919843554 |
| Sum | 6.886765038 × 1012 |
| Variance | 6.715548156 × 1011 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 1936653 | 2004 | < 0.1% |
| 257343 | 1716 | < 0.1% |
| 257052 | 1704 | < 0.1% |
| 1939829 | 1572 | < 0.1% |
| 1025405 | 1332 | < 0.1% |
| 255943 | 1320 | < 0.1% |
| 1945583 | 1272 | < 0.1% |
| 1937123 | 1224 | < 0.1% |
| 1939842 | 1212 | < 0.1% |
| 1942430 | 1212 | < 0.1% |
| Other values (23724) | 4371494 |
| Value | Count | Frequency (%) |
| 56408 | 108 | |
| 56451 | 24 | < 0.1% |
| 56556 | 192 | |
| 56579 | 84 | |
| 56703 | 24 | < 0.1% |
| Value | Count | Frequency (%) |
| 2703454 | 252 | |
| 2703453 | 348 | |
| 2703452 | 72 | < 0.1% |
| 2703451 | 96 | < 0.1% |
| 2703450 | 252 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 35088496 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | a55475b1 |
|---|---|
| 2nd row | a55475b1 |
| 3rd row | a55475b1 |
| 4th row | a55475b1 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 4340725 | |
| 9a0c095e | 31104 | 0.7% |
| 8fd95e4b | 14216 | 0.3% |
| 06fb9ba8 | 14 | < 0.1% |
| 26cf31be | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 13067495 | |
| a | 4371843 | 12.5% |
| b | 4354972 | 12.4% |
| 4 | 4354941 | 12.4% |
| 1 | 4340728 | 12.4% |
| 7 | 4340725 | 12.4% |
| 9 | 76438 | 0.2% |
| 0 | 62222 | 0.2% |
| e | 45323 | 0.1% |
| c | 31107 | 0.1% |
| Other values (6) | 42702 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26256802 | |
| Lowercase Letter | 8831694 | 25.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 13067495 | |
| 4 | 4354941 | 16.6% |
| 1 | 4340728 | 16.5% |
| 7 | 4340725 | 16.5% |
| 9 | 76438 | 0.3% |
| 0 | 62222 | 0.2% |
| 8 | 14230 | 0.1% |
| 6 | 17 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4371843 | |
| b | 4354972 | |
| e | 45323 | 0.5% |
| c | 31107 | 0.4% |
| f | 14233 | 0.2% |
| d | 14216 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26256802 | |
| Latin | 8831694 | 25.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 13067495 | |
| 4 | 4354941 | 16.6% |
| 1 | 4340728 | 16.5% |
| 7 | 4340725 | 16.5% |
| 9 | 76438 | 0.3% |
| 0 | 62222 | 0.2% |
| 8 | 14230 | 0.1% |
| 6 | 17 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| a | 4371843 | |
| b | 4354972 | |
| e | 45323 | 0.5% |
| c | 31107 | 0.4% |
| f | 14233 | 0.2% |
| d | 14216 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35088496 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 13067495 | |
| a | 4371843 | 12.5% |
| b | 4354972 | 12.4% |
| 4 | 4354941 | 12.4% |
| 1 | 4340728 | 12.4% |
| 7 | 4340725 | 12.4% |
| 9 | 76438 | 0.2% |
| 0 | 62222 | 0.2% |
| e | 45323 | 0.1% |
| c | 31107 | 0.1% |
| Other values (6) | 42702 | 0.1% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 35088496 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 8fd95e4b |
|---|---|
| 2nd row | a55475b1 |
| 3rd row | a55475b1 |
| 4th row | a55475b1 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 4209154 | |
| 9a0c095e | 94501 | 2.2% |
| 8fd95e4b | 82177 | 1.9% |
| 06fb9ba8 | 197 | < 0.1% |
| 3cbe86ba | 32 | < 0.1% |
| 9276e4bb | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 12804140 | |
| a | 4303884 | 12.3% |
| b | 4291791 | 12.2% |
| 4 | 4291332 | 12.2% |
| 7 | 4209155 | 12.0% |
| 1 | 4209154 | 12.0% |
| 9 | 271377 | 0.8% |
| 0 | 189199 | 0.5% |
| e | 176711 | 0.5% |
| c | 94533 | 0.3% |
| Other values (6) | 247220 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26057026 | |
| Lowercase Letter | 9031470 | 25.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 12804140 | |
| 4 | 4291332 | 16.5% |
| 7 | 4209155 | 16.2% |
| 1 | 4209154 | 16.2% |
| 9 | 271377 | 1.0% |
| 0 | 189199 | 0.7% |
| 8 | 82406 | 0.3% |
| 6 | 230 | < 0.1% |
| 3 | 32 | < 0.1% |
| 2 | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4303884 | |
| b | 4291791 | |
| e | 176711 | 2.0% |
| c | 94533 | 1.0% |
| f | 82374 | 0.9% |
| d | 82177 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26057026 | |
| Latin | 9031470 | 25.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 12804140 | |
| 4 | 4291332 | 16.5% |
| 7 | 4209155 | 16.2% |
| 1 | 4209154 | 16.2% |
| 9 | 271377 | 1.0% |
| 0 | 189199 | 0.7% |
| 8 | 82406 | 0.3% |
| 6 | 230 | < 0.1% |
| 3 | 32 | < 0.1% |
| 2 | 1 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| a | 4303884 | |
| b | 4291791 | |
| e | 176711 | 2.0% |
| c | 94533 | 1.0% |
| f | 82374 | 0.9% |
| d | 82177 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35088496 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 12804140 | |
| a | 4303884 | 12.3% |
| b | 4291791 | 12.2% |
| 4 | 4291332 | 12.2% |
| 7 | 4209155 | 12.0% |
| 1 | 4209154 | 12.0% |
| 9 | 271377 | 0.8% |
| 0 | 189199 | 0.5% |
| e | 176711 | 0.5% |
| c | 94533 | 0.3% |
| Other values (6) | 247220 | 0.7% |
collater_valueofguarantee_1124L
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 2374 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 4340725 |
| Missing (%) | 99.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1250201.297 |
| Minimum | 0 |
|---|---|
| Maximum | 3515294000 |
| Zeros | 42324 |
| Zeros (%) | 1.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 3007364.784 |
| Maximum | 3515294000 |
| Range | 3515294000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 32106519.85 |
|---|---|
| Coefficient of variation (CV) | 25.68108027 |
| Kurtosis | 7871.848916 |
| Mean | 1250201.297 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 81.69064668 |
| Sum | 5.668037618 × 1010 |
| Variance | 1.030828617 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 42324 | 1.0% |
| 500000 | 20 | < 0.1% |
| 12000000 | 19 | < 0.1% |
| 200000 | 15 | < 0.1% |
| 4000000 | 15 | < 0.1% |
| 11385000 | 14 | < 0.1% |
| 3000000 | 14 | < 0.1% |
| 2000000 | 13 | < 0.1% |
| 5000000 | 11 | < 0.1% |
| 30000000 | 11 | < 0.1% |
| Other values (2364) | 2881 | 0.1% |
| (Missing) | 4340725 |
| Value | Count | Frequency (%) |
| 0 | 42324 | |
| 1 | 10 | < 0.1% |
| 1150 | 1 | < 0.1% |
| 2712 | 1 | < 0.1% |
| 3800 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3515294000 | 1 | < 0.1% |
| 3200000000 | 2 | |
| 1800000000 | 1 | < 0.1% |
| 1050000000 | 1 | < 0.1% |
| 1000000000 | 4 |
collater_valueofguarantee_876L
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 6556 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 4209154 |
| Missing (%) | 96.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 786298.2358 |
| Minimum | 0 |
|---|---|
| Maximum | 3250000000 |
| Zeros | 161055 |
| Zeros (%) | 3.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 130000 |
| Maximum | 3250000000 |
| Range | 3250000000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 31452025.18 |
|---|---|
| Coefficient of variation (CV) | 40.00012178 |
| Kurtosis | 7380.550392 |
| Mean | 786298.2358 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 80.87769684 |
| Sum | 1.391024483 × 1011 |
| Variance | 9.892298882 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 161055 | 3.7% |
| 60000 | 464 | < 0.1% |
| 130000 | 359 | < 0.1% |
| 100000 | 345 | < 0.1% |
| 30000000 | 249 | < 0.1% |
| 50000 | 205 | < 0.1% |
| 65000 | 192 | < 0.1% |
| 70000 | 161 | < 0.1% |
| 150000 | 161 | < 0.1% |
| 80000 | 160 | < 0.1% |
| Other values (6546) | 13557 | 0.3% |
| (Missing) | 4209154 |
| Value | Count | Frequency (%) |
| 0 | 161055 | |
| 0.99 | 3 | < 0.1% |
| 1 | 46 | < 0.1% |
| 1.8 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3250000000 | 5 | |
| 3200000000 | 5 | |
| 2000000000 | 11 | |
| 1200000000 | 7 | |
| 1015106700 | 1 | < 0.1% |
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 35088496 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3cbe86ba |
|---|---|
| 2nd row | a55475b1 |
| 3rd row | a55475b1 |
| 4th row | a55475b1 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 4209249 | |
| c7a5ad39 | 140275 | 3.2% |
| 3cbe86ba | 24588 | 0.6% |
| 9276e4bb | 3686 | 0.1% |
| 0e63c0f0 | 3265 | 0.1% |
| 168ad9f3 | 1180 | < 0.1% |
| 5224034a | 899 | < 0.1% |
| 7b62420e | 837 | < 0.1% |
| 940efad7 | 755 | < 0.1% |
| 2fd21cf1 | 481 | < 0.1% |
| Other values (5) | 847 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 12769513 | |
| a | 4517896 | 12.9% |
| 7 | 4355347 | 12.4% |
| b | 4267036 | 12.2% |
| 4 | 4217196 | 12.0% |
| 1 | 4211675 | 12.0% |
| 3 | 170515 | 0.5% |
| c | 169201 | 0.5% |
| 9 | 146512 | 0.4% |
| d | 142828 | 0.4% |
| Other values (6) | 120777 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 25951815 | |
| Lowercase Letter | 9136681 | 26.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 12769513 | |
| 7 | 4355347 | 16.8% |
| 4 | 4217196 | 16.3% |
| 1 | 4211675 | 16.2% |
| 3 | 170515 | 0.7% |
| 9 | 146512 | 0.6% |
| 6 | 33958 | 0.1% |
| 8 | 26189 | 0.1% |
| 0 | 12653 | < 0.1% |
| 2 | 8257 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4517896 | |
| b | 4267036 | |
| c | 169201 | 1.9% |
| d | 142828 | 1.6% |
| e | 33415 | 0.4% |
| f | 6305 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 25951815 | |
| Latin | 9136681 | 26.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 12769513 | |
| 7 | 4355347 | 16.8% |
| 4 | 4217196 | 16.3% |
| 1 | 4211675 | 16.2% |
| 3 | 170515 | 0.7% |
| 9 | 146512 | 0.6% |
| 6 | 33958 | 0.1% |
| 8 | 26189 | 0.1% |
| 0 | 12653 | < 0.1% |
| 2 | 8257 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| a | 4517896 | |
| b | 4267036 | |
| c | 169201 | 1.9% |
| d | 142828 | 1.6% |
| e | 33415 | 0.4% |
| f | 6305 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35088496 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 12769513 | |
| a | 4517896 | 12.9% |
| 7 | 4355347 | 12.4% |
| b | 4267036 | 12.2% |
| 4 | 4217196 | 12.0% |
| 1 | 4211675 | 12.0% |
| 3 | 170515 | 0.5% |
| c | 169201 | 0.5% |
| 9 | 146512 | 0.4% |
| d | 142828 | 0.4% |
| Other values (6) | 120777 | 0.3% |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 35088496 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | a55475b1 |
|---|---|
| 2nd row | a55475b1 |
| 3rd row | a55475b1 |
| 4th row | a55475b1 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 4340785 | |
| c7a5ad39 | 41366 | 0.9% |
| 9276e4bb | 1591 | < 0.1% |
| 0e63c0f0 | 1109 | < 0.1% |
| 168ad9f3 | 442 | < 0.1% |
| 7b62420e | 439 | < 0.1% |
| 940efad7 | 116 | < 0.1% |
| f4d8a027 | 79 | < 0.1% |
| 3cbe86ba | 51 | < 0.1% |
| 2fd21cf1 | 41 | < 0.1% |
| Other values (4) | 43 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 13063762 | |
| a | 4424232 | 12.6% |
| 7 | 4384396 | 12.5% |
| b | 4344528 | 12.4% |
| 4 | 4343058 | 12.4% |
| 1 | 4341327 | 12.4% |
| 9 | 43523 | 0.1% |
| 3 | 42991 | 0.1% |
| c | 42589 | 0.1% |
| d | 42044 | 0.1% |
| Other values (6) | 16046 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26229951 | |
| Lowercase Letter | 8858545 | 25.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 13063762 | |
| 7 | 4384396 | 16.7% |
| 4 | 4343058 | 16.6% |
| 1 | 4341327 | 16.6% |
| 9 | 43523 | 0.2% |
| 3 | 42991 | 0.2% |
| 0 | 3984 | < 0.1% |
| 6 | 3652 | < 0.1% |
| 2 | 2668 | < 0.1% |
| 8 | 590 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4424232 | |
| b | 4344528 | |
| c | 42589 | 0.5% |
| d | 42044 | 0.5% |
| e | 3324 | < 0.1% |
| f | 1828 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26229951 | |
| Latin | 8858545 | 25.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 13063762 | |
| 7 | 4384396 | 16.7% |
| 4 | 4343058 | 16.6% |
| 1 | 4341327 | 16.6% |
| 9 | 43523 | 0.2% |
| 3 | 42991 | 0.2% |
| 0 | 3984 | < 0.1% |
| 6 | 3652 | < 0.1% |
| 2 | 2668 | < 0.1% |
| 8 | 590 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| a | 4424232 | |
| b | 4344528 | |
| c | 42589 | 0.5% |
| d | 42044 | 0.5% |
| e | 3324 | < 0.1% |
| f | 1828 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35088496 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 13063762 | |
| a | 4424232 | 12.6% |
| 7 | 4384396 | 12.5% |
| b | 4344528 | 12.4% |
| 4 | 4343058 | 12.4% |
| 1 | 4341327 | 12.4% |
| 9 | 43523 | 0.1% |
| 3 | 42991 | 0.1% |
| c | 42589 | 0.1% |
| d | 42044 | 0.1% |
| Other values (6) | 16046 | < 0.1% |
num_group1
Real number (ℝ)
ZEROS 
| Distinct | 120 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.491723555 |
| Minimum | 0 |
|---|---|
| Maximum | 119 |
| Zeros | 755928 |
| Zeros (%) | 17.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 8 |
| 95-th percentile | 18 |
| Maximum | 119 |
| Range | 119 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 6.988234071 |
|---|---|
| Coefficient of variation (CV) | 1.272502886 |
| Kurtosis | 29.00008007 |
| Mean | 5.491723555 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 3.8286012 |
| Sum | 24087040 |
| Variance | 48.83541543 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 755928 | |
| 1 | 587413 | |
| 2 | 476894 | |
| 3 | 397025 | |
| 4 | 336111 | |
| 5 | 286647 | 6.5% |
| 6 | 242773 | 5.5% |
| 7 | 205885 | 4.7% |
| 8 | 173654 | 4.0% |
| 9 | 145200 | 3.3% |
| Other values (110) | 778532 |
| Value | Count | Frequency (%) |
| 0 | 755928 | |
| 1 | 587413 | |
| 2 | 476894 | |
| 3 | 397025 | |
| 4 | 336111 |
| Value | Count | Frequency (%) |
| 119 | 36 | |
| 118 | 36 | |
| 117 | 36 | |
| 116 | 48 | |
| 115 | 36 |
num_group2
Real number (ℝ)
ZEROS 
| Distinct | 36 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.46609897 |
| Minimum | 0 |
|---|---|
| Maximum | 35 |
| Zeros | 178326 |
| Zeros (%) | 4.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 12 |
| Q3 | 20 |
| 95-th percentile | 32 |
| Maximum | 35 |
| Range | 35 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 9.366761932 |
|---|---|
| Coefficient of variation (CV) | 0.6955809511 |
| Kurtosis | -0.6928494774 |
| Mean | 13.46609897 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.4895278852 |
| Sum | 59063145 |
| Variance | 87.73622909 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 178326 | 4.1% |
| 6 | 178306 | 4.1% |
| 11 | 178306 | 4.1% |
| 10 | 178306 | 4.1% |
| 8 | 178306 | 4.1% |
| 7 | 178306 | 4.1% |
| 9 | 178306 | 4.1% |
| 5 | 178306 | 4.1% |
| 4 | 178306 | 4.1% |
| 3 | 178306 | 4.1% |
| Other values (26) | 2602982 |
| Value | Count | Frequency (%) |
| 0 | 178326 | |
| 1 | 178306 | |
| 2 | 178306 | |
| 3 | 178306 | |
| 4 | 178306 |
| Value | Count | Frequency (%) |
| 35 | 55441 | |
| 34 | 55441 | |
| 33 | 55441 | |
| 32 | 55441 | |
| 31 | 55441 |
pmts_dpd_1073P
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 1678 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 3729399 |
| Missing (%) | 85.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.19676455 |
| Minimum | 0 |
|---|---|
| Maximum | 4155 |
| Zeros | 624821 |
| Zeros (%) | 14.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 4155 |
| Range | 4155 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 90.26236627 |
|---|---|
| Coefficient of variation (CV) | 17.36895435 |
| Kurtosis | 864.1461394 |
| Mean | 5.19676455 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 26.9832622 |
| Sum | 3412523 |
| Variance | 8147.294764 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 624821 | 14.2% |
| 1 | 5855 | 0.1% |
| 2 | 2542 | 0.1% |
| 3 | 1905 | < 0.1% |
| 4 | 1790 | < 0.1% |
| 7 | 1160 | < 0.1% |
| 5 | 1084 | < 0.1% |
| 6 | 1017 | < 0.1% |
| 8 | 1007 | < 0.1% |
| 10 | 838 | < 0.1% |
| Other values (1668) | 14644 | 0.3% |
| (Missing) | 3729399 |
| Value | Count | Frequency (%) |
| 0 | 624821 | |
| 1 | 5855 | 0.1% |
| 2 | 2542 | 0.1% |
| 3 | 1905 | < 0.1% |
| 4 | 1790 | < 0.1% |
| Value | Count | Frequency (%) |
| 4155 | 1 | |
| 4136 | 1 | |
| 4110 | 1 | |
| 4079 | 1 | |
| 4053 | 1 |
pmts_dpd_303P
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 3930 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 2445060 |
| Missing (%) | 55.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60.13521573 |
| Minimum | -4 |
|---|---|
| Maximum | 144000 |
| Zeros | 1622067 |
| Zeros (%) | 37.0% |
| Negative | 257 |
| Negative (%) | < 0.1% |
| Memory size | 33.5 MiB |
Quantile statistics
| Minimum | -4 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 383 |
| Maximum | 144000 |
| Range | 144004 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 329.9638175 |
|---|---|
| Coefficient of variation (CV) | 5.48703141 |
| Kurtosis | 30124.70291 |
| Mean | 60.13521573 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 92.41077498 |
| Sum | 116722574 |
| Variance | 108876.1209 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1622067 | |
| 1 | 43495 | 1.0% |
| 3 | 11351 | 0.3% |
| 2 | 11135 | 0.3% |
| 4 | 9405 | 0.2% |
| 6 | 7649 | 0.2% |
| 7 | 6425 | 0.1% |
| 5 | 6305 | 0.1% |
| 9 | 4578 | 0.1% |
| 8 | 4463 | 0.1% |
| Other values (3920) | 214129 | 4.9% |
| (Missing) | 2445060 |
| Value | Count | Frequency (%) |
| -4 | 11 | < 0.1% |
| -3 | 25 | < 0.1% |
| -2 | 68 | < 0.1% |
| -1 | 153 | < 0.1% |
| 0 | 1622067 |
| Value | Count | Frequency (%) |
| 144000 | 1 | |
| 84574 | 1 | |
| 84561 | 2 | |
| 84532 | 1 | |
| 84505 | 1 |
pmts_month_158T
Real number (ℝ)
MISSING 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3280058 |
| Missing (%) | 74.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.5 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3.75 |
| median | 6.5 |
| Q3 | 9.25 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5.5 |
Descriptive statistics
| Standard deviation | 3.45205409 |
|---|---|
| Coefficient of variation (CV) | 0.5310852446 |
| Kurtosis | -1.216783293 |
| Mean | 6.5 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0 |
| Sum | 7189026 |
| Variance | 11.91667744 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 92167 | 2.1% |
| 3 | 92167 | 2.1% |
| 4 | 92167 | 2.1% |
| 5 | 92167 | 2.1% |
| 6 | 92167 | 2.1% |
| 7 | 92167 | 2.1% |
| 8 | 92167 | 2.1% |
| 9 | 92167 | 2.1% |
| 10 | 92167 | 2.1% |
| 11 | 92167 | 2.1% |
| Other values (2) | 184334 | 4.2% |
| (Missing) | 3280058 |
| Value | Count | Frequency (%) |
| 1 | 92167 | |
| 2 | 92167 | |
| 3 | 92167 | |
| 4 | 92167 | |
| 5 | 92167 |
| Value | Count | Frequency (%) |
| 12 | 92167 | |
| 11 | 92167 | |
| 10 | 92167 | |
| 9 | 92167 | |
| 8 | 92167 |
pmts_month_706T
Real number (ℝ)
MISSING 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 288770 |
| Missing (%) | 6.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.5 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3.75 |
| median | 6.5 |
| Q3 | 9.25 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5.5 |
Descriptive statistics
| Standard deviation | 3.452052951 |
|---|---|
| Coefficient of variation (CV) | 0.5310850694 |
| Kurtosis | -1.216783237 |
| Mean | 6.5 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0 |
| Sum | 26632398 |
| Variance | 11.91666958 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 341441 | |
| 3 | 341441 | |
| 4 | 341441 | |
| 5 | 341441 | |
| 6 | 341441 | |
| 7 | 341441 | |
| 8 | 341441 | |
| 9 | 341441 | |
| 10 | 341441 | |
| 11 | 341441 | |
| Other values (2) | 682882 |
| Value | Count | Frequency (%) |
| 1 | 341441 | |
| 2 | 341441 | |
| 3 | 341441 | |
| 4 | 341441 | |
| 5 | 341441 |
| Value | Count | Frequency (%) |
| 12 | 341441 | |
| 11 | 341441 | |
| 10 | 341441 | |
| 9 | 341441 | |
| 8 | 341441 |
pmts_overdue_1140A
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 26436 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 3725455 |
| Missing (%) | 84.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 924.5240739 |
| Minimum | 0 |
|---|---|
| Maximum | 8737926 |
| Zeros | 628168 |
| Zeros (%) | 14.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 8737926 |
| Range | 8737926 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 38970.96235 |
|---|---|
| Coefficient of variation (CV) | 42.15245817 |
| Kurtosis | 13629.83893 |
| Mean | 924.5240739 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 105.9926051 |
| Sum | 610747074.9 |
| Variance | 1518735906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 628168 | 14.3% |
| 99.8 | 89 | < 0.1% |
| 10 | 52 | < 0.1% |
| 14 | 45 | < 0.1% |
| 400 | 31 | < 0.1% |
| 0.2 | 29 | < 0.1% |
| 10.400001 | 27 | < 0.1% |
| 4 | 26 | < 0.1% |
| 10500 | 24 | < 0.1% |
| 69126.055 | 24 | < 0.1% |
| Other values (26426) | 32092 | 0.7% |
| (Missing) | 3725455 |
| Value | Count | Frequency (%) |
| 0 | 628168 | |
| 0.002 | 1 | < 0.1% |
| 0.004 | 6 | < 0.1% |
| 0.006 | 2 | < 0.1% |
| 0.008 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 8737926 | 1 | |
| 6350786 | 1 | |
| 5523147 | 1 | |
| 5315128 | 1 | |
| 5182516 | 1 |
pmts_overdue_1152A
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 170033 |
|---|---|
| Distinct (%) | 8.7% |
| Missing | 2442535 |
| Missing (%) | 55.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4111.655686 |
| Minimum | 0 |
|---|---|
| Maximum | 17317146 |
| Zeros | 1612253 |
| Zeros (%) | 36.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 16963.4 |
| Maximum | 17317146 |
| Range | 17317146 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 64789.7787 |
|---|---|
| Coefficient of variation (CV) | 15.75758858 |
| Kurtosis | 45567.63126 |
| Mean | 4111.655686 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 186.8551368 |
| Sum | 7991113840 |
| Variance | 4197715424 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1612253 | |
| 0.2 | 894 | < 0.1% |
| 1000 | 471 | < 0.1% |
| 0.4 | 373 | < 0.1% |
| 2000 | 333 | < 0.1% |
| 0.6 | 310 | < 0.1% |
| 3000 | 297 | < 0.1% |
| 0.8 | 294 | < 0.1% |
| 1.6 | 286 | < 0.1% |
| 2 | 278 | < 0.1% |
| Other values (170023) | 327738 | 7.5% |
| (Missing) | 2442535 |
| Value | Count | Frequency (%) |
| 0 | 1612253 | |
| 0.002 | 27 | < 0.1% |
| 0.004 | 18 | < 0.1% |
| 0.006 | 16 | < 0.1% |
| 0.008 | 19 | < 0.1% |
| Value | Count | Frequency (%) |
| 17317146 | 1 | |
| 17230230 | 1 | |
| 17141206 | 1 | |
| 17051322 | 1 | |
| 16963518 | 1 |
pmts_year_1139T
Real number (ℝ)
MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3280058 |
| Missing (%) | 74.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2019.388432 |
| Minimum | 2016 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 MiB |
Quantile statistics
| Minimum | 2016 |
|---|---|
| 5-th percentile | 2018 |
| Q1 | 2019 |
| median | 2020 |
| Q3 | 2020 |
| 95-th percentile | 2020 |
| Maximum | 2021 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8013438654 |
|---|---|
| Coefficient of variation (CV) | 0.0003968250253 |
| Kurtosis | -0.697380051 |
| Mean | 2019.388432 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.3462998149 |
| Sum | 2233451683 |
| Variance | 0.6421519906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2020 | 520011 | 11.9% |
| 2019 | 362429 | 8.3% |
| 2018 | 179094 | 4.1% |
| 2021 | 44413 | 1.0% |
| 2017 | 35 | < 0.1% |
| 2016 | 22 | < 0.1% |
| (Missing) | 3280058 |
| Value | Count | Frequency (%) |
| 2016 | 22 | < 0.1% |
| 2017 | 35 | < 0.1% |
| 2018 | 179094 | 4.1% |
| 2019 | 362429 | |
| 2020 | 520011 |
| Value | Count | Frequency (%) |
| 2021 | 44413 | 1.0% |
| 2020 | 520011 | |
| 2019 | 362429 | |
| 2018 | 179094 | 4.1% |
| 2017 | 35 | < 0.1% |
pmts_year_507T
Real number (ℝ)
MISSING 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 288770 |
| Missing (%) | 6.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2015.066429 |
| Minimum | 2002 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 MiB |
Quantile statistics
| Minimum | 2002 |
|---|---|
| 5-th percentile | 2007 |
| Q1 | 2012 |
| median | 2016 |
| Q3 | 2018 |
| 95-th percentile | 2020 |
| Maximum | 2021 |
| Range | 19 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.072742933 |
|---|---|
| Coefficient of variation (CV) | 0.002021145743 |
| Kurtosis | -0.4521212835 |
| Mean | 2015.066429 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.76809658 |
| Sum | 8256315557 |
| Variance | 16.587235 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2019 | 603853 | |
| 2018 | 583318 | |
| 2017 | 432657 | |
| 2020 | 322828 | |
| 2016 | 316104 | |
| 2015 | 281667 | 6.4% |
| 2014 | 263154 | 6.0% |
| 2013 | 229842 | 5.2% |
| 2012 | 188788 | 4.3% |
| 2011 | 163820 | 3.7% |
| Other values (10) | 711261 | |
| (Missing) | 288770 |
| Value | Count | Frequency (%) |
| 2002 | 88 | < 0.1% |
| 2003 | 316 | < 0.1% |
| 2004 | 7827 | 0.2% |
| 2005 | 39143 | 0.9% |
| 2006 | 99161 |
| Value | Count | Frequency (%) |
| 2021 | 24768 | 0.6% |
| 2020 | 322828 | |
| 2019 | 603853 | |
| 2018 | 583318 | |
| 2017 | 432657 |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 35088496 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ab3c25cf |
|---|---|
| 2nd row | a55475b1 |
| 3rd row | a55475b1 |
| 4th row | a55475b1 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 4210385 | |
| ab3c25cf | 172826 | 3.9% |
| 15f04f45 | 1431 | < 0.1% |
| be4fd70b | 917 | < 0.1% |
| daf49a8a | 490 | < 0.1% |
| 71ddaa88 | 9 | < 0.1% |
| 0c42a10e | 2 | < 0.1% |
| 9ba4314a | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 12806843 | |
| b | 4385047 | 12.5% |
| a | 4384705 | 12.5% |
| 4 | 4214660 | 12.0% |
| 1 | 4211829 | 12.0% |
| 7 | 4211311 | 12.0% |
| c | 345654 | 1.0% |
| f | 177095 | 0.5% |
| 3 | 172828 | 0.5% |
| 2 | 172828 | 0.5% |
| Other values (5) | 5696 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 25793651 | |
| Lowercase Letter | 9294845 | 26.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 12806843 | |
| 4 | 4214660 | 16.3% |
| 1 | 4211829 | 16.3% |
| 7 | 4211311 | 16.3% |
| 3 | 172828 | 0.7% |
| 2 | 172828 | 0.7% |
| 0 | 2352 | < 0.1% |
| 8 | 508 | < 0.1% |
| 9 | 492 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| b | 4385047 | |
| a | 4384705 | |
| c | 345654 | 3.7% |
| f | 177095 | 1.9% |
| d | 1425 | < 0.1% |
| e | 919 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 25793651 | |
| Latin | 9294845 | 26.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 12806843 | |
| 4 | 4214660 | 16.3% |
| 1 | 4211829 | 16.3% |
| 7 | 4211311 | 16.3% |
| 3 | 172828 | 0.7% |
| 2 | 172828 | 0.7% |
| 0 | 2352 | < 0.1% |
| 8 | 508 | < 0.1% |
| 9 | 492 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| b | 4385047 | |
| a | 4384705 | |
| c | 345654 | 3.7% |
| f | 177095 | 1.9% |
| d | 1425 | < 0.1% |
| e | 919 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35088496 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 12806843 | |
| b | 4385047 | 12.5% |
| a | 4384705 | 12.5% |
| 4 | 4214660 | 12.0% |
| 1 | 4211829 | 12.0% |
| 7 | 4211311 | 12.0% |
| c | 345654 | 1.0% |
| f | 177095 | 0.5% |
| 3 | 172828 | 0.5% |
| 2 | 172828 | 0.5% |
| Other values (5) | 5696 | < 0.1% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8.00001596 |
| Min length | 8 |
Characters and Unicode
| Total characters | 35088566 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | a55475b1 |
|---|---|
| 2nd row | a55475b1 |
| 3rd row | a55475b1 |
| 4th row | a55475b1 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 4341390 | |
| ab3c25cf | 43626 | 1.0% |
| be4fd70b | 408 | < 0.1% |
| daf49a8a | 299 | < 0.1% |
| 15f04f45 | 269 | < 0.1% |
| p28_48_88 | 70 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 13068334 | |
| a | 4385913 | 12.5% |
| b | 4385832 | 12.5% |
| 4 | 4342705 | 12.4% |
| 7 | 4341798 | 12.4% |
| 1 | 4341659 | 12.4% |
| c | 87252 | 0.2% |
| f | 44871 | 0.1% |
| 2 | 43696 | 0.1% |
| 3 | 43626 | 0.1% |
| Other values (7) | 2880 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26183373 | |
| Lowercase Letter | 8904983 | 25.4% |
| Connector Punctuation | 140 | < 0.1% |
| Uppercase Letter | 70 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 13068334 | |
| 4 | 4342705 | 16.6% |
| 7 | 4341798 | 16.6% |
| 1 | 4341659 | 16.6% |
| 2 | 43696 | 0.2% |
| 3 | 43626 | 0.2% |
| 0 | 677 | < 0.1% |
| 8 | 579 | < 0.1% |
| 9 | 299 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4385913 | |
| b | 4385832 | |
| c | 87252 | 1.0% |
| f | 44871 | 0.5% |
| d | 707 | < 0.1% |
| e | 408 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 140 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 70 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26183513 | |
| Latin | 8905053 | 25.4% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 13068334 | |
| 4 | 4342705 | 16.6% |
| 7 | 4341798 | 16.6% |
| 1 | 4341659 | 16.6% |
| 2 | 43696 | 0.2% |
| 3 | 43626 | 0.2% |
| 0 | 677 | < 0.1% |
| 8 | 579 | < 0.1% |
| 9 | 299 | < 0.1% |
| _ | 140 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| a | 4385913 | |
| b | 4385832 | |
| c | 87252 | 1.0% |
| f | 44871 | 0.5% |
| d | 707 | < 0.1% |
| e | 408 | < 0.1% |
| P | 70 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35088566 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 13068334 | |
| a | 4385913 | 12.5% |
| b | 4385832 | 12.5% |
| 4 | 4342705 | 12.4% |
| 7 | 4341798 | 12.4% |
| 1 | 4341659 | 12.4% |
| c | 87252 | 0.2% |
| f | 44871 | 0.1% |
| 2 | 43696 | 0.1% |
| 3 | 43626 | 0.1% |
| Other values (7) | 2880 | < 0.1% |