Dataset statistics
| Number of variables | 41 |
|---|---|
| Number of observations | 2638295 |
| Missing cells | 34503185 |
| Missing cells (%) | 31.9% |
| Total size in memory | 825.3 MiB |
| Average record size in memory | 328.0 B |
Variable types
| Numeric | 20 |
|---|---|
| Text | 19 |
| Boolean | 2 |
isbidproduct_390L is highly imbalanced (69.3%) | Imbalance |
annuity_853A has 94885 (3.6%) missing values | Missing |
approvaldate_319D has 1244273 (47.2%) missing values | Missing |
byoccupationinc_3656910L has 2095507 (79.4%) missing values | Missing |
childnum_21L has 1605531 (60.9%) missing values | Missing |
credacc_actualbalance_314A has 2486439 (94.2%) missing values | Missing |
credacc_credlmt_575A has 75062 (2.8%) missing values | Missing |
credacc_maxhisbal_375A has 2486439 (94.2%) missing values | Missing |
credacc_minhisbal_90A has 2486439 (94.2%) missing values | Missing |
credacc_status_367L has 2486439 (94.2%) missing values | Missing |
credacc_transactions_402L has 2486439 (94.2%) missing values | Missing |
credamount_590A has 78869 (3.0%) missing values | Missing |
credtype_587L has 78869 (3.0%) missing values | Missing |
currdebt_94A has 976135 (37.0%) missing values | Missing |
dateactivated_425D has 1297051 (49.2%) missing values | Missing |
downpmt_134A has 78869 (3.0%) missing values | Missing |
dtlastpmt_581D has 1890009 (71.6%) missing values | Missing |
dtlastpmtallstes_3545839D has 1609466 (61.0%) missing values | Missing |
employedfrom_700D has 1705609 (64.6%) missing values | Missing |
familystate_726L has 1148691 (43.5%) missing values | Missing |
firstnonzeroinstldate_307D has 287307 (10.9%) missing values | Missing |
inittransactioncode_279L has 78869 (3.0%) missing values | Missing |
isdebitcard_527L has 2425416 (91.9%) missing values | Missing |
mainoccupationinc_437A has 65371 (2.5%) missing values | Missing |
maxdpdtolerance_577P has 1278326 (48.5%) missing values | Missing |
outstandingdebt_522A has 980346 (37.2%) missing values | Missing |
pmtnum_8L has 238987 (9.1%) missing values | Missing |
revolvingaccount_394A has 2498196 (94.7%) missing values | Missing |
tenor_203L has 238987 (9.1%) missing values | Missing |
actualdpd_943P is highly skewed (γ1 = 530.2918463) | Skewed |
credacc_maxhisbal_375A is highly skewed (γ1 = 43.42768751) | Skewed |
downpmt_134A is highly skewed (γ1 = 21.07123108) | Skewed |
actualdpd_943P has 2636655 (99.9%) zeros | Zeros |
annuity_853A has 200456 (7.6%) zeros | Zeros |
byoccupationinc_3656910L has 34448 (1.3%) zeros | Zeros |
childnum_21L has 574109 (21.8%) zeros | Zeros |
credacc_actualbalance_314A has 47706 (1.8%) zeros | Zeros |
credacc_credlmt_575A has 2369092 (89.8%) zeros | Zeros |
credacc_maxhisbal_375A has 84335 (3.2%) zeros | Zeros |
credacc_minhisbal_90A has 89232 (3.4%) zeros | Zeros |
credacc_transactions_402L has 134552 (5.1%) zeros | Zeros |
credamount_590A has 73846 (2.8%) zeros | Zeros |
currdebt_94A has 1441963 (54.7%) zeros | Zeros |
downpmt_134A has 2348164 (89.0%) zeros | Zeros |
maxdpdtolerance_577P has 986233 (37.4%) zeros | Zeros |
num_group1 has 438525 (16.6%) zeros | Zeros |
outstandingdebt_522A has 1436446 (54.4%) zeros | Zeros |
Reproduction
| Analysis started | 2024-02-13 19:36:58.685528 |
|---|---|
| Analysis finished | 2024-02-13 19:37:23.604870 |
| Duration | 24.92 seconds |
| Software version | ydata-profiling vv4.6.4 |
| Download configuration | config.json |
case_id
Real number (ℝ)
| Distinct | 438525 |
|---|---|
| Distinct (%) | 16.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1482078.441 |
| Minimum | 40704 |
|---|---|
| Maximum | 2703454 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 40704 |
|---|---|
| 5-th percentile | 198771 |
| Q1 | 257431 |
| median | 1788760 |
| Q3 | 1895740 |
| 95-th percentile | 2683686.3 |
| Maximum | 2703454 |
| Range | 2662750 |
| Interquartile range (IQR) | 1638309 |
Descriptive statistics
| Standard deviation | 822852.6011 |
|---|---|
| Coefficient of variation (CV) | 0.5552017887 |
| Kurtosis | -0.9550918823 |
| Mean | 1482078.441 |
| Median Absolute Deviation (MAD) | 129385 |
| Skewness | -0.5094453682 |
| Sum | 3.910160139 × 1012 |
| Variance | 6.770864032 × 1011 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 2660368 | 20 | < 0.1% |
| 229396 | 20 | < 0.1% |
| 229411 | 20 | < 0.1% |
| 195358 | 20 | < 0.1% |
| 250069 | 20 | < 0.1% |
| 1809155 | 20 | < 0.1% |
| 1906631 | 20 | < 0.1% |
| 250081 | 20 | < 0.1% |
| 1715312 | 20 | < 0.1% |
| 250087 | 20 | < 0.1% |
| Other values (438515) | 2638095 |
| Value | Count | Frequency (%) |
| 40704 | 1 | < 0.1% |
| 40734 | 1 | < 0.1% |
| 40737 | 1 | < 0.1% |
| 40791 | 3 | |
| 40821 | 2 |
| Value | Count | Frequency (%) |
| 2703454 | 2 | < 0.1% |
| 2703453 | 9 | |
| 2703452 | 3 | < 0.1% |
| 2703451 | 6 | |
| 2703450 | 13 |
actualdpd_943P
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 157 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 266 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.02098536445 |
| Minimum | 0 |
|---|---|
| Maximum | 4206 |
| Zeros | 2636655 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 4206 |
| Range | 4206 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 5.707747487 |
|---|---|
| Coefficient of variation (CV) | 271.9870555 |
| Kurtosis | 320201.8546 |
| Mean | 0.02098536445 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 530.2918463 |
| Sum | 55360 |
| Variance | 32.57838137 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2636655 | |
| 1 | 520 | < 0.1% |
| 2 | 220 | < 0.1% |
| 3 | 153 | < 0.1% |
| 4 | 53 | < 0.1% |
| 6 | 38 | < 0.1% |
| 5 | 32 | < 0.1% |
| 7 | 21 | < 0.1% |
| 8 | 20 | < 0.1% |
| 9 | 14 | < 0.1% |
| Other values (147) | 303 | < 0.1% |
| (Missing) | 266 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2636655 | |
| 1 | 520 | < 0.1% |
| 2 | 220 | < 0.1% |
| 3 | 153 | < 0.1% |
| 4 | 53 | < 0.1% |
| Value | Count | Frequency (%) |
| 4206 | 1 | |
| 3980 | 1 | |
| 3623 | 1 | |
| 2617 | 1 | |
| 2505 | 1 |
annuity_853A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 76848 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 94885 |
| Missing (%) | 3.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3502.854599 |
| Minimum | 0 |
|---|---|
| Maximum | 103000 |
| Zeros | 200456 |
| Zeros (%) | 7.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1711.6 |
| median | 2825.8 |
| Q3 | 4595.6 |
| 95-th percentile | 8829.601 |
| Maximum | 103000 |
| Range | 103000 |
| Interquartile range (IQR) | 2884 |
Descriptive statistics
| Standard deviation | 2963.25686 |
|---|---|
| Coefficient of variation (CV) | 0.8459548564 |
| Kurtosis | 28.49244386 |
| Mean | 3502.854599 |
| Median Absolute Deviation (MAD) | 1324.8 |
| Skewness | 3.117039065 |
| Sum | 8909195416 |
| Variance | 8780891.216 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 200456 | 7.6% |
| 1580 | 2372 | 0.1% |
| 1508 | 1974 | 0.1% |
| 2716 | 1633 | 0.1% |
| 3820 | 1096 | < 0.1% |
| 2000 | 1083 | < 0.1% |
| 2103 | 1015 | < 0.1% |
| 2558.4001 | 1001 | < 0.1% |
| 3837.4001 | 997 | < 0.1% |
| 1668 | 990 | < 0.1% |
| Other values (76838) | 2330793 | |
| (Missing) | 94885 | 3.6% |
| Value | Count | Frequency (%) |
| 0 | 200456 | |
| 2 | 1 | < 0.1% |
| 2.2 | 1 | < 0.1% |
| 2.4 | 1 | < 0.1% |
| 2.6000001 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 103000 | 1 | |
| 99646.6 | 2 | |
| 96987.9 | 1 | |
| 95685.2 | 1 | |
| 94012.2 | 1 |
MISSING 
| Distinct | 5398 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 1244273 |
| Missing (%) | 47.2% |
| Memory size | 20.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 13940220 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2019-10-28 |
|---|---|
| 2nd row | 2019-09-13 |
| 3rd row | 2019-10-09 |
| 4th row | 2019-12-01 |
| 5th row | 2019-10-27 |
| Value | Count | Frequency (%) |
| 2019-12-14 | 1795 | 0.1% |
| 2019-12-13 | 1717 | 0.1% |
| 2019-09-21 | 1506 | 0.1% |
| 2019-08-30 | 1503 | 0.1% |
| 2018-12-07 | 1474 | 0.1% |
| 2019-12-27 | 1454 | 0.1% |
| 2019-09-20 | 1449 | 0.1% |
| 2019-11-30 | 1432 | 0.1% |
| 2019-11-29 | 1419 | 0.1% |
| 2019-06-28 | 1403 | 0.1% |
| Other values (5388) | 1378870 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3265444 | |
| - | 2788044 | |
| 1 | 2513445 | |
| 2 | 2385626 | |
| 9 | 614752 | 4.4% |
| 8 | 543699 | 3.9% |
| 7 | 448386 | 3.2% |
| 3 | 393048 | 2.8% |
| 6 | 367360 | 2.6% |
| 5 | 313079 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11152176 | |
| Dash Punctuation | 2788044 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3265444 | |
| 1 | 2513445 | |
| 2 | 2385626 | |
| 9 | 614752 | 5.5% |
| 8 | 543699 | 4.9% |
| 7 | 448386 | 4.0% |
| 3 | 393048 | 3.5% |
| 6 | 367360 | 3.3% |
| 5 | 313079 | 2.8% |
| 4 | 307337 | 2.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2788044 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13940220 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3265444 | |
| - | 2788044 | |
| 1 | 2513445 | |
| 2 | 2385626 | |
| 9 | 614752 | 4.4% |
| 8 | 543699 | 3.9% |
| 7 | 448386 | 3.2% |
| 3 | 393048 | 2.8% |
| 6 | 367360 | 2.6% |
| 5 | 313079 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13940220 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3265444 | |
| - | 2788044 | |
| 1 | 2513445 | |
| 2 | 2385626 | |
| 9 | 614752 | 4.4% |
| 8 | 543699 | 3.9% |
| 7 | 448386 | 3.2% |
| 3 | 393048 | 2.8% |
| 6 | 367360 | 2.6% |
| 5 | 313079 | 2.2% |
byoccupationinc_3656910L
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 17995 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 2095507 |
| Missing (%) | 79.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20411.15626 |
| Minimum | 0 |
|---|---|
| Maximum | 200000 |
| Zeros | 34448 |
| Zeros (%) | 1.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 10000 |
| Q3 | 30000 |
| 95-th percentile | 75000 |
| Maximum | 200000 |
| Range | 200000 |
| Interquartile range (IQR) | 29999 |
Descriptive statistics
| Standard deviation | 30931.99075 |
|---|---|
| Coefficient of variation (CV) | 1.515445296 |
| Kurtosis | 10.36416476 |
| Mean | 20411.15626 |
| Median Absolute Deviation (MAD) | 9999 |
| Skewness | 2.743120942 |
| Sum | 1.107893068 × 1010 |
| Variance | 956788051.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 224806 | 8.5% |
| 0 | 34448 | 1.3% |
| 15000 | 27344 | 1.0% |
| 20000 | 23247 | 0.9% |
| 30000 | 21042 | 0.8% |
| 25000 | 19112 | 0.7% |
| 50000 | 17923 | 0.7% |
| 10000 | 13261 | 0.5% |
| 35000 | 10526 | 0.4% |
| 40000 | 10433 | 0.4% |
| Other values (17985) | 140646 | 5.3% |
| (Missing) | 2095507 |
| Value | Count | Frequency (%) |
| 0 | 34448 | 1.3% |
| 1 | 224806 | |
| 2 | 3 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 200000 | 3856 | |
| 199000 | 13 | < 0.1% |
| 198000 | 12 | < 0.1% |
| 197000 | 6 | < 0.1% |
| 196300 | 3 | < 0.1% |
| Distinct | 73 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.1 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 8 |
| Mean length | 8.961243909 |
| Min length | 8 |
Characters and Unicode
| Total characters | 23642405 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | P94_109_143 |
|---|---|
| 2nd row | P94_109_143 |
| 3rd row | a55475b1 |
| 4th row | P94_109_143 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 1723449 | |
| p94_109_143 | 654286 | 24.8% |
| p30_86_84 | 48402 | 1.8% |
| p180_60_137 | 31915 | 1.2% |
| p198_89_166 | 26012 | 1.0% |
| p73_130_169 | 25691 | 1.0% |
| p85_114_140 | 23658 | 0.9% |
| p52_67_90 | 18020 | 0.7% |
| p24_27_36 | 14627 | 0.6% |
| p69_72_116 | 12954 | 0.5% |
| Other values (63) | 59281 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 5244804 | |
| 1 | 3440737 | |
| 4 | 3170300 | |
| 7 | 1838516 | 7.8% |
| _ | 1829692 | 7.7% |
| a | 1723449 | 7.3% |
| b | 1723449 | 7.3% |
| 9 | 1447957 | 6.1% |
| P | 914846 | 3.9% |
| 0 | 873198 | 3.7% |
| Other values (4) | 1435457 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17450969 | |
| Lowercase Letter | 3446898 | 14.6% |
| Connector Punctuation | 1829692 | 7.7% |
| Uppercase Letter | 914846 | 3.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 5244804 | |
| 1 | 3440737 | |
| 4 | 3170300 | |
| 7 | 1838516 | 10.5% |
| 9 | 1447957 | 8.3% |
| 0 | 873198 | 5.0% |
| 3 | 840337 | 4.8% |
| 6 | 265561 | 1.5% |
| 8 | 233431 | 1.3% |
| 2 | 96128 | 0.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1723449 | |
| b | 1723449 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1829692 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 914846 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19280661 | |
| Latin | 4361744 | 18.4% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 5244804 | |
| 1 | 3440737 | |
| 4 | 3170300 | |
| 7 | 1838516 | 9.5% |
| _ | 1829692 | 9.5% |
| 9 | 1447957 | 7.5% |
| 0 | 873198 | 4.5% |
| 3 | 840337 | 4.4% |
| 6 | 265561 | 1.4% |
| 8 | 233431 | 1.2% |
Latin
| Value | Count | Frequency (%) |
| a | 1723449 | |
| b | 1723449 | |
| P | 914846 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23642405 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 5244804 | |
| 1 | 3440737 | |
| 4 | 3170300 | |
| 7 | 1838516 | 7.8% |
| _ | 1829692 | 7.7% |
| a | 1723449 | 7.3% |
| b | 1723449 | 7.3% |
| 9 | 1447957 | 6.1% |
| P | 914846 | 3.9% |
| 0 | 873198 | 3.7% |
| Other values (4) | 1435457 | 6.1% |
childnum_21L
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1605531 |
| Missing (%) | 60.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8416530785 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 574109 |
| Zeros (%) | 21.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.238723149 |
|---|---|
| Coefficient of variation (CV) | 1.471774037 |
| Kurtosis | 6.280759591 |
| Mean | 0.8416530785 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.047090682 |
| Sum | 869229 |
| Variance | 1.534435041 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 574109 | 21.8% |
| 1 | 220608 | 8.4% |
| 2 | 142948 | 5.4% |
| 3 | 52869 | 2.0% |
| 4 | 22176 | 0.8% |
| 5 | 11446 | 0.4% |
| 6 | 5079 | 0.2% |
| 7 | 1959 | 0.1% |
| 8 | 813 | < 0.1% |
| 9 | 406 | < 0.1% |
| Other values (9) | 351 | < 0.1% |
| (Missing) | 1605531 |
| Value | Count | Frequency (%) |
| 0 | 574109 | |
| 1 | 220608 | 8.4% |
| 2 | 142948 | 5.4% |
| 3 | 52869 | 2.0% |
| 4 | 22176 | 0.8% |
| Value | Count | Frequency (%) |
| 20 | 10 | |
| 17 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 8 | |
| 14 | 7 |
| Distinct | 5402 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 31 |
| Missing (%) | < 0.1% |
| Memory size | 20.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 26382640 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2018-11-20 |
|---|---|
| 2nd row | 2019-12-26 |
| 3rd row | 2014-07-17 |
| 4th row | 2017-08-21 |
| 5th row | 2014-12-28 |
| Value | Count | Frequency (%) |
| 2019-12-13 | 3083 | 0.1% |
| 2019-12-27 | 2862 | 0.1% |
| 2019-12-14 | 2856 | 0.1% |
| 2020-01-01 | 2836 | 0.1% |
| 2019-12-02 | 2740 | 0.1% |
| 2019-08-30 | 2716 | 0.1% |
| 2019-09-30 | 2693 | 0.1% |
| 2019-09-27 | 2674 | 0.1% |
| 2019-11-29 | 2672 | 0.1% |
| 2020-01-10 | 2637 | 0.1% |
| Other values (5392) | 2610495 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6241503 | |
| - | 5276528 | |
| 1 | 4657136 | |
| 2 | 4569753 | |
| 9 | 1160245 | 4.4% |
| 8 | 996039 | 3.8% |
| 7 | 821862 | 3.1% |
| 3 | 749668 | 2.8% |
| 6 | 685941 | 2.6% |
| 4 | 617725 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 21106112 | |
| Dash Punctuation | 5276528 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6241503 | |
| 1 | 4657136 | |
| 2 | 4569753 | |
| 9 | 1160245 | 5.5% |
| 8 | 996039 | 4.7% |
| 7 | 821862 | 3.9% |
| 3 | 749668 | 3.6% |
| 6 | 685941 | 3.2% |
| 4 | 617725 | 2.9% |
| 5 | 606240 | 2.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5276528 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26382640 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6241503 | |
| - | 5276528 | |
| 1 | 4657136 | |
| 2 | 4569753 | |
| 9 | 1160245 | 4.4% |
| 8 | 996039 | 3.8% |
| 7 | 821862 | 3.1% |
| 3 | 749668 | 2.8% |
| 6 | 685941 | 2.6% |
| 4 | 617725 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26382640 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6241503 | |
| - | 5276528 | |
| 1 | 4657136 | |
| 2 | 4569753 | |
| 9 | 1160245 | 4.4% |
| 8 | 996039 | 3.8% |
| 7 | 821862 | 3.1% |
| 3 | 749668 | 2.8% |
| 6 | 685941 | 2.6% |
| 4 | 617725 | 2.3% |
credacc_actualbalance_314A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 48131 |
|---|---|
| Distinct (%) | 31.7% |
| Missing | 2486439 |
| Missing (%) | 94.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16055.30345 |
| Minimum | -134008.42 |
|---|---|
| Maximum | 1600000 |
| Zeros | 47706 |
| Zeros (%) | 1.8% |
| Negative | 526 |
| Negative (%) | < 0.1% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | -134008.42 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 182 |
| Q3 | 23136 |
| 95-th percentile | 75998 |
| Maximum | 1600000 |
| Range | 1734008.42 |
| Interquartile range (IQR) | 23136 |
Descriptive statistics
| Standard deviation | 27948.64962 |
|---|---|
| Coefficient of variation (CV) | 1.740773677 |
| Kurtosis | 88.97889429 |
| Mean | 16055.30345 |
| Median Absolute Deviation (MAD) | 182 |
| Skewness | 4.150777153 |
| Sum | 2438094161 |
| Variance | 781127015.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 47706 | 1.8% |
| 100000 | 2563 | 0.1% |
| 2 | 785 | < 0.1% |
| 0.2 | 509 | < 0.1% |
| 190 | 462 | < 0.1% |
| 10 | 462 | < 0.1% |
| 4 | 450 | < 0.1% |
| 42640 | 437 | < 0.1% |
| 12000 | 435 | < 0.1% |
| 20300 | 427 | < 0.1% |
| Other values (48121) | 97620 | 3.7% |
| (Missing) | 2486439 |
| Value | Count | Frequency (%) |
| -134008.42 | 1 | |
| -99800 | 1 | |
| -94996 | 1 | |
| -83473.94 | 1 | |
| -70909.414 | 1 |
| Value | Count | Frequency (%) |
| 1600000 | 1 | |
| 952181.6 | 1 | |
| 459241.6 | 1 | |
| 419952.4 | 1 | |
| 400000.28 | 2 |
credacc_credlmt_575A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 30376 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 75062 |
| Missing (%) | 2.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3608.607928 |
| Minimum | 0 |
|---|---|
| Maximum | 400000 |
| Zeros | 2369092 |
| Zeros (%) | 89.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 24000 |
| Maximum | 400000 |
| Range | 400000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 16507.4851 |
|---|---|
| Coefficient of variation (CV) | 4.574474543 |
| Kurtosis | 68.64487772 |
| Mean | 3608.607928 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.800101576 |
| Sum | 9249702926 |
| Variance | 272497064.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2369092 | |
| 100000 | 21455 | 0.8% |
| 12000 | 6982 | 0.3% |
| 40000 | 4799 | 0.2% |
| 20000 | 4736 | 0.2% |
| 60000 | 3721 | 0.1% |
| 150000 | 2546 | 0.1% |
| 30000 | 2469 | 0.1% |
| 10000 | 2161 | 0.1% |
| 24000 | 1817 | 0.1% |
| Other values (30366) | 143455 | 5.4% |
| (Missing) | 75062 | 2.8% |
| Value | Count | Frequency (%) |
| 0 | 2369092 | |
| 0.2 | 773 | < 0.1% |
| 0.8 | 4 | < 0.1% |
| 1.2 | 2 | < 0.1% |
| 3 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 400000 | 195 | |
| 394600 | 2 | < 0.1% |
| 391400 | 2 | < 0.1% |
| 382200 | 2 | < 0.1% |
| 377400 | 1 | < 0.1% |
credacc_maxhisbal_375A
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 39326 |
|---|---|
| Distinct (%) | 25.9% |
| Missing | 2486439 |
| Missing (%) | 94.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -1878.262158 |
| Minimum | -290265.1 |
|---|---|
| Maximum | 3800000 |
| Zeros | 84335 |
| Zeros (%) | 3.2% |
| Negative | 21622 |
| Negative (%) | 0.8% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | -290265.1 |
|---|---|
| 5-th percentile | -29680.941 |
| Q1 | 0 |
| median | 0 |
| Q3 | 7.651499925 |
| 95-th percentile | 3544.169 |
| Maximum | 3800000 |
| Range | 4090265.1 |
| Interquartile range (IQR) | 7.651499925 |
Descriptive statistics
| Standard deviation | 29606.77848 |
|---|---|
| Coefficient of variation (CV) | -15.76285736 |
| Kurtosis | 4283.776058 |
| Mean | -1878.262158 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 43.42768751 |
| Sum | -285225378.2 |
| Variance | 876561331.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 84335 | 3.2% |
| 2 | 1352 | 0.1% |
| 4 | 683 | < 0.1% |
| 10 | 444 | < 0.1% |
| 190 | 435 | < 0.1% |
| 22 | 374 | < 0.1% |
| 6 | 367 | < 0.1% |
| 180 | 328 | < 0.1% |
| 90 | 307 | < 0.1% |
| 80 | 282 | < 0.1% |
| Other values (39316) | 62949 | 2.4% |
| (Missing) | 2486439 |
| Value | Count | Frequency (%) |
| -290265.1 | 1 | |
| -199950 | 1 | |
| -198762 | 1 | |
| -197850.3 | 1 | |
| -196450 | 1 |
| Value | Count | Frequency (%) |
| 3800000 | 1 | |
| 3640000 | 1 | |
| 2400200 | 1 | |
| 2000000 | 2 | |
| 1999990 | 1 |
credacc_minhisbal_90A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 37539 |
|---|---|
| Distinct (%) | 24.7% |
| Missing | 2486439 |
| Missing (%) | 94.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -5670.541866 |
| Minimum | -350532.6 |
|---|---|
| Maximum | 239000 |
| Zeros | 89232 |
| Zeros (%) | 3.4% |
| Negative | 29161 |
| Negative (%) | 1.1% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | -350532.6 |
|---|---|
| 5-th percentile | -39975 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 157.7795 |
| Maximum | 239000 |
| Range | 589532.6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 18143.37868 |
|---|---|
| Coefficient of variation (CV) | -3.199584644 |
| Kurtosis | 27.84215621 |
| Mean | -5670.541866 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -4.501135816 |
| Sum | -861105805.7 |
| Variance | 329182189.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 89232 | 3.4% |
| 2 | 998 | < 0.1% |
| 4 | 564 | < 0.1% |
| 10 | 444 | < 0.1% |
| 190 | 430 | < 0.1% |
| -10 | 366 | < 0.1% |
| 6 | 325 | < 0.1% |
| 180 | 324 | < 0.1% |
| 90 | 310 | < 0.1% |
| 80 | 269 | < 0.1% |
| Other values (37529) | 58594 | 2.2% |
| (Missing) | 2486439 |
| Value | Count | Frequency (%) |
| -350532.6 | 1 | |
| -319998.6 | 1 | |
| -319856 | 1 | |
| -309628.03 | 1 | |
| -299717.5 | 1 |
| Value | Count | Frequency (%) |
| 239000 | 1 | |
| 120000 | 1 | |
| 101840 | 1 | |
| 100000 | 2 | |
| 99990 | 1 |
MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2486439 |
| Missing (%) | 94.2% |
| Memory size | 20.1 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.00603203 |
| Min length | 2 |
Characters and Unicode
| Total characters | 304628 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AC |
|---|---|
| 2nd row | AC |
| 3rd row | AC |
| 4th row | AC |
| 5th row | AC |
| Value | Count | Frequency (%) |
| ac | 100509 | |
| cl | 43060 | |
| ca | 7098 | 4.7% |
| pcl | 916 | 0.6% |
| po | 249 | 0.2% |
| cr | 24 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 151607 | |
| A | 107607 | |
| L | 43976 | 14.4% |
| P | 1165 | 0.4% |
| O | 249 | 0.1% |
| R | 24 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 304628 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 151607 | |
| A | 107607 | |
| L | 43976 | 14.4% |
| P | 1165 | 0.4% |
| O | 249 | 0.1% |
| R | 24 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 304628 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 151607 | |
| A | 107607 | |
| L | 43976 | 14.4% |
| P | 1165 | 0.4% |
| O | 249 | 0.1% |
| R | 24 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 304628 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 151607 | |
| A | 107607 | |
| L | 43976 | 14.4% |
| P | 1165 | 0.4% |
| O | 249 | 0.1% |
| R | 24 | < 0.1% |
credacc_transactions_402L
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 93 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2486439 |
| Missing (%) | 94.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5760128016 |
| Minimum | 0 |
|---|---|
| Maximum | 147 |
| Zeros | 134552 |
| Zeros (%) | 5.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 3 |
| Maximum | 147 |
| Range | 147 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.226973612 |
|---|---|
| Coefficient of variation (CV) | 5.602260233 |
| Kurtosis | 273.1004518 |
| Mean | 0.5760128016 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 13.22151442 |
| Sum | 87471 |
| Variance | 10.41335869 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 134552 | 5.1% |
| 1 | 6112 | 0.2% |
| 2 | 2671 | 0.1% |
| 3 | 2561 | 0.1% |
| 4 | 1221 | < 0.1% |
| 5 | 851 | < 0.1% |
| 6 | 560 | < 0.1% |
| 7 | 473 | < 0.1% |
| 8 | 368 | < 0.1% |
| 9 | 290 | < 0.1% |
| Other values (83) | 2197 | 0.1% |
| (Missing) | 2486439 |
| Value | Count | Frequency (%) |
| 0 | 134552 | |
| 1 | 6112 | 0.2% |
| 2 | 2671 | 0.1% |
| 3 | 2561 | 0.1% |
| 4 | 1221 | < 0.1% |
| Value | Count | Frequency (%) |
| 147 | 1 | |
| 135 | 1 | |
| 119 | 1 | |
| 118 | 1 | |
| 117 | 1 |
credamount_590A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 165672 |
|---|---|
| Distinct (%) | 6.5% |
| Missing | 78869 |
| Missing (%) | 3.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42985.43599 |
| Minimum | 0 |
|---|---|
| Maximum | 1000000 |
| Zeros | 73846 |
| Zeros (%) | 2.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4178 |
| Q1 | 14236 |
| median | 29978 |
| Q3 | 59087.7005 |
| 95-th percentile | 127199.85 |
| Maximum | 1000000 |
| Range | 1000000 |
| Interquartile range (IQR) | 44851.7005 |
Descriptive statistics
| Standard deviation | 45796.84328 |
|---|---|
| Coefficient of variation (CV) | 1.065403717 |
| Kurtosis | 14.53568459 |
| Mean | 42985.43599 |
| Median Absolute Deviation (MAD) | 17980 |
| Skewness | 2.97161691 |
| Sum | 1.100180425 × 1011 |
| Variance | 2097350854 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100000 | 137768 | 5.2% |
| 60000 | 134159 | 5.1% |
| 40000 | 127555 | 4.8% |
| 20000 | 122036 | 4.6% |
| 30000 | 98139 | 3.7% |
| 0 | 73846 | 2.8% |
| 50000 | 46978 | 1.8% |
| 10000 | 38675 | 1.5% |
| 200000 | 34783 | 1.3% |
| 80000 | 34107 | 1.3% |
| Other values (165662) | 1711380 | |
| (Missing) | 78869 | 3.0% |
| Value | Count | Frequency (%) |
| 0 | 73846 | |
| 0.2 | 410 | < 0.1% |
| 0.8 | 2 | < 0.1% |
| 1.2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1000000 | 7 | |
| 950000 | 1 | < 0.1% |
| 900000 | 1 | < 0.1% |
| 800000 | 3 | |
| 700000 | 1 | < 0.1% |
credtype_587L
Text
MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 78869 |
| Missing (%) | 3.0% |
| Memory size | 20.1 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 7678278 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CAL |
|---|---|
| 2nd row | CAL |
| 3rd row | CAL |
| 4th row | COL |
| 5th row | COL |
| Value | Count | Frequency (%) |
| col | 1249129 | |
| cal | 1097416 | |
| rel | 212881 | 8.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| L | 2559426 | |
| C | 2346545 | |
| O | 1249129 | |
| A | 1097416 | |
| R | 212881 | 2.8% |
| E | 212881 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7678278 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 2559426 | |
| C | 2346545 | |
| O | 1249129 | |
| A | 1097416 | |
| R | 212881 | 2.8% |
| E | 212881 | 2.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7678278 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 2559426 | |
| C | 2346545 | |
| O | 1249129 | |
| A | 1097416 | |
| R | 212881 | 2.8% |
| E | 212881 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7678278 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| L | 2559426 | |
| C | 2346545 | |
| O | 1249129 | |
| A | 1097416 | |
| R | 212881 | 2.8% |
| E | 212881 | 2.8% |
currdebt_94A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 201408 |
|---|---|
| Distinct (%) | 12.1% |
| Missing | 976135 |
| Missing (%) | 37.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5301.262335 |
| Minimum | 0 |
|---|---|
| Maximum | 482980.84 |
| Zeros | 1441963 |
| Zeros (%) | 54.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 35702.45575 |
| Maximum | 482980.84 |
| Range | 482980.84 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 20463.6821 |
|---|---|
| Coefficient of variation (CV) | 3.860152696 |
| Kurtosis | 45.25006491 |
| Mean | 5301.262335 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.880501606 |
| Sum | 8811546203 |
| Variance | 418762285 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1441963 | |
| 100000 | 96 | < 0.1% |
| 9998 | 91 | < 0.1% |
| 11998 | 78 | < 0.1% |
| 17998 | 68 | < 0.1% |
| 7998 | 66 | < 0.1% |
| 19998 | 63 | < 0.1% |
| 5998 | 59 | < 0.1% |
| 60000 | 57 | < 0.1% |
| 13998 | 55 | < 0.1% |
| Other values (201398) | 219564 | 8.3% |
| (Missing) | 976135 |
| Value | Count | Frequency (%) |
| 0 | 1441963 | |
| 0.002 | 1 | < 0.1% |
| 0.006 | 1 | < 0.1% |
| 0.020000001 | 1 | < 0.1% |
| 0.06600001 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 482980.84 | 1 | |
| 476617.3 | 1 | |
| 473946.3 | 1 | |
| 459136.4 | 1 | |
| 458601.6 | 1 |
MISSING 
| Distinct | 4211 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1297051 |
| Missing (%) | 49.2% |
| Memory size | 20.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 13412440 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 51 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2019-11-06 |
|---|---|
| 2nd row | 2019-09-19 |
| 3rd row | 2019-10-21 |
| 4th row | 2019-12-06 |
| 5th row | 2019-11-05 |
| Value | Count | Frequency (%) |
| 2019-01-31 | 2342 | 0.2% |
| 2019-10-23 | 2070 | 0.2% |
| 2019-10-29 | 1955 | 0.1% |
| 2019-10-21 | 1954 | 0.1% |
| 2020-01-08 | 1937 | 0.1% |
| 2019-12-11 | 1933 | 0.1% |
| 2019-10-28 | 1892 | 0.1% |
| 2020-01-02 | 1863 | 0.1% |
| 2020-01-03 | 1850 | 0.1% |
| 2019-11-01 | 1837 | 0.1% |
| Other values (4201) | 1321611 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3156125 | |
| - | 2682488 | |
| 1 | 2442990 | |
| 2 | 2273853 | |
| 9 | 579514 | 4.3% |
| 8 | 527963 | 3.9% |
| 7 | 429076 | 3.2% |
| 3 | 364959 | 2.7% |
| 6 | 355623 | 2.7% |
| 5 | 301692 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10729952 | |
| Dash Punctuation | 2682488 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3156125 | |
| 1 | 2442990 | |
| 2 | 2273853 | |
| 9 | 579514 | 5.4% |
| 8 | 527963 | 4.9% |
| 7 | 429076 | 4.0% |
| 3 | 364959 | 3.4% |
| 6 | 355623 | 3.3% |
| 5 | 301692 | 2.8% |
| 4 | 298157 | 2.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2682488 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13412440 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3156125 | |
| - | 2682488 | |
| 1 | 2442990 | |
| 2 | 2273853 | |
| 9 | 579514 | 4.3% |
| 8 | 527963 | 3.9% |
| 7 | 429076 | 3.2% |
| 3 | 364959 | 2.7% |
| 6 | 355623 | 2.7% |
| 5 | 301692 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13412440 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3156125 | |
| - | 2682488 | |
| 1 | 2442990 | |
| 2 | 2273853 | |
| 9 | 579514 | 4.3% |
| 8 | 527963 | 3.9% |
| 7 | 429076 | 3.2% |
| 3 | 364959 | 2.7% |
| 6 | 355623 | 2.7% |
| 5 | 301692 | 2.2% |
district_544M
Text
| Distinct | 1030 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.1 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 10.29188358 |
| Min length | 8 |
Characters and Unicode
| Total characters | 27153025 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 134 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | P147_6_101 |
|---|---|
| 2nd row | P111_148_100 |
| 3rd row | a55475b1 |
| 4th row | P19_11_176 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 391791 | 14.9% |
| p131_33_167 | 101299 | 3.8% |
| p123_6_84 | 92997 | 3.5% |
| p197_47_166 | 70453 | 2.7% |
| p204_99_158 | 66400 | 2.5% |
| p98_137_111 | 53077 | 2.0% |
| p62_144_102 | 49284 | 1.9% |
| p159_143_123 | 48525 | 1.8% |
| p111_135_181 | 48495 | 1.8% |
| p147_21_170 | 47763 | 1.8% |
| Other values (1020) | 1668211 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5833743 | |
| _ | 4493008 | |
| 5 | 2412231 | |
| P | 2246502 | 8.3% |
| 7 | 2104439 | 7.8% |
| 4 | 1805523 | 6.6% |
| 6 | 1388964 | 5.1% |
| 3 | 1362572 | 5.0% |
| 2 | 1336975 | 4.9% |
| 8 | 1244108 | 4.6% |
| Other values (8) | 2924960 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 19629925 | |
| Connector Punctuation | 4493008 | 16.5% |
| Uppercase Letter | 2246504 | 8.3% |
| Lowercase Letter | 783588 | 2.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5833743 | |
| 5 | 2412231 | |
| 7 | 2104439 | 10.7% |
| 4 | 1805523 | 9.2% |
| 6 | 1388964 | 7.1% |
| 3 | 1362572 | 6.9% |
| 2 | 1336975 | 6.8% |
| 8 | 1244108 | 6.3% |
| 9 | 1231948 | 6.3% |
| 0 | 909422 | 4.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 391791 | |
| b | 391791 | |
| t | 2 | < 0.1% |
| h | 2 | < 0.1% |
| e | 2 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2246502 | |
| Q | 2 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4493008 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 24122933 | |
| Latin | 3030092 | 11.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 5833743 | |
| _ | 4493008 | |
| 5 | 2412231 | |
| 7 | 2104439 | 8.7% |
| 4 | 1805523 | 7.5% |
| 6 | 1388964 | 5.8% |
| 3 | 1362572 | 5.6% |
| 2 | 1336975 | 5.5% |
| 8 | 1244108 | 5.2% |
| 9 | 1231948 | 5.1% |
Latin
| Value | Count | Frequency (%) |
| P | 2246502 | |
| a | 391791 | 12.9% |
| b | 391791 | 12.9% |
| Q | 2 | < 0.1% |
| t | 2 | < 0.1% |
| h | 2 | < 0.1% |
| e | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27153025 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 5833743 | |
| _ | 4493008 | |
| 5 | 2412231 | |
| P | 2246502 | 8.3% |
| 7 | 2104439 | 7.8% |
| 4 | 1805523 | 6.6% |
| 6 | 1388964 | 5.1% |
| 3 | 1362572 | 5.0% |
| 2 | 1336975 | 4.9% |
| 8 | 1244108 | 4.6% |
| Other values (8) | 2924960 |
downpmt_134A
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 13796 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 78869 |
| Missing (%) | 3.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 388.9884686 |
| Minimum | 0 |
|---|---|
| Maximum | 420400 |
| Zeros | 2348164 |
| Zeros (%) | 89.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1778 |
| Maximum | 420400 |
| Range | 420400 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2614.634679 |
|---|---|
| Coefficient of variation (CV) | 6.721625163 |
| Kurtosis | 1085.921984 |
| Mean | 388.9884686 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 21.07123108 |
| Sum | 995587200.2 |
| Variance | 6836314.502 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2348164 | |
| 2000 | 25129 | 1.0% |
| 4000 | 16373 | 0.6% |
| 1000 | 15180 | 0.6% |
| 6000 | 8923 | 0.3% |
| 10000 | 7809 | 0.3% |
| 200 | 7538 | 0.3% |
| 400 | 6910 | 0.3% |
| 3000 | 6440 | 0.2% |
| 8000 | 4928 | 0.2% |
| Other values (13786) | 112032 | 4.2% |
| (Missing) | 78869 | 3.0% |
| Value | Count | Frequency (%) |
| 0 | 2348164 | |
| 0.2 | 198 | < 0.1% |
| 0.4 | 27 | < 0.1% |
| 0.6 | 80 | < 0.1% |
| 0.8 | 17 | < 0.1% |
| Value | Count | Frequency (%) |
| 420400 | 1 | |
| 320400 | 1 | |
| 275028 | 2 | |
| 230000 | 2 | |
| 222592.2 | 1 |
dtlastpmt_581D
Text
MISSING 
| Distinct | 2353 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1890009 |
| Missing (%) | 71.6% |
| Memory size | 20.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 7482860 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 179 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2019-12-15 |
|---|---|
| 2nd row | 2019-12-15 |
| 3rd row | 2019-12-26 |
| 4th row | 2019-12-05 |
| 5th row | 2019-12-28 |
| Value | Count | Frequency (%) |
| 2019-09-16 | 25909 | 3.5% |
| 2019-12-17 | 2089 | 0.3% |
| 2019-12-13 | 1475 | 0.2% |
| 2019-12-25 | 1338 | 0.2% |
| 2019-09-19 | 1300 | 0.2% |
| 2019-12-24 | 1251 | 0.2% |
| 2019-12-23 | 1234 | 0.2% |
| 2019-11-15 | 1186 | 0.2% |
| 2020-01-20 | 1179 | 0.2% |
| 2019-12-26 | 1179 | 0.2% |
| Other values (2343) | 710146 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1747905 | |
| - | 1496572 | |
| 2 | 1328877 | |
| 1 | 1267565 | |
| 9 | 415093 | 5.5% |
| 8 | 311251 | 4.2% |
| 7 | 255209 | 3.4% |
| 6 | 238229 | 3.2% |
| 3 | 164963 | 2.2% |
| 5 | 135974 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5986288 | |
| Dash Punctuation | 1496572 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1747905 | |
| 2 | 1328877 | |
| 1 | 1267565 | |
| 9 | 415093 | 6.9% |
| 8 | 311251 | 5.2% |
| 7 | 255209 | 4.3% |
| 6 | 238229 | 4.0% |
| 3 | 164963 | 2.8% |
| 5 | 135974 | 2.3% |
| 4 | 121222 | 2.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1496572 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7482860 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1747905 | |
| - | 1496572 | |
| 2 | 1328877 | |
| 1 | 1267565 | |
| 9 | 415093 | 5.5% |
| 8 | 311251 | 4.2% |
| 7 | 255209 | 3.4% |
| 6 | 238229 | 3.2% |
| 3 | 164963 | 2.2% |
| 5 | 135974 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7482860 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1747905 | |
| - | 1496572 | |
| 2 | 1328877 | |
| 1 | 1267565 | |
| 9 | 415093 | 5.5% |
| 8 | 311251 | 4.2% |
| 7 | 255209 | 3.4% |
| 6 | 238229 | 3.2% |
| 3 | 164963 | 2.2% |
| 5 | 135974 | 1.8% |
MISSING 
| Distinct | 2365 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1609466 |
| Missing (%) | 61.0% |
| Memory size | 20.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 10288290 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 174 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2019-12-28 |
|---|---|
| 2nd row | 2019-12-15 |
| 3rd row | 2019-12-19 |
| 4th row | 2019-12-30 |
| 5th row | 2019-12-27 |
| Value | Count | Frequency (%) |
| 2019-09-16 | 26380 | 2.6% |
| 2020-01-20 | 3832 | 0.4% |
| 2019-12-25 | 3506 | 0.3% |
| 2019-12-27 | 3456 | 0.3% |
| 2020-01-01 | 3395 | 0.3% |
| 2019-12-24 | 3365 | 0.3% |
| 2019-12-26 | 3346 | 0.3% |
| 2020-01-24 | 3264 | 0.3% |
| 2019-12-17 | 3250 | 0.3% |
| 2019-12-23 | 3196 | 0.3% |
| Other values (2355) | 971839 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2591633 | |
| - | 2057658 | |
| 2 | 2034311 | |
| 1 | 1541821 | |
| 9 | 523583 | 5.1% |
| 8 | 374790 | 3.6% |
| 7 | 306382 | 3.0% |
| 6 | 286842 | 2.8% |
| 3 | 234640 | 2.3% |
| 5 | 178561 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8230632 | |
| Dash Punctuation | 2057658 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2591633 | |
| 2 | 2034311 | |
| 1 | 1541821 | |
| 9 | 523583 | 6.4% |
| 8 | 374790 | 4.6% |
| 7 | 306382 | 3.7% |
| 6 | 286842 | 3.5% |
| 3 | 234640 | 2.9% |
| 5 | 178561 | 2.2% |
| 4 | 158069 | 1.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2057658 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10288290 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2591633 | |
| - | 2057658 | |
| 2 | 2034311 | |
| 1 | 1541821 | |
| 9 | 523583 | 5.1% |
| 8 | 374790 | 3.6% |
| 7 | 306382 | 3.0% |
| 6 | 286842 | 2.8% |
| 3 | 234640 | 2.3% |
| 5 | 178561 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10288290 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2591633 | |
| - | 2057658 | |
| 2 | 2034311 | |
| 1 | 1541821 | |
| 9 | 523583 | 5.1% |
| 8 | 374790 | 3.6% |
| 7 | 306382 | 3.0% |
| 6 | 286842 | 2.8% |
| 3 | 234640 | 2.3% |
| 5 | 178561 | 1.7% |
education_1138M
Text
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.1 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 8 |
| Mean length | 9.101808175 |
| Min length | 8 |
Characters and Unicode
| Total characters | 24013255 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | a55475b1 |
|---|---|
| 2nd row | a55475b1 |
| 3rd row | P97_36_170 |
| 4th row | a55475b1 |
| 5th row | P97_36_170 |
| Value | Count | Frequency (%) |
| a55475b1 | 1379355 | |
| p97_36_170 | 852757 | |
| p33_146_175 | 370707 | 14.1% |
| p106_81_188 | 17178 | 0.7% |
| p17_36_170 | 17168 | 0.7% |
| p157_18_172 | 1130 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 4509902 | |
| 7 | 3492172 | |
| 1 | 3062786 | |
| _ | 2517880 | |
| 4 | 1750062 | 7.3% |
| 3 | 1611339 | 6.7% |
| a | 1379355 | 5.7% |
| b | 1379355 | 5.7% |
| P | 1258940 | 5.2% |
| 6 | 1257810 | 5.2% |
| Other values (4) | 1793654 | 7.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17477725 | |
| Lowercase Letter | 2758710 | 11.5% |
| Connector Punctuation | 2517880 | 10.5% |
| Uppercase Letter | 1258940 | 5.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 4509902 | |
| 7 | 3492172 | |
| 1 | 3062786 | |
| 4 | 1750062 | 10.0% |
| 3 | 1611339 | 9.2% |
| 6 | 1257810 | 7.2% |
| 0 | 887103 | 5.1% |
| 9 | 852757 | 4.9% |
| 8 | 52664 | 0.3% |
| 2 | 1130 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1379355 | |
| b | 1379355 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2517880 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1258940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19995605 | |
| Latin | 4017650 | 16.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 4509902 | |
| 7 | 3492172 | |
| 1 | 3062786 | |
| _ | 2517880 | |
| 4 | 1750062 | 8.8% |
| 3 | 1611339 | 8.1% |
| 6 | 1257810 | 6.3% |
| 0 | 887103 | 4.4% |
| 9 | 852757 | 4.3% |
| 8 | 52664 | 0.3% |
Latin
| Value | Count | Frequency (%) |
| a | 1379355 | |
| b | 1379355 | |
| P | 1258940 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24013255 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 4509902 | |
| 7 | 3492172 | |
| 1 | 3062786 | |
| _ | 2517880 | |
| 4 | 1750062 | 7.3% |
| 3 | 1611339 | 6.7% |
| a | 1379355 | 5.7% |
| b | 1379355 | 5.7% |
| P | 1258940 | 5.2% |
| 6 | 1257810 | 5.2% |
| Other values (4) | 1793654 | 7.5% |
MISSING 
| Distinct | 8343 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 1705609 |
| Missing (%) | 64.6% |
| Memory size | 20.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 9326860 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1996 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 2014-01-15 |
|---|---|
| 2nd row | 2013-04-15 |
| 3rd row | 2013-04-15 |
| 4th row | 2012-02-15 |
| 5th row | 2018-01-15 |
| Value | Count | Frequency (%) |
| 2017-01-15 | 16608 | 1.8% |
| 2015-01-15 | 15436 | 1.7% |
| 2013-01-15 | 15290 | 1.6% |
| 2014-01-15 | 15105 | 1.6% |
| 2016-01-15 | 15074 | 1.6% |
| 2012-01-15 | 13022 | 1.4% |
| 2018-01-15 | 11930 | 1.3% |
| 2010-01-15 | 10067 | 1.1% |
| 2010-09-15 | 9608 | 1.0% |
| 2011-01-15 | 9585 | 1.0% |
| Other values (8333) | 800961 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2123405 | |
| 1 | 1975825 | |
| - | 1865372 | |
| 2 | 1075472 | |
| 5 | 1043024 | |
| 9 | 389836 | 4.2% |
| 8 | 191653 | 2.1% |
| 6 | 171491 | 1.8% |
| 3 | 166811 | 1.8% |
| 4 | 164425 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7461488 | |
| Dash Punctuation | 1865372 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2123405 | |
| 1 | 1975825 | |
| 2 | 1075472 | |
| 5 | 1043024 | |
| 9 | 389836 | 5.2% |
| 8 | 191653 | 2.6% |
| 6 | 171491 | 2.3% |
| 3 | 166811 | 2.2% |
| 4 | 164425 | 2.2% |
| 7 | 159546 | 2.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1865372 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9326860 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2123405 | |
| 1 | 1975825 | |
| - | 1865372 | |
| 2 | 1075472 | |
| 5 | 1043024 | |
| 9 | 389836 | 4.2% |
| 8 | 191653 | 2.1% |
| 6 | 171491 | 1.8% |
| 3 | 166811 | 1.8% |
| 4 | 164425 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9326860 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2123405 | |
| 1 | 1975825 | |
| - | 1865372 | |
| 2 | 1075472 | |
| 5 | 1043024 | |
| 9 | 389836 | 4.2% |
| 8 | 191653 | 2.1% |
| 6 | 171491 | 1.8% |
| 3 | 166811 | 1.8% |
| 4 | 164425 | 1.8% |
familystate_726L
Text
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1148691 |
| Missing (%) | 43.5% |
| Memory size | 20.1 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 7 |
| Mean length | 7.067301108 |
| Min length | 6 |
Characters and Unicode
| Total characters | 10527480 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MARRIED |
|---|---|
| 2nd row | SINGLE |
| 3rd row | SINGLE |
| 4th row | MARRIED |
| 5th row | MARRIED |
| Value | Count | Frequency (%) |
| married | 1082575 | |
| single | 201743 | 13.5% |
| widowed | 138790 | 9.3% |
| divorced | 45087 | 3.0% |
| living_with_partner | 21409 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 2253055 | |
| I | 1532422 | |
| E | 1489604 | |
| D | 1450329 | |
| A | 1103984 | |
| M | 1082575 | |
| W | 298989 | 2.8% |
| N | 244561 | 2.3% |
| L | 223152 | 2.1% |
| G | 223152 | 2.1% |
| Other values (8) | 625657 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10484662 | |
| Connector Punctuation | 42818 | 0.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 2253055 | |
| I | 1532422 | |
| E | 1489604 | |
| D | 1450329 | |
| A | 1103984 | |
| M | 1082575 | |
| W | 298989 | 2.9% |
| N | 244561 | 2.3% |
| L | 223152 | 2.1% |
| G | 223152 | 2.1% |
| Other values (7) | 582839 | 5.6% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 42818 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10484662 | |
| Common | 42818 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 2253055 | |
| I | 1532422 | |
| E | 1489604 | |
| D | 1450329 | |
| A | 1103984 | |
| M | 1082575 | |
| W | 298989 | 2.9% |
| N | 244561 | 2.3% |
| L | 223152 | 2.1% |
| G | 223152 | 2.1% |
| Other values (7) | 582839 | 5.6% |
Common
| Value | Count | Frequency (%) |
| _ | 42818 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10527480 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 2253055 | |
| I | 1532422 | |
| E | 1489604 | |
| D | 1450329 | |
| A | 1103984 | |
| M | 1082575 | |
| W | 298989 | 2.8% |
| N | 244561 | 2.3% |
| L | 223152 | 2.1% |
| G | 223152 | 2.1% |
| Other values (8) | 625657 | 5.9% |
firstnonzeroinstldate_307D
Text
MISSING 
| Distinct | 5153 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 287307 |
| Missing (%) | 10.9% |
| Memory size | 20.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 23509880 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 25 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2018-12-20 |
|---|---|
| 2nd row | 2020-01-26 |
| 3rd row | 2014-08-17 |
| 4th row | 2017-09-21 |
| 5th row | 2015-01-28 |
| Value | Count | Frequency (%) |
| 2019-12-15 | 5044 | 0.2% |
| 2019-03-14 | 4785 | 0.2% |
| 2019-09-15 | 4775 | 0.2% |
| 2020-03-15 | 4335 | 0.2% |
| 2019-10-15 | 4115 | 0.2% |
| 2020-01-15 | 3797 | 0.2% |
| 2020-02-15 | 3791 | 0.2% |
| 2019-10-12 | 3728 | 0.2% |
| 2019-07-15 | 3722 | 0.2% |
| 2020-01-11 | 3704 | 0.2% |
| Other values (5143) | 2309192 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5625459 | |
| - | 4701976 | |
| 1 | 4129869 | |
| 2 | 4102127 | |
| 9 | 990127 | 4.2% |
| 8 | 855287 | 3.6% |
| 7 | 710419 | 3.0% |
| 5 | 680403 | 2.9% |
| 3 | 621076 | 2.6% |
| 6 | 578670 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18807904 | |
| Dash Punctuation | 4701976 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5625459 | |
| 1 | 4129869 | |
| 2 | 4102127 | |
| 9 | 990127 | 5.3% |
| 8 | 855287 | 4.5% |
| 7 | 710419 | 3.8% |
| 5 | 680403 | 3.6% |
| 3 | 621076 | 3.3% |
| 6 | 578670 | 3.1% |
| 4 | 514467 | 2.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4701976 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23509880 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5625459 | |
| - | 4701976 | |
| 1 | 4129869 | |
| 2 | 4102127 | |
| 9 | 990127 | 4.2% |
| 8 | 855287 | 3.6% |
| 7 | 710419 | 3.0% |
| 5 | 680403 | 2.9% |
| 3 | 621076 | 2.6% |
| 6 | 578670 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23509880 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5625459 | |
| - | 4701976 | |
| 1 | 4129869 | |
| 2 | 4102127 | |
| 9 | 990127 | 4.2% |
| 8 | 855287 | 3.6% |
| 7 | 710419 | 3.0% |
| 5 | 680403 | 2.9% |
| 3 | 621076 | 2.6% |
| 6 | 578670 | 2.5% |
MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 78869 |
| Missing (%) | 3.0% |
| Memory size | 20.1 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.428913749 |
| Min length | 3 |
Characters and Unicode
| Total characters | 8776051 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CASH |
|---|---|
| 2nd row | CASH |
| 3rd row | CASH |
| 4th row | POS |
| 5th row | POS |
| Value | Count | Frequency (%) |
| pos | 1334600 | |
| cash | 1097773 | |
| ndf | 127053 | 5.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2432373 | |
| P | 1334600 | |
| O | 1334600 | |
| C | 1097773 | |
| A | 1097773 | |
| H | 1097773 | |
| N | 127053 | 1.4% |
| D | 127053 | 1.4% |
| F | 127053 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8776051 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2432373 | |
| P | 1334600 | |
| O | 1334600 | |
| C | 1097773 | |
| A | 1097773 | |
| H | 1097773 | |
| N | 127053 | 1.4% |
| D | 127053 | 1.4% |
| F | 127053 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8776051 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2432373 | |
| P | 1334600 | |
| O | 1334600 | |
| C | 1097773 | |
| A | 1097773 | |
| H | 1097773 | |
| N | 127053 | 1.4% |
| D | 127053 | 1.4% |
| F | 127053 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8776051 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 2432373 | |
| P | 1334600 | |
| O | 1334600 | |
| C | 1097773 | |
| A | 1097773 | |
| H | 1097773 | |
| N | 127053 | 1.4% |
| D | 127053 | 1.4% |
| F | 127053 | 1.4% |
isbidproduct_390L
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31 |
| Missing (%) | < 0.1% |
| Memory size | 20.1 MiB |
| False | |
|---|---|
| True | 144802 |
| (Missing) | 31 |
| Value | Count | Frequency (%) |
| False | 2493462 | |
| True | 144802 | 5.5% |
| (Missing) | 31 | < 0.1% |
isdebitcard_527L
Boolean
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2425416 |
| Missing (%) | 91.9% |
| Memory size | 20.1 MiB |
| False | 146098 |
|---|---|
| True | 66781 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 146098 | 5.5% |
| True | 66781 | 2.5% |
| (Missing) | 2425416 |
mainoccupationinc_437A
Real number (ℝ)
MISSING 
| Distinct | 17340 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 65371 |
| Missing (%) | 2.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43046.11571 |
| Minimum | 0 |
|---|---|
| Maximum | 199600 |
| Zeros | 32 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6315.8003 |
| Q1 | 20000 |
| median | 37000 |
| Q3 | 58000 |
| 95-th percentile | 100000 |
| Maximum | 199600 |
| Range | 199600 |
| Interquartile range (IQR) | 38000 |
Descriptive statistics
| Standard deviation | 32550.06984 |
|---|---|
| Coefficient of variation (CV) | 0.7561674102 |
| Kurtosis | 4.670832521 |
| Mean | 43046.11571 |
| Median Absolute Deviation (MAD) | 18000 |
| Skewness | 1.769823701 |
| Sum | 1.107543842 × 1011 |
| Variance | 1059507046 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40000 | 186417 | 7.1% |
| 30000 | 183448 | 7.0% |
| 50000 | 162276 | 6.2% |
| 60000 | 135382 | 5.1% |
| 20000 | 91145 | 3.5% |
| 70000 | 87299 | 3.3% |
| 24000 | 79960 | 3.0% |
| 36000 | 76607 | 2.9% |
| 80000 | 47766 | 1.8% |
| 100000 | 45645 | 1.7% |
| Other values (17330) | 1476979 | |
| (Missing) | 65371 | 2.5% |
| Value | Count | Frequency (%) |
| 0 | 32 | |
| 0.2 | 51 | |
| 0.4 | 6 | < 0.1% |
| 0.6 | 10 | < 0.1% |
| 0.8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 199600 | 12883 | |
| 199400 | 8 | < 0.1% |
| 199200 | 5 | < 0.1% |
| 199120 | 1 | < 0.1% |
| 199000 | 20 | < 0.1% |
maxdpdtolerance_577P
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 3209 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1278326 |
| Missing (%) | 48.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.78874518 |
| Minimum | 0 |
|---|---|
| Maximum | 4362 |
| Zeros | 986233 |
| Zeros (%) | 37.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 18 |
| Maximum | 4362 |
| Range | 4362 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 158.135784 |
|---|---|
| Coefficient of variation (CV) | 9.419154454 |
| Kurtosis | 263.3173287 |
| Mean | 16.78874518 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 14.91851137 |
| Sum | 22832173 |
| Variance | 25006.92618 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 986233 | |
| 1 | 191671 | 7.3% |
| 5 | 25367 | 1.0% |
| 6 | 19116 | 0.7% |
| 10 | 14207 | 0.5% |
| 4 | 8844 | 0.3% |
| 9 | 8395 | 0.3% |
| 14 | 7146 | 0.3% |
| 18 | 6781 | 0.3% |
| 7 | 6239 | 0.2% |
| Other values (3199) | 85970 | 3.3% |
| (Missing) | 1278326 |
| Value | Count | Frequency (%) |
| 0 | 986233 | |
| 1 | 191671 | 7.3% |
| 2 | 6033 | 0.2% |
| 3 | 1935 | 0.1% |
| 4 | 8844 | 0.3% |
| Value | Count | Frequency (%) |
| 4362 | 1 | |
| 4245 | 2 | |
| 4222 | 1 | |
| 4206 | 1 | |
| 4185 | 1 |
num_group1
Real number (ℝ)
ZEROS 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.613015603 |
| Minimum | 0 |
|---|---|
| Maximum | 19 |
| Zeros | 438525 |
| Zeros (%) | 16.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 7 |
| 95-th percentile | 14 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.485514586 |
|---|---|
| Coefficient of variation (CV) | 0.9723605927 |
| Kurtosis | 0.7244665374 |
| Mean | 4.613015603 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.163952685 |
| Sum | 12170496 |
| Variance | 20.1198411 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 438525 | |
| 1 | 369409 | |
| 2 | 309947 | |
| 3 | 259106 | |
| 4 | 216484 | |
| 5 | 180462 | |
| 6 | 150620 | 5.7% |
| 7 | 125780 | 4.8% |
| 8 | 105118 | 4.0% |
| 9 | 88418 | 3.4% |
| Other values (10) | 394426 |
| Value | Count | Frequency (%) |
| 0 | 438525 | |
| 1 | 369409 | |
| 2 | 309947 | |
| 3 | 259106 | |
| 4 | 216484 |
| Value | Count | Frequency (%) |
| 19 | 17541 | |
| 18 | 20372 | |
| 17 | 23687 | |
| 16 | 27647 | |
| 15 | 32524 |
outstandingdebt_522A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 179739 |
|---|---|
| Distinct (%) | 10.8% |
| Missing | 980346 |
| Missing (%) | 37.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7097.727649 |
| Minimum | 0 |
|---|---|
| Maximum | 1029392.8 |
| Zeros | 1436446 |
| Zeros (%) | 54.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 40964.368 |
| Maximum | 1029392.8 |
| Range | 1029392.8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 30871.45589 |
|---|---|
| Coefficient of variation (CV) | 4.349484429 |
| Kurtosis | 79.30366253 |
| Mean | 7097.727649 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.504083064 |
| Sum | 1.176767046 × 1010 |
| Variance | 953046788.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1436446 | |
| 10 | 286 | < 0.1% |
| 9998 | 92 | < 0.1% |
| 11998 | 72 | < 0.1% |
| 17998 | 64 | < 0.1% |
| 20 | 61 | < 0.1% |
| 7998 | 60 | < 0.1% |
| 19998 | 59 | < 0.1% |
| 5998 | 58 | < 0.1% |
| 8998 | 52 | < 0.1% |
| Other values (179729) | 220699 | 8.4% |
| (Missing) | 980346 |
| Value | Count | Frequency (%) |
| 0 | 1436446 | |
| 0.002 | 1 | < 0.1% |
| 0.004 | 1 | < 0.1% |
| 0.006 | 1 | < 0.1% |
| 0.008 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1029392.8 | 1 | |
| 987535 | 1 | |
| 984399 | 1 | |
| 978072.2 | 1 | |
| 910766 | 1 |
pmtnum_8L
Real number (ℝ)
MISSING 
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 238987 |
| Missing (%) | 9.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.85803407 |
| Minimum | 3 |
|---|---|
| Maximum | 63 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 9 |
| median | 12 |
| Q3 | 24 |
| 95-th percentile | 36 |
| Maximum | 63 |
| Range | 60 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 11.25158885 |
|---|---|
| Coefficient of variation (CV) | 0.6674318494 |
| Kurtosis | 1.295980682 |
| Mean | 16.85803407 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.211121972 |
| Sum | 40447616 |
| Variance | 126.5982517 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 594080 | |
| 24 | 414664 | |
| 6 | 355258 | |
| 18 | 200921 | 7.6% |
| 36 | 156551 | 5.9% |
| 3 | 117252 | 4.4% |
| 48 | 81251 | 3.1% |
| 16 | 77062 | 2.9% |
| 9 | 51005 | 1.9% |
| 4 | 47083 | 1.8% |
| Other values (50) | 304181 | |
| (Missing) | 238987 |
| Value | Count | Frequency (%) |
| 3 | 117252 | 4.4% |
| 4 | 47083 | 1.8% |
| 5 | 23564 | 0.9% |
| 6 | 355258 | |
| 7 | 4135 | 0.2% |
| Value | Count | Frequency (%) |
| 63 | 3 | < 0.1% |
| 62 | 12 | < 0.1% |
| 61 | 18 | < 0.1% |
| 60 | 12352 | |
| 59 | 1 | < 0.1% |
postype_4733339M
Text
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.1 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 11.24311951 |
| Min length | 8 |
Characters and Unicode
| Total characters | 29662666 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | P46_145_78 |
|---|---|
| 2nd row | P149_40_170 |
| 3rd row | P46_145_78 |
| 4th row | P177_117_192 |
| 5th row | P60_146_156 |
| Value | Count | Frequency (%) |
| p177_117_192 | 1283096 | |
| p46_145_78 | 671053 | |
| p149_40_170 | 260036 | 9.9% |
| p60_146_156 | 200271 | 7.6% |
| p67_102_161 | 175416 | 6.6% |
| p217_110_186 | 30413 | 1.2% |
| p169_115_83 | 13766 | 0.5% |
| p140_48_169 | 3899 | 0.1% |
| a55475b1 | 345 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 7421392 | |
| _ | 5275900 | |
| 7 | 4986551 | |
| P | 2637950 | 8.9% |
| 4 | 2070592 | 7.0% |
| 6 | 1670776 | 5.6% |
| 9 | 1560797 | 5.3% |
| 2 | 1488925 | 5.0% |
| 0 | 930071 | 3.1% |
| 5 | 886125 | 3.0% |
| Other values (4) | 733587 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 21748126 | |
| Connector Punctuation | 5275900 | 17.8% |
| Uppercase Letter | 2637950 | 8.9% |
| Lowercase Letter | 690 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7421392 | |
| 7 | 4986551 | |
| 4 | 2070592 | 9.5% |
| 6 | 1670776 | 7.7% |
| 9 | 1560797 | 7.2% |
| 2 | 1488925 | 6.8% |
| 0 | 930071 | 4.3% |
| 5 | 886125 | 4.1% |
| 8 | 719131 | 3.3% |
| 3 | 13766 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 345 | |
| b | 345 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5275900 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2637950 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 27024026 | |
| Latin | 2638640 | 8.9% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 7421392 | |
| _ | 5275900 | |
| 7 | 4986551 | |
| 4 | 2070592 | 7.7% |
| 6 | 1670776 | 6.2% |
| 9 | 1560797 | 5.8% |
| 2 | 1488925 | 5.5% |
| 0 | 930071 | 3.4% |
| 5 | 886125 | 3.3% |
| 8 | 719131 | 2.7% |
Latin
| Value | Count | Frequency (%) |
| P | 2637950 | |
| a | 345 | < 0.1% |
| b | 345 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29662666 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 7421392 | |
| _ | 5275900 | |
| 7 | 4986551 | |
| P | 2637950 | 8.9% |
| 4 | 2070592 | 7.0% |
| 6 | 1670776 | 5.6% |
| 9 | 1560797 | 5.3% |
| 2 | 1488925 | 5.0% |
| 0 | 930071 | 3.1% |
| 5 | 886125 | 3.0% |
| Other values (4) | 733587 | 2.5% |
profession_152M
Text
| Distinct | 5799 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.1 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.024117091 |
| Min length | 7 |
Characters and Unicode
| Total characters | 21169988 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3891 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | a55475b1 |
|---|---|
| 2nd row | a55475b1 |
| 3rd row | a55475b1 |
| 4th row | a55475b1 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 2613108 | |
| p46_72_80 | 682 | < 0.1% |
| p104_137_180 | 436 | < 0.1% |
| p167_22_171 | 374 | < 0.1% |
| p21_76_53 | 372 | < 0.1% |
| p143_116_69 | 342 | < 0.1% |
| p25_111_112 | 335 | < 0.1% |
| p139_125_64 | 322 | < 0.1% |
| p121_114_58 | 283 | < 0.1% |
| p103_114_185 | 279 | < 0.1% |
| Other values (5789) | 21762 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 7854981 | |
| 1 | 2667040 | 12.6% |
| 7 | 2630879 | 12.4% |
| 4 | 2628479 | 12.4% |
| a | 2613130 | 12.3% |
| b | 2613109 | 12.3% |
| _ | 50374 | 0.2% |
| P | 25146 | 0.1% |
| 2 | 17853 | 0.1% |
| 6 | 17554 | 0.1% |
| Other values (24) | 51443 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15868035 | |
| Lowercase Letter | 5226392 | 24.7% |
| Connector Punctuation | 50374 | 0.2% |
| Uppercase Letter | 25187 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2613130 | |
| b | 2613109 | |
| e | 24 | < 0.1% |
| r | 19 | < 0.1% |
| o | 14 | < 0.1% |
| t | 12 | < 0.1% |
| d | 12 | < 0.1% |
| k | 10 | < 0.1% |
| y | 10 | < 0.1% |
| c | 10 | < 0.1% |
| Other values (11) | 42 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 7854981 | |
| 1 | 2667040 | 16.8% |
| 7 | 2630879 | 16.6% |
| 4 | 2628479 | 16.6% |
| 2 | 17853 | 0.1% |
| 6 | 17554 | 0.1% |
| 3 | 13264 | 0.1% |
| 8 | 13257 | 0.1% |
| 0 | 13197 | 0.1% |
| 9 | 11531 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 25146 | |
| Q | 41 | 0.2% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 50374 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15918409 | |
| Latin | 5251579 | 24.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2613130 | |
| b | 2613109 | |
| P | 25146 | 0.5% |
| Q | 41 | < 0.1% |
| e | 24 | < 0.1% |
| r | 19 | < 0.1% |
| o | 14 | < 0.1% |
| t | 12 | < 0.1% |
| d | 12 | < 0.1% |
| k | 10 | < 0.1% |
| Other values (13) | 62 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 5 | 7854981 | |
| 1 | 2667040 | 16.8% |
| 7 | 2630879 | 16.5% |
| 4 | 2628479 | 16.5% |
| _ | 50374 | 0.3% |
| 2 | 17853 | 0.1% |
| 6 | 17554 | 0.1% |
| 3 | 13264 | 0.1% |
| 8 | 13257 | 0.1% |
| 0 | 13197 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21169988 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 7854981 | |
| 1 | 2667040 | 12.6% |
| 7 | 2630879 | 12.4% |
| 4 | 2628479 | 12.4% |
| a | 2613130 | 12.3% |
| b | 2613109 | 12.3% |
| _ | 50374 | 0.2% |
| P | 25146 | 0.1% |
| 2 | 17853 | 0.1% |
| 6 | 17554 | 0.1% |
| Other values (24) | 51443 | 0.2% |
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.1 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 8 |
| Mean length | 8.730558561 |
| Min length | 8 |
Characters and Unicode
| Total characters | 23033789 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | P198_131_9 |
|---|---|
| 2nd row | P45_84_106 |
| 3rd row | a55475b1 |
| 4th row | P99_56_166 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 1814894 | |
| p99_56_166 | 354478 | 13.4% |
| p94_109_143 | 285561 | 10.8% |
| p198_131_9 | 88231 | 3.3% |
| p45_84_106 | 83707 | 3.2% |
| p48_22_32 | 4446 | 0.2% |
| p30_86_84 | 2041 | 0.1% |
| p121_60_164 | 1378 | 0.1% |
| p196_88_176 | 1347 | 0.1% |
| p52_67_90 | 1240 | < 0.1% |
| Other values (8) | 972 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 5884277 | |
| 1 | 3097659 | |
| 4 | 2561781 | |
| 7 | 1817922 | 7.9% |
| a | 1814894 | 7.9% |
| b | 1814894 | 7.9% |
| _ | 1646802 | 7.1% |
| 9 | 1459810 | 6.3% |
| 6 | 1157112 | 5.0% |
| P | 823401 | 3.6% |
| Other values (4) | 955237 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16933798 | |
| Lowercase Letter | 3629788 | 15.8% |
| Connector Punctuation | 1646802 | 7.1% |
| Uppercase Letter | 823401 | 3.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 5884277 | |
| 1 | 3097659 | |
| 4 | 2561781 | |
| 7 | 1817922 | 10.7% |
| 9 | 1459810 | 8.6% |
| 6 | 1157112 | 6.8% |
| 3 | 380407 | 2.2% |
| 0 | 374229 | 2.2% |
| 8 | 183643 | 1.1% |
| 2 | 16958 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1814894 | |
| b | 1814894 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1646802 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 823401 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18580600 | |
| Latin | 4453189 | 19.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 5884277 | |
| 1 | 3097659 | |
| 4 | 2561781 | |
| 7 | 1817922 | 9.8% |
| _ | 1646802 | 8.9% |
| 9 | 1459810 | 7.9% |
| 6 | 1157112 | 6.2% |
| 3 | 380407 | 2.0% |
| 0 | 374229 | 2.0% |
| 8 | 183643 | 1.0% |
Latin
| Value | Count | Frequency (%) |
| a | 1814894 | |
| b | 1814894 | |
| P | 823401 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23033789 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 5884277 | |
| 1 | 3097659 | |
| 4 | 2561781 | |
| 7 | 1817922 | 7.9% |
| a | 1814894 | 7.9% |
| b | 1814894 | 7.9% |
| _ | 1646802 | 7.1% |
| 9 | 1459810 | 6.3% |
| 6 | 1157112 | 5.0% |
| P | 823401 | 3.6% |
| Other values (4) | 955237 | 4.1% |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.1 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 8 |
| Mean length | 8.792563758 |
| Min length | 8 |
Characters and Unicode
| Total characters | 23197377 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | P94_109_143 |
|---|---|
| 2nd row | P94_109_143 |
| 3rd row | a55475b1 |
| 4th row | P94_109_143 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 1889911 | |
| p94_109_143 | 654356 | 24.8% |
| p30_86_84 | 48409 | 1.8% |
| p52_67_90 | 18027 | 0.7% |
| p69_72_116 | 12955 | 0.5% |
| p129_162_80 | 8320 | 0.3% |
| p84_14_61 | 2885 | 0.1% |
| p64_121_167 | 1849 | 0.1% |
| p19_25_34 | 761 | < 0.1% |
| p5_143_178 | 612 | < 0.1% |
| Other values (4) | 210 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 5689545 | |
| 4 | 3256030 | |
| 1 | 3254895 | |
| 7 | 1923354 | 8.3% |
| a | 1889911 | 8.1% |
| b | 1889911 | 8.1% |
| _ | 1496768 | 6.5% |
| 9 | 1348782 | 5.8% |
| P | 748384 | 3.2% |
| 0 | 729319 | 3.1% |
| Other values (4) | 970478 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17172403 | |
| Lowercase Letter | 3779822 | 16.3% |
| Connector Punctuation | 1496768 | 6.5% |
| Uppercase Letter | 748384 | 3.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 5689545 | |
| 4 | 3256030 | |
| 1 | 3254895 | |
| 7 | 1923354 | 11.2% |
| 9 | 1348782 | 7.9% |
| 0 | 729319 | 4.2% |
| 3 | 704345 | 4.1% |
| 8 | 108638 | 0.6% |
| 6 | 107252 | 0.6% |
| 2 | 50243 | 0.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1889911 | |
| b | 1889911 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1496768 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 748384 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18669171 | |
| Latin | 4528206 | 19.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 5689545 | |
| 4 | 3256030 | |
| 1 | 3254895 | |
| 7 | 1923354 | 10.3% |
| _ | 1496768 | 8.0% |
| 9 | 1348782 | 7.2% |
| 0 | 729319 | 3.9% |
| 3 | 704345 | 3.8% |
| 8 | 108638 | 0.6% |
| 6 | 107252 | 0.6% |
Latin
| Value | Count | Frequency (%) |
| a | 1889911 | |
| b | 1889911 | |
| P | 748384 | 16.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23197377 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 5689545 | |
| 4 | 3256030 | |
| 1 | 3254895 | |
| 7 | 1923354 | 8.3% |
| a | 1889911 | 8.1% |
| b | 1889911 | 8.1% |
| _ | 1496768 | 6.5% |
| 9 | 1348782 | 5.8% |
| P | 748384 | 3.2% |
| 0 | 729319 | 3.1% |
| Other values (4) | 970478 | 4.2% |
revolvingaccount_394A
Real number (ℝ)
MISSING 
| Distinct | 52406 |
|---|---|
| Distinct (%) | 37.4% |
| Missing | 2498196 |
| Missing (%) | 94.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 761933430.7 |
| Minimum | 540342340 |
|---|---|
| Maximum | 800608700 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 540342340 |
|---|---|
| 5-th percentile | 685001930 |
| Q1 | 760163480 |
| median | 780311500 |
| Q3 | 780789950 |
| 95-th percentile | 800253630 |
| Maximum | 800608700 |
| Range | 260266360 |
| Interquartile range (IQR) | 20626470 |
Descriptive statistics
| Standard deviation | 47558807.47 |
|---|---|
| Coefficient of variation (CV) | 0.06241858613 |
| Kurtosis | 10.27654451 |
| Mean | 761933430.7 |
| Median Absolute Deviation (MAD) | 19745840 |
| Skewness | -3.060067244 |
| Sum | 1.067461117 × 1014 |
| Variance | 2.261840168 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 780784600 | 29 | < 0.1% |
| 780851400 | 27 | < 0.1% |
| 800146900 | 25 | < 0.1% |
| 780783040 | 24 | < 0.1% |
| 780851650 | 24 | < 0.1% |
| 780661440 | 23 | < 0.1% |
| 780826560 | 23 | < 0.1% |
| 780561100 | 23 | < 0.1% |
| 780826300 | 22 | < 0.1% |
| 780621760 | 22 | < 0.1% |
| Other values (52396) | 139857 | 5.3% |
| (Missing) | 2498196 |
| Value | Count | Frequency (%) |
| 540342340 | 1 | < 0.1% |
| 540342460 | 2 | |
| 540342500 | 3 | |
| 540342600 | 2 | |
| 540342660 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 800608700 | 1 | |
| 800608100 | 1 | |
| 800607550 | 1 | |
| 800607500 | 1 | |
| 800607400 | 1 |
status_219L
Text
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31 |
| Missing (%) | < 0.1% |
| Memory size | 20.1 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2638264 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | D |
|---|---|
| 2nd row | D |
| 3rd row | D |
| 4th row | D |
| 5th row | D |
| Value | Count | Frequency (%) |
| d | 1104119 | |
| k | 1053357 | |
| a | 284608 | 10.8% |
| t | 177685 | 6.7% |
| n | 14762 | 0.6% |
| q | 2711 | 0.1% |
| l | 489 | < 0.1% |
| s | 302 | < 0.1% |
| h | 196 | < 0.1% |
| p | 20 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 1104119 | |
| K | 1053357 | |
| A | 284608 | 10.8% |
| T | 177685 | 6.7% |
| N | 14762 | 0.6% |
| Q | 2711 | 0.1% |
| L | 489 | < 0.1% |
| S | 302 | < 0.1% |
| H | 196 | < 0.1% |
| P | 20 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2638264 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1104119 | |
| K | 1053357 | |
| A | 284608 | 10.8% |
| T | 177685 | 6.7% |
| N | 14762 | 0.6% |
| Q | 2711 | 0.1% |
| L | 489 | < 0.1% |
| S | 302 | < 0.1% |
| H | 196 | < 0.1% |
| P | 20 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2638264 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| D | 1104119 | |
| K | 1053357 | |
| A | 284608 | 10.8% |
| T | 177685 | 6.7% |
| N | 14762 | 0.6% |
| Q | 2711 | 0.1% |
| L | 489 | < 0.1% |
| S | 302 | < 0.1% |
| H | 196 | < 0.1% |
| P | 20 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2638264 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| D | 1104119 | |
| K | 1053357 | |
| A | 284608 | 10.8% |
| T | 177685 | 6.7% |
| N | 14762 | 0.6% |
| Q | 2711 | 0.1% |
| L | 489 | < 0.1% |
| S | 302 | < 0.1% |
| H | 196 | < 0.1% |
| P | 20 | < 0.1% |
tenor_203L
Real number (ℝ)
MISSING 
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 238987 |
| Missing (%) | 9.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.85803407 |
| Minimum | 3 |
|---|---|
| Maximum | 63 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 20.1 MiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 9 |
| median | 12 |
| Q3 | 24 |
| 95-th percentile | 36 |
| Maximum | 63 |
| Range | 60 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 11.25158885 |
|---|---|
| Coefficient of variation (CV) | 0.6674318494 |
| Kurtosis | 1.295980682 |
| Mean | 16.85803407 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.211121972 |
| Sum | 40447616 |
| Variance | 126.5982517 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 594080 | |
| 24 | 414664 | |
| 6 | 355258 | |
| 18 | 200921 | 7.6% |
| 36 | 156551 | 5.9% |
| 3 | 117252 | 4.4% |
| 48 | 81251 | 3.1% |
| 16 | 77062 | 2.9% |
| 9 | 51005 | 1.9% |
| 4 | 47083 | 1.8% |
| Other values (50) | 304181 | |
| (Missing) | 238987 |
| Value | Count | Frequency (%) |
| 3 | 117252 | 4.4% |
| 4 | 47083 | 1.8% |
| 5 | 23564 | 0.9% |
| 6 | 355258 | |
| 7 | 4135 | 0.2% |
| Value | Count | Frequency (%) |
| 63 | 3 | < 0.1% |
| 62 | 12 | < 0.1% |
| 61 | 18 | < 0.1% |
| 60 | 12352 | |
| 59 | 1 | < 0.1% |