Dataset statistics
| Number of variables | 41 |
|---|---|
| Number of observations | 3887684 |
| Missing cells | 49311059 |
| Missing cells (%) | 30.9% |
| Total size in memory | 1.2 GiB |
| Average record size in memory | 328.0 B |
Variable types
| Numeric | 20 |
|---|---|
| Text | 19 |
| Boolean | 2 |
isbidproduct_390L is highly imbalanced (68.7%) | Imbalance |
annuity_853A has 155851 (4.0%) missing values | Missing |
approvaldate_319D has 1766021 (45.4%) missing values | Missing |
byoccupationinc_3656910L has 2896024 (74.5%) missing values | Missing |
childnum_21L has 1953893 (50.3%) missing values | Missing |
credacc_actualbalance_314A has 3719506 (95.7%) missing values | Missing |
credacc_credlmt_575A has 119070 (3.1%) missing values | Missing |
credacc_maxhisbal_375A has 3719506 (95.7%) missing values | Missing |
credacc_minhisbal_90A has 3719506 (95.7%) missing values | Missing |
credacc_status_367L has 3719506 (95.7%) missing values | Missing |
credacc_transactions_402L has 3719506 (95.7%) missing values | Missing |
credamount_590A has 123329 (3.2%) missing values | Missing |
credtype_587L has 123329 (3.2%) missing values | Missing |
currdebt_94A has 1270377 (32.7%) missing values | Missing |
dateactivated_425D has 1844702 (47.4%) missing values | Missing |
downpmt_134A has 123329 (3.2%) missing values | Missing |
dtlastpmt_581D has 2860375 (73.6%) missing values | Missing |
dtlastpmtallstes_3545839D has 2434155 (62.6%) missing values | Missing |
employedfrom_700D has 2180869 (56.1%) missing values | Missing |
familystate_726L has 1245201 (32.0%) missing values | Missing |
firstnonzeroinstldate_307D has 365175 (9.4%) missing values | Missing |
inittransactioncode_279L has 123329 (3.2%) missing values | Missing |
isdebitcard_527L has 3637550 (93.6%) missing values | Missing |
maxdpdtolerance_577P has 1817378 (46.7%) missing values | Missing |
outstandingdebt_522A has 1277922 (32.9%) missing values | Missing |
pmtnum_8L has 312833 (8.0%) missing values | Missing |
revolvingaccount_394A has 3731033 (96.0%) missing values | Missing |
tenor_203L has 312833 (8.0%) missing values | Missing |
actualdpd_943P is highly skewed (γ1 = 716.1410421) | Skewed |
credacc_maxhisbal_375A is highly skewed (γ1 = 154.6093224) | Skewed |
actualdpd_943P has 3882797 (99.9%) zeros | Zeros |
annuity_853A has 225443 (5.8%) zeros | Zeros |
byoccupationinc_3656910L has 63137 (1.6%) zeros | Zeros |
childnum_21L has 1054010 (27.1%) zeros | Zeros |
credacc_credlmt_575A has 3470494 (89.3%) zeros | Zeros |
credacc_maxhisbal_375A has 98392 (2.5%) zeros | Zeros |
credacc_minhisbal_90A has 100911 (2.6%) zeros | Zeros |
credacc_transactions_402L has 150233 (3.9%) zeros | Zeros |
credamount_590A has 42592 (1.1%) zeros | Zeros |
currdebt_94A has 2238778 (57.6%) zeros | Zeros |
downpmt_134A has 3381858 (87.0%) zeros | Zeros |
maxdpdtolerance_577P has 1527003 (39.3%) zeros | Zeros |
num_group1 has 782997 (20.1%) zeros | Zeros |
outstandingdebt_522A has 2229538 (57.3%) zeros | Zeros |
Reproduction
| Analysis started | 2024-02-13 19:36:08.396202 |
|---|---|
| Analysis finished | 2024-02-13 19:36:34.918128 |
| Duration | 26.52 seconds |
| Software version | ydata-profiling vv4.6.4 |
| Download configuration | config.json |
case_id
Real number (ℝ)
| Distinct | 782997 |
|---|---|
| Distinct (%) | 20.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1397916.185 |
| Minimum | 2 |
|---|---|
| Maximum | 2651092 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 123162 |
| Q1 | 1251506 |
| median | 1451959 |
| Q3 | 1641585 |
| 95-th percentile | 2617890 |
| Maximum | 2651092 |
| Range | 2651090 |
| Interquartile range (IQR) | 390079 |
Descriptive statistics
| Standard deviation | 760159.4198 |
|---|---|
| Coefficient of variation (CV) | 0.5437803984 |
| Kurtosis | -0.5110858461 |
| Mean | 1397916.185 |
| Median Absolute Deviation (MAD) | 194660 |
| Skewness | -0.1345953203 |
| Sum | 5.434656384 × 1012 |
| Variance | 5.778423435 × 1011 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 1451156 | 20 | < 0.1% |
| 149924 | 20 | < 0.1% |
| 1617749 | 20 | < 0.1% |
| 149841 | 20 | < 0.1% |
| 149843 | 20 | < 0.1% |
| 2538422 | 20 | < 0.1% |
| 177936 | 20 | < 0.1% |
| 2538424 | 20 | < 0.1% |
| 111866 | 20 | < 0.1% |
| 2588419 | 20 | < 0.1% |
| Other values (782987) | 3887484 |
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 3 |
| Value | Count | Frequency (%) |
| 2651092 | 8 | |
| 2651091 | 3 | < 0.1% |
| 2651090 | 2 | < 0.1% |
| 2651089 | 12 | |
| 2651088 | 13 |
actualdpd_943P
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 101 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2234 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.01056299785 |
| Minimum | 0 |
|---|---|
| Maximum | 3676 |
| Zeros | 3882797 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 3676 |
| Range | 3676 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.75427732 |
|---|---|
| Coefficient of variation (CV) | 355.417787 |
| Kurtosis | 586099.5401 |
| Mean | 0.01056299785 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 716.1410421 |
| Sum | 41042 |
| Variance | 14.0945982 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3882797 | |
| 1 | 870 | < 0.1% |
| 2 | 530 | < 0.1% |
| 3 | 338 | < 0.1% |
| 4 | 174 | < 0.1% |
| 5 | 118 | < 0.1% |
| 6 | 85 | < 0.1% |
| 7 | 64 | < 0.1% |
| 8 | 58 | < 0.1% |
| 9 | 46 | < 0.1% |
| Other values (91) | 370 | < 0.1% |
| (Missing) | 2234 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 3882797 | |
| 1 | 870 | < 0.1% |
| 2 | 530 | < 0.1% |
| 3 | 338 | < 0.1% |
| 4 | 174 | < 0.1% |
| Value | Count | Frequency (%) |
| 3676 | 1 | |
| 3661 | 1 | |
| 2119 | 1 | |
| 2107 | 1 | |
| 1957 | 1 |
annuity_853A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 80309 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 155851 |
| Missing (%) | 4.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3413.166466 |
| Minimum | 0 |
|---|---|
| Maximum | 105130.2 |
| Zeros | 225443 |
| Zeros (%) | 5.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1710.6 |
| median | 2761.2 |
| Q3 | 4396.4 |
| 95-th percentile | 8325.2 |
| Maximum | 105130.2 |
| Range | 105130.2 |
| Interquartile range (IQR) | 2685.8 |
Descriptive statistics
| Standard deviation | 2828.269 |
|---|---|
| Coefficient of variation (CV) | 0.8286349432 |
| Kurtosis | 32.4088215 |
| Mean | 3413.166466 |
| Median Absolute Deviation (MAD) | 1247.4 |
| Skewness | 3.37853393 |
| Sum | 1.273736725 × 1010 |
| Variance | 7999105.539 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 225443 | 5.8% |
| 1580 | 3773 | 0.1% |
| 1508 | 3156 | 0.1% |
| 2716 | 2434 | 0.1% |
| 2558.4001 | 1736 | < 0.1% |
| 2103 | 1685 | < 0.1% |
| 1668 | 1676 | < 0.1% |
| 2000 | 1666 | < 0.1% |
| 3837.4001 | 1625 | < 0.1% |
| 3840 | 1609 | < 0.1% |
| Other values (80299) | 3487030 | |
| (Missing) | 155851 | 4.0% |
| Value | Count | Frequency (%) |
| 0 | 225443 | |
| 1.8000001 | 1 | < 0.1% |
| 7.6 | 2 | < 0.1% |
| 8.2 | 2 | < 0.1% |
| 10.400001 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 105130.2 | 1 | |
| 103000 | 1 | |
| 100061.4 | 1 | |
| 99837.4 | 2 | |
| 99646.6 | 1 |
MISSING 
| Distinct | 5105 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1766021 |
| Missing (%) | 45.4% |
| Memory size | 29.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 21216630 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2019-01-11 |
|---|---|
| 2nd row | 2018-10-11 |
| 3rd row | 2018-12-31 |
| 4th row | 2018-11-02 |
| 5th row | 2018-12-11 |
| Value | Count | Frequency (%) |
| 2018-12-07 | 2982 | 0.1% |
| 2018-01-12 | 2867 | 0.1% |
| 2018-01-13 | 2720 | 0.1% |
| 2018-12-08 | 2567 | 0.1% |
| 2019-01-13 | 2387 | 0.1% |
| 2018-12-29 | 2273 | 0.1% |
| 2018-07-27 | 2215 | 0.1% |
| 2018-12-28 | 2193 | 0.1% |
| 2018-11-24 | 2161 | 0.1% |
| 2017-12-02 | 2126 | 0.1% |
| Other values (5095) | 2097172 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4872977 | |
| - | 4243326 | |
| 1 | 3932682 | |
| 2 | 3489665 | |
| 8 | 964186 | 4.5% |
| 7 | 774209 | 3.6% |
| 9 | 687236 | 3.2% |
| 3 | 618144 | 2.9% |
| 6 | 606097 | 2.9% |
| 5 | 522667 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16973304 | |
| Dash Punctuation | 4243326 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4872977 | |
| 1 | 3932682 | |
| 2 | 3489665 | |
| 8 | 964186 | 5.7% |
| 7 | 774209 | 4.6% |
| 9 | 687236 | 4.0% |
| 3 | 618144 | 3.6% |
| 6 | 606097 | 3.6% |
| 5 | 522667 | 3.1% |
| 4 | 505441 | 3.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4243326 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21216630 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4872977 | |
| - | 4243326 | |
| 1 | 3932682 | |
| 2 | 3489665 | |
| 8 | 964186 | 4.5% |
| 7 | 774209 | 3.6% |
| 9 | 687236 | 3.2% |
| 3 | 618144 | 2.9% |
| 6 | 606097 | 2.9% |
| 5 | 522667 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21216630 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4872977 | |
| - | 4243326 | |
| 1 | 3932682 | |
| 2 | 3489665 | |
| 8 | 964186 | 4.5% |
| 7 | 774209 | 3.6% |
| 9 | 687236 | 3.2% |
| 3 | 618144 | 2.9% |
| 6 | 606097 | 2.9% |
| 5 | 522667 | 2.5% |
byoccupationinc_3656910L
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 24298 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 2896024 |
| Missing (%) | 74.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19796.48403 |
| Minimum | 0 |
|---|---|
| Maximum | 200000 |
| Zeros | 63137 |
| Zeros (%) | 1.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 5000 |
| Q3 | 30000 |
| 95-th percentile | 71316.25 |
| Maximum | 200000 |
| Range | 200000 |
| Interquartile range (IQR) | 29999 |
Descriptive statistics
| Standard deviation | 30687.65251 |
|---|---|
| Coefficient of variation (CV) | 1.550156708 |
| Kurtosis | 10.76351177 |
| Mean | 19796.48403 |
| Median Absolute Deviation (MAD) | 5000 |
| Skewness | 2.799320872 |
| Sum | 1.963138135 × 1010 |
| Variance | 941732016.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 424404 | 10.9% |
| 0 | 63137 | 1.6% |
| 15000 | 50750 | 1.3% |
| 20000 | 42942 | 1.1% |
| 30000 | 38439 | 1.0% |
| 25000 | 34911 | 0.9% |
| 50000 | 33424 | 0.9% |
| 10000 | 23527 | 0.6% |
| 40000 | 19665 | 0.5% |
| 35000 | 19252 | 0.5% |
| Other values (24288) | 241209 | 6.2% |
| (Missing) | 2896024 |
| Value | Count | Frequency (%) |
| 0 | 63137 | 1.6% |
| 1 | 424404 | |
| 2 | 9 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 200000 | 7073 | |
| 199000 | 12 | < 0.1% |
| 198600 | 3 | < 0.1% |
| 198000 | 15 | < 0.1% |
| 197000 | 9 | < 0.1% |
| Distinct | 71 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.7 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 8 |
| Mean length | 8.84928379 |
| Min length | 8 |
Characters and Unicode
| Total characters | 34403219 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | a55475b1 |
|---|---|
| 2nd row | a55475b1 |
| 3rd row | P94_109_143 |
| 4th row | P24_27_36 |
| 5th row | P85_114_140 |
| Value | Count | Frequency (%) |
| a55475b1 | 2729942 | |
| p94_109_143 | 848995 | 21.8% |
| p180_60_137 | 43051 | 1.1% |
| p73_130_169 | 42479 | 1.1% |
| p198_89_166 | 37300 | 1.0% |
| p85_114_140 | 34868 | 0.9% |
| p30_86_84 | 31802 | 0.8% |
| p24_27_36 | 16040 | 0.4% |
| p141_135_146 | 15512 | 0.4% |
| p52_67_90 | 13724 | 0.4% |
| Other values (61) | 73971 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 8288778 | |
| 1 | 5002986 | |
| 4 | 4590459 | |
| 7 | 2870275 | 8.3% |
| a | 2729942 | 7.9% |
| b | 2729942 | 7.9% |
| _ | 2315484 | 6.7% |
| 9 | 1874613 | 5.4% |
| P | 1157742 | 3.4% |
| 0 | 1109335 | 3.2% |
| Other values (4) | 1733663 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 25470109 | |
| Lowercase Letter | 5459884 | 15.9% |
| Connector Punctuation | 2315484 | 6.7% |
| Uppercase Letter | 1157742 | 3.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 8288778 | |
| 1 | 5002986 | |
| 4 | 4590459 | |
| 7 | 2870275 | 11.3% |
| 9 | 1874613 | 7.4% |
| 0 | 1109335 | 4.4% |
| 3 | 1084801 | 4.3% |
| 6 | 302589 | 1.2% |
| 8 | 252522 | 1.0% |
| 2 | 93751 | 0.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2729942 | |
| b | 2729942 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2315484 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1157742 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 27785593 | |
| Latin | 6617626 | 19.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 8288778 | |
| 1 | 5002986 | |
| 4 | 4590459 | |
| 7 | 2870275 | 10.3% |
| _ | 2315484 | 8.3% |
| 9 | 1874613 | 6.7% |
| 0 | 1109335 | 4.0% |
| 3 | 1084801 | 3.9% |
| 6 | 302589 | 1.1% |
| 8 | 252522 | 0.9% |
Latin
| Value | Count | Frequency (%) |
| a | 2729942 | |
| b | 2729942 | |
| P | 1157742 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34403219 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 8288778 | |
| 1 | 5002986 | |
| 4 | 4590459 | |
| 7 | 2870275 | 8.3% |
| a | 2729942 | 7.9% |
| b | 2729942 | 7.9% |
| _ | 2315484 | 6.7% |
| 9 | 1874613 | 5.4% |
| P | 1157742 | 3.4% |
| 0 | 1109335 | 3.2% |
| Other values (4) | 1733663 | 5.0% |
childnum_21L
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1953893 |
| Missing (%) | 50.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8434701578 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 1054010 |
| Zeros (%) | 27.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.209425285 |
|---|---|
| Coefficient of variation (CV) | 1.433868494 |
| Kurtosis | 6.155115887 |
| Mean | 0.8434701578 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.989613502 |
| Sum | 1631095 |
| Variance | 1.46270952 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1054010 | |
| 1 | 430613 | 11.1% |
| 2 | 277280 | 7.1% |
| 3 | 99088 | 2.5% |
| 4 | 39691 | 1.0% |
| 5 | 19363 | 0.5% |
| 6 | 8144 | 0.2% |
| 7 | 3028 | 0.1% |
| 8 | 1401 | < 0.1% |
| 9 | 539 | < 0.1% |
| Other values (10) | 634 | < 0.1% |
| (Missing) | 1953893 |
| Value | Count | Frequency (%) |
| 0 | 1054010 | |
| 1 | 430613 | |
| 2 | 277280 | 7.1% |
| 3 | 99088 | 2.5% |
| 4 | 39691 | 1.0% |
| Value | Count | Frequency (%) |
| 20 | 12 | |
| 18 | 5 | < 0.1% |
| 17 | 4 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 18 |
| Distinct | 5107 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 35 |
| Missing (%) | < 0.1% |
| Memory size | 29.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 38876490 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2013-04-03 |
|---|---|
| 2nd row | 2013-04-03 |
| 3rd row | 2019-01-07 |
| 4th row | 2019-01-08 |
| 5th row | 2019-01-16 |
| Value | Count | Frequency (%) |
| 2018-12-07 | 4961 | 0.1% |
| 2018-01-12 | 4168 | 0.1% |
| 2018-12-08 | 3986 | 0.1% |
| 2018-12-28 | 3940 | 0.1% |
| 2019-01-02 | 3859 | 0.1% |
| 2018-01-13 | 3799 | 0.1% |
| 2019-01-04 | 3796 | 0.1% |
| 2018-07-27 | 3692 | 0.1% |
| 2019-01-13 | 3646 | 0.1% |
| 2019-01-11 | 3618 | 0.1% |
| Other values (5097) | 3848184 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8914984 | |
| - | 7775298 | |
| 1 | 7156075 | |
| 2 | 6398126 | |
| 8 | 1706927 | 4.4% |
| 7 | 1362854 | 3.5% |
| 9 | 1346334 | 3.5% |
| 3 | 1148470 | 3.0% |
| 6 | 1091746 | 2.8% |
| 4 | 994262 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 31101192 | |
| Dash Punctuation | 7775298 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8914984 | |
| 1 | 7156075 | |
| 2 | 6398126 | |
| 8 | 1706927 | 5.5% |
| 7 | 1362854 | 4.4% |
| 9 | 1346334 | 4.3% |
| 3 | 1148470 | 3.7% |
| 6 | 1091746 | 3.5% |
| 4 | 994262 | 3.2% |
| 5 | 981414 | 3.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7775298 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 38876490 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8914984 | |
| - | 7775298 | |
| 1 | 7156075 | |
| 2 | 6398126 | |
| 8 | 1706927 | 4.4% |
| 7 | 1362854 | 3.5% |
| 9 | 1346334 | 3.5% |
| 3 | 1148470 | 3.0% |
| 6 | 1091746 | 2.8% |
| 4 | 994262 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38876490 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8914984 | |
| - | 7775298 | |
| 1 | 7156075 | |
| 2 | 6398126 | |
| 8 | 1706927 | 4.4% |
| 7 | 1362854 | 3.5% |
| 9 | 1346334 | 3.5% |
| 3 | 1148470 | 3.0% |
| 6 | 1091746 | 2.8% |
| 4 | 994262 | 2.6% |
credacc_actualbalance_314A
Real number (ℝ)
MISSING 
| Distinct | 57143 |
|---|---|
| Distinct (%) | 34.0% |
| Missing | 3719506 |
| Missing (%) | 95.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20269.5803 |
| Minimum | -114086 |
|---|---|
| Maximum | 2540730 |
| Zeros | 37132 |
| Zeros (%) | 1.0% |
| Negative | 365 |
| Negative (%) | < 0.1% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | -114086 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3.0305 |
| median | 12585 |
| Q3 | 30446 |
| 95-th percentile | 76163.788 |
| Maximum | 2540730 |
| Range | 2654816 |
| Interquartile range (IQR) | 30442.9695 |
Descriptive statistics
| Standard deviation | 26002.78165 |
|---|---|
| Coefficient of variation (CV) | 1.282847561 |
| Kurtosis | 528.4045432 |
| Mean | 20269.5803 |
| Median Absolute Deviation (MAD) | 12585 |
| Skewness | 6.9851974 |
| Sum | 3408897475 |
| Variance | 676144653.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 37132 | 1.0% |
| 100000 | 2944 | 0.1% |
| 2 | 896 | < 0.1% |
| 12000 | 808 | < 0.1% |
| 42640 | 748 | < 0.1% |
| 0.2 | 735 | < 0.1% |
| 30000 | 642 | < 0.1% |
| 20300 | 539 | < 0.1% |
| 4 | 520 | < 0.1% |
| 40600 | 499 | < 0.1% |
| Other values (57133) | 122715 | 3.2% |
| (Missing) | 3719506 |
| Value | Count | Frequency (%) |
| -114086 | 1 | |
| -57634.06 | 1 | |
| -52432.37 | 1 | |
| -47822.348 | 1 | |
| -36996.402 | 1 |
| Value | Count | Frequency (%) |
| 2540730 | 1 | |
| 519966 | 1 | |
| 428026.25 | 1 | |
| 300000 | 1 | |
| 264004.8 | 2 |
credacc_credlmt_575A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 37849 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 119070 |
| Missing (%) | 3.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3254.676597 |
| Minimum | 0 |
|---|---|
| Maximum | 400000 |
| Zeros | 3470494 |
| Zeros (%) | 89.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 23294 |
| Maximum | 400000 |
| Range | 400000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 14061.34948 |
|---|---|
| Coefficient of variation (CV) | 4.320352288 |
| Kurtosis | 36.98869737 |
| Mean | 3254.676597 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.584726264 |
| Sum | 1.226561979 × 1010 |
| Variance | 197721549.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3470494 | |
| 100000 | 32568 | 0.8% |
| 12000 | 13694 | 0.4% |
| 20000 | 6333 | 0.2% |
| 40000 | 6175 | 0.2% |
| 60000 | 4714 | 0.1% |
| 24000 | 3837 | 0.1% |
| 30000 | 3538 | 0.1% |
| 10000 | 3177 | 0.1% |
| 42640 | 2828 | 0.1% |
| Other values (37839) | 221256 | 5.7% |
| (Missing) | 119070 | 3.1% |
| Value | Count | Frequency (%) |
| 0 | 3470494 | |
| 0.2 | 1275 | < 0.1% |
| 0.6 | 2 | < 0.1% |
| 3.6000001 | 2 | < 0.1% |
| 20 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 400000 | 14 | |
| 300000 | 14 | |
| 270400 | 1 | < 0.1% |
| 245200 | 1 | < 0.1% |
| 240000 | 1 | < 0.1% |
credacc_maxhisbal_375A
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 43046 |
|---|---|
| Distinct (%) | 25.6% |
| Missing | 3719506 |
| Missing (%) | 95.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -3288.887246 |
| Minimum | -196108.17 |
|---|---|
| Maximum | 7988198.5 |
| Zeros | 98392 |
| Zeros (%) | 2.5% |
| Negative | 32587 |
| Negative (%) | 0.8% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | -196108.17 |
|---|---|
| 5-th percentile | -29255.55 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 588.9267 |
| Maximum | 7988198.5 |
| Range | 8184306.67 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 28086.11335 |
|---|---|
| Coefficient of variation (CV) | -8.539700891 |
| Kurtosis | 40843.88705 |
| Mean | -3288.887246 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 154.6093224 |
| Sum | -553118479.3 |
| Variance | 788829762.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 98392 | 2.5% |
| 2 | 2042 | 0.1% |
| 4 | 1072 | < 0.1% |
| 6 | 560 | < 0.1% |
| 22 | 500 | < 0.1% |
| 10 | 343 | < 0.1% |
| 24 | 298 | < 0.1% |
| 8 | 285 | < 0.1% |
| 20 | 238 | < 0.1% |
| 42 | 213 | < 0.1% |
| Other values (43036) | 64235 | 1.7% |
| (Missing) | 3719506 |
| Value | Count | Frequency (%) |
| -196108.17 | 2 | |
| -192894.4 | 1 | |
| -185260 | 1 | |
| -183642.02 | 1 | |
| -181545.2 | 1 |
| Value | Count | Frequency (%) |
| 7988198.5 | 1 | |
| 3556000 | 1 | |
| 1900000 | 2 | |
| 1600000 | 1 | |
| 940000 | 1 |
credacc_minhisbal_90A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 42878 |
|---|---|
| Distinct (%) | 25.5% |
| Missing | 3719506 |
| Missing (%) | 95.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -6554.784203 |
| Minimum | -206808.17 |
|---|---|
| Maximum | 199567 |
| Zeros | 100911 |
| Zeros (%) | 2.6% |
| Negative | 43646 |
| Negative (%) | 1.1% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | -206808.17 |
|---|---|
| 5-th percentile | -40000 |
| Q1 | -1042.079025 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 55.8066 |
| Maximum | 199567 |
| Range | 406375.17 |
| Interquartile range (IQR) | 1042.079025 |
Descriptive statistics
| Standard deviation | 16888.86244 |
|---|---|
| Coefficient of variation (CV) | -2.576570321 |
| Kurtosis | 17.9638962 |
| Mean | -6554.784203 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -3.715506656 |
| Sum | -1102370498 |
| Variance | 285233674.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 100911 | 2.6% |
| 2 | 1383 | < 0.1% |
| 4 | 778 | < 0.1% |
| 6 | 427 | < 0.1% |
| 10 | 315 | < 0.1% |
| 22 | 309 | < 0.1% |
| -10 | 242 | < 0.1% |
| 24 | 216 | < 0.1% |
| 8 | 210 | < 0.1% |
| 20 | 208 | < 0.1% |
| Other values (42868) | 63179 | 1.6% |
| (Missing) | 3719506 |
| Value | Count | Frequency (%) |
| -206808.17 | 2 | |
| -200000 | 1 | |
| -199998 | 1 | |
| -199996 | 1 | |
| -199994.4 | 1 |
| Value | Count | Frequency (%) |
| 199567 | 2 | |
| 100000 | 1 | |
| 89859.59 | 1 | |
| 79000 | 1 | |
| 70000 | 1 |
MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3719506 |
| Missing (%) | 95.7% |
| Memory size | 29.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.010262936 |
| Min length | 2 |
Characters and Unicode
| Total characters | 338082 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CL |
|---|---|
| 2nd row | CL |
| 3rd row | CL |
| 4th row | CL |
| 5th row | AC |
| Value | Count | Frequency (%) |
| ac | 90216 | |
| cl | 61855 | |
| ca | 14052 | 8.4% |
| pcl | 1726 | 1.0% |
| po | 282 | 0.2% |
| cr | 47 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 167896 | |
| A | 104268 | |
| L | 63581 | 18.8% |
| P | 2008 | 0.6% |
| O | 282 | 0.1% |
| R | 47 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 338082 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 167896 | |
| A | 104268 | |
| L | 63581 | 18.8% |
| P | 2008 | 0.6% |
| O | 282 | 0.1% |
| R | 47 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 338082 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 167896 | |
| A | 104268 | |
| L | 63581 | 18.8% |
| P | 2008 | 0.6% |
| O | 282 | 0.1% |
| R | 47 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 338082 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 167896 | |
| A | 104268 | |
| L | 63581 | 18.8% |
| P | 2008 | 0.6% |
| O | 282 | 0.1% |
| R | 47 | < 0.1% |
credacc_transactions_402L
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 86 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 3719506 |
| Missing (%) | 95.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5221253672 |
| Minimum | 0 |
|---|---|
| Maximum | 155 |
| Zeros | 150233 |
| Zeros (%) | 3.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 155 |
| Range | 155 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.951672329 |
|---|---|
| Coefficient of variation (CV) | 5.653186982 |
| Kurtosis | 344.9259013 |
| Mean | 0.5221253672 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 14.1469766 |
| Sum | 87810 |
| Variance | 8.712369536 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 150233 | 3.9% |
| 1 | 6693 | 0.2% |
| 2 | 2853 | 0.1% |
| 3 | 1893 | < 0.1% |
| 4 | 1252 | < 0.1% |
| 5 | 931 | < 0.1% |
| 6 | 730 | < 0.1% |
| 7 | 552 | < 0.1% |
| 8 | 411 | < 0.1% |
| 9 | 387 | < 0.1% |
| Other values (76) | 2243 | 0.1% |
| (Missing) | 3719506 |
| Value | Count | Frequency (%) |
| 0 | 150233 | |
| 1 | 6693 | 0.2% |
| 2 | 2853 | 0.1% |
| 3 | 1893 | < 0.1% |
| 4 | 1252 | < 0.1% |
| Value | Count | Frequency (%) |
| 155 | 2 | |
| 135 | 1 | |
| 126 | 1 | |
| 123 | 1 | |
| 110 | 1 |
credamount_590A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 201793 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 123329 |
| Missing (%) | 3.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38657.85509 |
| Minimum | 0 |
|---|---|
| Maximum | 715392 |
| Zeros | 42592 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5022 |
| Q1 | 13998 |
| median | 27000 |
| Q3 | 50000 |
| 95-th percentile | 100000 |
| Maximum | 715392 |
| Range | 715392 |
| Interquartile range (IQR) | 36002 |
Descriptive statistics
| Standard deviation | 37544.33619 |
|---|---|
| Coefficient of variation (CV) | 0.9711955334 |
| Kurtosis | 10.0483708 |
| Mean | 38657.85509 |
| Median Absolute Deviation (MAD) | 15020 |
| Skewness | 2.478930874 |
| Sum | 1.455218901 × 1011 |
| Variance | 1409577180 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60000 | 186103 | 4.8% |
| 100000 | 175966 | 4.5% |
| 40000 | 174743 | 4.5% |
| 20000 | 170088 | 4.4% |
| 30000 | 140432 | 3.6% |
| 50000 | 63461 | 1.6% |
| 10000 | 52190 | 1.3% |
| 24000 | 47919 | 1.2% |
| 80000 | 43464 | 1.1% |
| 0 | 42592 | 1.1% |
| Other values (201783) | 2667397 | |
| (Missing) | 123329 | 3.2% |
| Value | Count | Frequency (%) |
| 0 | 42592 | |
| 0.2 | 675 | < 0.1% |
| 0.6 | 1 | < 0.1% |
| 3.6000001 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 715392 | 2 | |
| 550000 | 1 | |
| 501422.22 | 2 | |
| 493000 | 1 | |
| 480665.1 | 2 |
credtype_587L
Text
MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 123329 |
| Missing (%) | 3.2% |
| Memory size | 29.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 11293065 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CAL |
|---|---|
| 2nd row | CAL |
| 3rd row | CAL |
| 4th row | CAL |
| 5th row | CAL |
| Value | Count | Frequency (%) |
| col | 2035876 | |
| cal | 1478343 | |
| rel | 250136 | 6.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| L | 3764355 | |
| C | 3514219 | |
| O | 2035876 | |
| A | 1478343 | 13.1% |
| R | 250136 | 2.2% |
| E | 250136 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 11293065 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 3764355 | |
| C | 3514219 | |
| O | 2035876 | |
| A | 1478343 | 13.1% |
| R | 250136 | 2.2% |
| E | 250136 | 2.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11293065 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 3764355 | |
| C | 3514219 | |
| O | 2035876 | |
| A | 1478343 | 13.1% |
| R | 250136 | 2.2% |
| E | 250136 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11293065 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| L | 3764355 | |
| C | 3514219 | |
| O | 2035876 | |
| A | 1478343 | 13.1% |
| R | 250136 | 2.2% |
| E | 250136 | 2.2% |
currdebt_94A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 346082 |
|---|---|
| Distinct (%) | 13.2% |
| Missing | 1270377 |
| Missing (%) | 32.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5229.100749 |
| Minimum | 0 |
|---|---|
| Maximum | 507429.72 |
| Zeros | 2238778 |
| Zeros (%) | 57.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 35505.071 |
| Maximum | 507429.72 |
| Range | 507429.72 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 19278.42258 |
|---|---|
| Coefficient of variation (CV) | 3.686756769 |
| Kurtosis | 47.65318341 |
| Mean | 5229.100749 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.85585581 |
| Sum | 1.368616199 × 1010 |
| Variance | 371657577.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2238778 | |
| 19998 | 91 | < 0.1% |
| 100000 | 91 | < 0.1% |
| 20000 | 85 | < 0.1% |
| 11998 | 84 | < 0.1% |
| 17998 | 82 | < 0.1% |
| 7998 | 80 | < 0.1% |
| 40000 | 77 | < 0.1% |
| 15998 | 75 | < 0.1% |
| 13998 | 75 | < 0.1% |
| Other values (346072) | 377789 | 9.7% |
| (Missing) | 1270377 |
| Value | Count | Frequency (%) |
| 0 | 2238778 | |
| 0.002 | 1 | < 0.1% |
| 0.010000001 | 1 | < 0.1% |
| 0.020000001 | 1 | < 0.1% |
| 0.048 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 507429.72 | 1 | |
| 507040.06 | 1 | |
| 491492.1 | 1 | |
| 490718.6 | 1 | |
| 489944.25 | 1 |
MISSING 
| Distinct | 3939 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1844702 |
| Missing (%) | 47.4% |
| Memory size | 29.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 20429820 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 53 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2018-10-19 |
|---|---|
| 2nd row | 2018-11-07 |
| 3rd row | 2018-12-28 |
| 4th row | 2018-10-09 |
| 5th row | 2018-11-16 |
| Value | Count | Frequency (%) |
| 2019-01-31 | 3811 | 0.2% |
| 2018-10-17 | 3053 | 0.1% |
| 2018-10-04 | 3022 | 0.1% |
| 2019-01-11 | 2915 | 0.1% |
| 2018-05-29 | 2894 | 0.1% |
| 2019-01-09 | 2885 | 0.1% |
| 2019-01-10 | 2842 | 0.1% |
| 2018-11-14 | 2817 | 0.1% |
| 2019-01-30 | 2805 | 0.1% |
| 2018-09-12 | 2780 | 0.1% |
| Other values (3929) | 2013158 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4702739 | |
| - | 4085964 | |
| 1 | 3822098 | |
| 2 | 3324977 | |
| 8 | 938945 | 4.6% |
| 7 | 742863 | 3.6% |
| 9 | 664418 | 3.3% |
| 6 | 587810 | 2.9% |
| 3 | 569050 | 2.8% |
| 5 | 503414 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16343856 | |
| Dash Punctuation | 4085964 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4702739 | |
| 1 | 3822098 | |
| 2 | 3324977 | |
| 8 | 938945 | 5.7% |
| 7 | 742863 | 4.5% |
| 9 | 664418 | 4.1% |
| 6 | 587810 | 3.6% |
| 3 | 569050 | 3.5% |
| 5 | 503414 | 3.1% |
| 4 | 487542 | 3.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4085964 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20429820 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4702739 | |
| - | 4085964 | |
| 1 | 3822098 | |
| 2 | 3324977 | |
| 8 | 938945 | 4.6% |
| 7 | 742863 | 3.6% |
| 9 | 664418 | 3.3% |
| 6 | 587810 | 2.9% |
| 3 | 569050 | 2.8% |
| 5 | 503414 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20429820 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4702739 | |
| - | 4085964 | |
| 1 | 3822098 | |
| 2 | 3324977 | |
| 8 | 938945 | 4.6% |
| 7 | 742863 | 3.6% |
| 9 | 664418 | 3.3% |
| 6 | 587810 | 2.9% |
| 3 | 569050 | 2.8% |
| 5 | 503414 | 2.5% |
district_544M
Text
| Distinct | 479 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.7 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 10.43327107 |
| Min length | 8 |
Characters and Unicode
| Total characters | 40561261 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 54 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | P136_108_173 |
|---|---|
| 2nd row | P136_108_173 |
| 3rd row | P131_33_167 |
| 4th row | P194_82_174 |
| 5th row | P54_133_26 |
| Value | Count | Frequency (%) |
| a55475b1 | 388961 | 10.0% |
| p131_33_167 | 190215 | 4.9% |
| p123_6_84 | 160709 | 4.1% |
| p197_47_166 | 137010 | 3.5% |
| p204_99_158 | 114381 | 2.9% |
| p98_137_111 | 94298 | 2.4% |
| p62_144_102 | 86540 | 2.2% |
| p159_143_123 | 85401 | 2.2% |
| p147_21_170 | 80957 | 2.1% |
| p178_112_160 | 71042 | 1.8% |
| Other values (469) | 2478170 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 8872392 | |
| _ | 6997446 | |
| P | 3498722 | 8.6% |
| 7 | 3069372 | 7.6% |
| 5 | 3022413 | 7.5% |
| 4 | 2644775 | 6.5% |
| 6 | 2218257 | 5.5% |
| 3 | 2204920 | 5.4% |
| 2 | 2070115 | 5.1% |
| 9 | 1904803 | 4.7% |
| Other values (8) | 4058046 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 29287166 | |
| Connector Punctuation | 6997446 | 17.3% |
| Uppercase Letter | 3498723 | 8.6% |
| Lowercase Letter | 777926 | 1.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8872392 | |
| 7 | 3069372 | 10.5% |
| 5 | 3022413 | 10.3% |
| 4 | 2644775 | 9.0% |
| 6 | 2218257 | 7.6% |
| 3 | 2204920 | 7.5% |
| 2 | 2070115 | 7.1% |
| 9 | 1904803 | 6.5% |
| 8 | 1893444 | 6.5% |
| 0 | 1386675 | 4.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 388961 | |
| b | 388961 | |
| e | 2 | < 0.1% |
| m | 1 | < 0.1% |
| t | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3498722 | |
| Q | 1 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 6997446 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 36284612 | |
| Latin | 4276649 | 10.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 8872392 | |
| _ | 6997446 | |
| 7 | 3069372 | 8.5% |
| 5 | 3022413 | 8.3% |
| 4 | 2644775 | 7.3% |
| 6 | 2218257 | 6.1% |
| 3 | 2204920 | 6.1% |
| 2 | 2070115 | 5.7% |
| 9 | 1904803 | 5.2% |
| 8 | 1893444 | 5.2% |
Latin
| Value | Count | Frequency (%) |
| P | 3498722 | |
| a | 388961 | 9.1% |
| b | 388961 | 9.1% |
| e | 2 | < 0.1% |
| Q | 1 | < 0.1% |
| m | 1 | < 0.1% |
| t | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40561261 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 8872392 | |
| _ | 6997446 | |
| P | 3498722 | 8.6% |
| 7 | 3069372 | 7.6% |
| 5 | 3022413 | 7.5% |
| 4 | 2644775 | 6.5% |
| 6 | 2218257 | 5.5% |
| 3 | 2204920 | 5.4% |
| 2 | 2070115 | 5.1% |
| 9 | 1904803 | 4.7% |
| Other values (8) | 4058046 |
downpmt_134A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 18720 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 123329 |
| Missing (%) | 3.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 457.8898882 |
| Minimum | 0 |
|---|---|
| Maximum | 320400 |
| Zeros | 3381858 |
| Zeros (%) | 87.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2000 |
| Maximum | 320400 |
| Range | 320400 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2697.225764 |
|---|---|
| Coefficient of variation (CV) | 5.890555423 |
| Kurtosis | 776.6842527 |
| Mean | 457.8898882 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 17.93615961 |
| Sum | 1723660090 |
| Variance | 7275026.82 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3381858 | |
| 2000 | 44574 | 1.1% |
| 4000 | 29703 | 0.8% |
| 1000 | 28357 | 0.7% |
| 6000 | 16262 | 0.4% |
| 200 | 13639 | 0.4% |
| 10000 | 13472 | 0.3% |
| 400 | 12662 | 0.3% |
| 3000 | 11986 | 0.3% |
| 8000 | 8796 | 0.2% |
| Other values (18710) | 203046 | 5.2% |
| (Missing) | 123329 | 3.2% |
| Value | Count | Frequency (%) |
| 0 | 3381858 | |
| 0.2 | 390 | < 0.1% |
| 0.4 | 48 | < 0.1% |
| 0.6 | 124 | < 0.1% |
| 0.8 | 33 | < 0.1% |
| Value | Count | Frequency (%) |
| 320400 | 1 | < 0.1% |
| 305200 | 1 | < 0.1% |
| 300000 | 2 | |
| 274998 | 3 | |
| 268134.4 | 1 | < 0.1% |
dtlastpmt_581D
Text
MISSING 
| Distinct | 2150 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 2860375 |
| Missing (%) | 73.6% |
| Memory size | 29.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 10273090 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 214 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2019-01-10 |
|---|---|
| 2nd row | 2019-01-03 |
| 3rd row | 2019-01-08 |
| 4th row | 2018-12-05 |
| 5th row | 2018-12-26 |
| Value | Count | Frequency (%) |
| 2019-09-16 | 22398 | 2.2% |
| 2018-12-24 | 2210 | 0.2% |
| 2019-01-11 | 2176 | 0.2% |
| 2018-07-23 | 2052 | 0.2% |
| 2018-12-20 | 2012 | 0.2% |
| 2019-01-02 | 1982 | 0.2% |
| 2019-02-25 | 1939 | 0.2% |
| 2018-05-24 | 1929 | 0.2% |
| 2019-01-22 | 1922 | 0.2% |
| 2018-12-25 | 1916 | 0.2% |
| Other values (2140) | 986773 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2267061 | |
| - | 2054618 | |
| 1 | 1871415 | |
| 2 | 1671458 | |
| 8 | 544478 | 5.3% |
| 9 | 486902 | 4.7% |
| 7 | 418414 | 4.1% |
| 6 | 361158 | 3.5% |
| 3 | 229774 | 2.2% |
| 5 | 196197 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8218472 | |
| Dash Punctuation | 2054618 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2267061 | |
| 1 | 1871415 | |
| 2 | 1671458 | |
| 8 | 544478 | 6.6% |
| 9 | 486902 | 5.9% |
| 7 | 418414 | 5.1% |
| 6 | 361158 | 4.4% |
| 3 | 229774 | 2.8% |
| 5 | 196197 | 2.4% |
| 4 | 171615 | 2.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2054618 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10273090 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2267061 | |
| - | 2054618 | |
| 1 | 1871415 | |
| 2 | 1671458 | |
| 8 | 544478 | 5.3% |
| 9 | 486902 | 4.7% |
| 7 | 418414 | 4.1% |
| 6 | 361158 | 3.5% |
| 3 | 229774 | 2.2% |
| 5 | 196197 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10273090 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2267061 | |
| - | 2054618 | |
| 1 | 1871415 | |
| 2 | 1671458 | |
| 8 | 544478 | 5.3% |
| 9 | 486902 | 4.7% |
| 7 | 418414 | 4.1% |
| 6 | 361158 | 3.5% |
| 3 | 229774 | 2.2% |
| 5 | 196197 | 1.9% |
MISSING 
| Distinct | 2168 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2434155 |
| Missing (%) | 62.6% |
| Memory size | 29.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 14535290 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 224 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2019-01-10 |
|---|---|
| 2nd row | 2018-12-29 |
| 3rd row | 2019-01-03 |
| 4th row | 2019-01-08 |
| 5th row | 2019-01-05 |
| Value | Count | Frequency (%) |
| 2019-09-16 | 24576 | 1.7% |
| 2019-05-22 | 4054 | 0.3% |
| 2019-05-20 | 3990 | 0.3% |
| 2019-03-25 | 3786 | 0.3% |
| 2019-01-22 | 3759 | 0.3% |
| 2019-05-27 | 3685 | 0.3% |
| 2019-07-19 | 3601 | 0.2% |
| 2019-02-25 | 3591 | 0.2% |
| 2019-03-19 | 3582 | 0.2% |
| 2019-03-20 | 3565 | 0.2% |
| Other values (2158) | 1395340 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3193320 | |
| - | 2907058 | |
| 1 | 2662034 | |
| 2 | 2367467 | |
| 9 | 969878 | 6.7% |
| 8 | 655291 | 4.5% |
| 7 | 507252 | 3.5% |
| 6 | 443031 | 3.0% |
| 3 | 318089 | 2.2% |
| 5 | 272558 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11628232 | |
| Dash Punctuation | 2907058 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3193320 | |
| 1 | 2662034 | |
| 2 | 2367467 | |
| 9 | 969878 | 8.3% |
| 8 | 655291 | 5.6% |
| 7 | 507252 | 4.4% |
| 6 | 443031 | 3.8% |
| 3 | 318089 | 2.7% |
| 5 | 272558 | 2.3% |
| 4 | 239312 | 2.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2907058 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14535290 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3193320 | |
| - | 2907058 | |
| 1 | 2662034 | |
| 2 | 2367467 | |
| 9 | 969878 | 6.7% |
| 8 | 655291 | 4.5% |
| 7 | 507252 | 3.5% |
| 6 | 443031 | 3.0% |
| 3 | 318089 | 2.2% |
| 5 | 272558 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14535290 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3193320 | |
| - | 2907058 | |
| 1 | 2662034 | |
| 2 | 2367467 | |
| 9 | 969878 | 6.7% |
| 8 | 655291 | 4.5% |
| 7 | 507252 | 3.5% |
| 6 | 443031 | 3.0% |
| 3 | 318089 | 2.2% |
| 5 | 272558 | 1.9% |
education_1138M
Text
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.7 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 9.311229771 |
| Min length | 8 |
Characters and Unicode
| Total characters | 36199119 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | P97_36_170 |
|---|---|
| 2nd row | P97_36_170 |
| 3rd row | P97_36_170 |
| 4th row | a55475b1 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 1693666 | |
| p97_36_170 | 1458441 | |
| p33_146_175 | 680819 | |
| p106_81_188 | 26860 | 0.7% |
| p17_36_170 | 25966 | 0.7% |
| p157_18_172 | 1932 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 5763749 | |
| 7 | 5347163 | |
| 1 | 4652053 | |
| _ | 4388036 | |
| 3 | 2846045 | |
| 4 | 2374485 | |
| P | 2194018 | 6.1% |
| 6 | 2192086 | 6.1% |
| a | 1693666 | 4.7% |
| b | 1693666 | 4.7% |
| Other values (4) | 3054152 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26229733 | |
| Connector Punctuation | 4388036 | 12.1% |
| Lowercase Letter | 3387332 | 9.4% |
| Uppercase Letter | 2194018 | 6.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 5763749 | |
| 7 | 5347163 | |
| 1 | 4652053 | |
| 3 | 2846045 | |
| 4 | 2374485 | |
| 6 | 2192086 | 8.4% |
| 0 | 1511267 | 5.8% |
| 9 | 1458441 | 5.6% |
| 8 | 82512 | 0.3% |
| 2 | 1932 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1693666 | |
| b | 1693666 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4388036 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2194018 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 30617769 | |
| Latin | 5581350 | 15.4% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 5763749 | |
| 7 | 5347163 | |
| 1 | 4652053 | |
| _ | 4388036 | |
| 3 | 2846045 | |
| 4 | 2374485 | |
| 6 | 2192086 | 7.2% |
| 0 | 1511267 | 4.9% |
| 9 | 1458441 | 4.8% |
| 8 | 82512 | 0.3% |
Latin
| Value | Count | Frequency (%) |
| P | 2194018 | |
| a | 1693666 | |
| b | 1693666 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36199119 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 5763749 | |
| 7 | 5347163 | |
| 1 | 4652053 | |
| _ | 4388036 | |
| 3 | 2846045 | |
| 4 | 2374485 | |
| P | 2194018 | 6.1% |
| 6 | 2192086 | 6.1% |
| a | 1693666 | 4.7% |
| b | 1693666 | 4.7% |
| Other values (4) | 3054152 |
MISSING 
| Distinct | 9285 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 2180869 |
| Missing (%) | 56.1% |
| Memory size | 29.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 17068150 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1902 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2010-02-15 |
|---|---|
| 2nd row | 2010-02-15 |
| 3rd row | 2018-05-15 |
| 4th row | 2013-09-15 |
| 5th row | 2012-09-15 |
| Value | Count | Frequency (%) |
| 2015-01-15 | 29305 | 1.7% |
| 2016-01-15 | 28713 | 1.7% |
| 2014-01-15 | 28642 | 1.7% |
| 2017-01-15 | 28477 | 1.7% |
| 2013-01-15 | 28312 | 1.7% |
| 2012-01-15 | 24185 | 1.4% |
| 2010-01-15 | 18632 | 1.1% |
| 2011-09-15 | 17503 | 1.0% |
| 2010-09-15 | 17495 | 1.0% |
| 2012-09-15 | 17356 | 1.0% |
| Other values (9275) | 1468195 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3897011 | |
| 1 | 3611856 | |
| - | 3413630 | |
| 2 | 1980481 | |
| 5 | 1922389 | |
| 9 | 677460 | 4.0% |
| 8 | 341301 | 2.0% |
| 6 | 318819 | 1.9% |
| 3 | 311076 | 1.8% |
| 4 | 305810 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13654520 | |
| Dash Punctuation | 3413630 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3897011 | |
| 1 | 3611856 | |
| 2 | 1980481 | |
| 5 | 1922389 | |
| 9 | 677460 | 5.0% |
| 8 | 341301 | 2.5% |
| 6 | 318819 | 2.3% |
| 3 | 311076 | 2.3% |
| 4 | 305810 | 2.2% |
| 7 | 288317 | 2.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3413630 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17068150 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3897011 | |
| 1 | 3611856 | |
| - | 3413630 | |
| 2 | 1980481 | |
| 5 | 1922389 | |
| 9 | 677460 | 4.0% |
| 8 | 341301 | 2.0% |
| 6 | 318819 | 1.9% |
| 3 | 311076 | 1.8% |
| 4 | 305810 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17068150 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3897011 | |
| 1 | 3611856 | |
| - | 3413630 | |
| 2 | 1980481 | |
| 5 | 1922389 | |
| 9 | 677460 | 4.0% |
| 8 | 341301 | 2.0% |
| 6 | 318819 | 1.9% |
| 3 | 311076 | 1.8% |
| 4 | 305810 | 1.8% |
familystate_726L
Text
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1245201 |
| Missing (%) | 32.0% |
| Memory size | 29.7 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 7 |
| Mean length | 7.068848882 |
| Min length | 6 |
Characters and Unicode
| Total characters | 18679313 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SINGLE |
|---|---|
| 2nd row | SINGLE |
| 3rd row | MARRIED |
| 4th row | SINGLE |
| 5th row | SINGLE |
| Value | Count | Frequency (%) |
| married | 1904175 | |
| single | 391421 | 14.8% |
| widowed | 220519 | 8.3% |
| divorced | 85733 | 3.2% |
| living_with_partner | 40635 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 3975353 | |
| I | 2723753 | |
| E | 2642483 | |
| D | 2516679 | |
| A | 1944810 | |
| M | 1904175 | |
| W | 481673 | 2.6% |
| N | 472691 | 2.5% |
| L | 432056 | 2.3% |
| G | 432056 | 2.3% |
| Other values (8) | 1153584 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 18598043 | |
| Connector Punctuation | 81270 | 0.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 3975353 | |
| I | 2723753 | |
| E | 2642483 | |
| D | 2516679 | |
| A | 1944810 | |
| M | 1904175 | |
| W | 481673 | 2.6% |
| N | 472691 | 2.5% |
| L | 432056 | 2.3% |
| G | 432056 | 2.3% |
| Other values (7) | 1072314 | 5.8% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 81270 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18598043 | |
| Common | 81270 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 3975353 | |
| I | 2723753 | |
| E | 2642483 | |
| D | 2516679 | |
| A | 1944810 | |
| M | 1904175 | |
| W | 481673 | 2.6% |
| N | 472691 | 2.5% |
| L | 432056 | 2.3% |
| G | 432056 | 2.3% |
| Other values (7) | 1072314 | 5.8% |
Common
| Value | Count | Frequency (%) |
| _ | 81270 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18679313 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 3975353 | |
| I | 2723753 | |
| E | 2642483 | |
| D | 2516679 | |
| A | 1944810 | |
| M | 1904175 | |
| W | 481673 | 2.6% |
| N | 472691 | 2.5% |
| L | 432056 | 2.3% |
| G | 432056 | 2.3% |
| Other values (8) | 1153584 | 6.2% |
firstnonzeroinstldate_307D
Text
MISSING 
| Distinct | 4886 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 365175 |
| Missing (%) | 9.4% |
| Memory size | 29.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 35225090 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 14 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2013-05-04 |
|---|---|
| 2nd row | 2013-05-04 |
| 3rd row | 2019-02-07 |
| 4th row | 2019-02-08 |
| 5th row | 2018-10-12 |
| Value | Count | Frequency (%) |
| 2019-03-14 | 8057 | 0.2% |
| 2018-12-15 | 6898 | 0.2% |
| 2018-09-15 | 6596 | 0.2% |
| 2018-02-15 | 6117 | 0.2% |
| 2019-02-11 | 6078 | 0.2% |
| 2018-03-14 | 5897 | 0.2% |
| 2018-02-11 | 5858 | 0.2% |
| 2018-07-15 | 5657 | 0.2% |
| 2019-02-15 | 5602 | 0.2% |
| 2017-12-15 | 5221 | 0.1% |
| Other values (4876) | 3460528 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8125548 | |
| - | 7045018 | |
| 1 | 6501171 | |
| 2 | 5801762 | |
| 8 | 1472398 | 4.2% |
| 9 | 1257862 | 3.6% |
| 7 | 1174381 | 3.3% |
| 5 | 1090650 | 3.1% |
| 3 | 972274 | 2.8% |
| 6 | 949158 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 28180072 | |
| Dash Punctuation | 7045018 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8125548 | |
| 1 | 6501171 | |
| 2 | 5801762 | |
| 8 | 1472398 | 5.2% |
| 9 | 1257862 | 4.5% |
| 7 | 1174381 | 4.2% |
| 5 | 1090650 | 3.9% |
| 3 | 972274 | 3.5% |
| 6 | 949158 | 3.4% |
| 4 | 834868 | 3.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7045018 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 35225090 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8125548 | |
| - | 7045018 | |
| 1 | 6501171 | |
| 2 | 5801762 | |
| 8 | 1472398 | 4.2% |
| 9 | 1257862 | 3.6% |
| 7 | 1174381 | 3.3% |
| 5 | 1090650 | 3.1% |
| 3 | 972274 | 2.8% |
| 6 | 949158 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35225090 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8125548 | |
| - | 7045018 | |
| 1 | 6501171 | |
| 2 | 5801762 | |
| 8 | 1472398 | 4.2% |
| 9 | 1257862 | 3.6% |
| 7 | 1174381 | 3.3% |
| 5 | 1090650 | 3.1% |
| 3 | 972274 | 2.8% |
| 6 | 949158 | 2.7% |
MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 123329 |
| Missing (%) | 3.2% |
| Memory size | 29.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.392895197 |
| Min length | 3 |
Characters and Unicode
| Total characters | 12772062 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CASH |
|---|---|
| 2nd row | CASH |
| 3rd row | CASH |
| 4th row | CASH |
| 5th row | CASH |
| Value | Count | Frequency (%) |
| pos | 2167053 | |
| cash | 1478997 | |
| ndf | 118305 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 3646050 | |
| P | 2167053 | |
| O | 2167053 | |
| C | 1478997 | |
| A | 1478997 | |
| H | 1478997 | |
| N | 118305 | 0.9% |
| D | 118305 | 0.9% |
| F | 118305 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 12772062 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3646050 | |
| P | 2167053 | |
| O | 2167053 | |
| C | 1478997 | |
| A | 1478997 | |
| H | 1478997 | |
| N | 118305 | 0.9% |
| D | 118305 | 0.9% |
| F | 118305 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12772062 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 3646050 | |
| P | 2167053 | |
| O | 2167053 | |
| C | 1478997 | |
| A | 1478997 | |
| H | 1478997 | |
| N | 118305 | 0.9% |
| D | 118305 | 0.9% |
| F | 118305 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12772062 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 3646050 | |
| P | 2167053 | |
| O | 2167053 | |
| C | 1478997 | |
| A | 1478997 | |
| H | 1478997 | |
| N | 118305 | 0.9% |
| D | 118305 | 0.9% |
| F | 118305 | 0.9% |
isbidproduct_390L
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 35 |
| Missing (%) | < 0.1% |
| Memory size | 29.7 MiB |
| False | |
|---|---|
| True | 218833 |
| (Missing) | 35 |
| Value | Count | Frequency (%) |
| False | 3668816 | |
| True | 218833 | 5.6% |
| (Missing) | 35 | < 0.1% |
isdebitcard_527L
Boolean
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3637550 |
| Missing (%) | 93.6% |
| Memory size | 29.7 MiB |
| False | 212706 |
|---|---|
| True | 37428 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 212706 | 5.5% |
| True | 37428 | 1.0% |
| (Missing) | 3637550 |
mainoccupationinc_437A
Real number (ℝ)
| Distinct | 21957 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 36612 |
| Missing (%) | 0.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40029.69379 |
| Minimum | 0 |
|---|---|
| Maximum | 196000 |
| Zeros | 47 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6000 |
| Q1 | 18000 |
| median | 34000 |
| Q3 | 52000 |
| 95-th percentile | 100000 |
| Maximum | 196000 |
| Range | 196000 |
| Interquartile range (IQR) | 34000 |
Descriptive statistics
| Standard deviation | 31396.36665 |
|---|---|
| Coefficient of variation (CV) | 0.7843269254 |
| Kurtosis | 5.201732139 |
| Mean | 40029.69379 |
| Median Absolute Deviation (MAD) | 17000 |
| Skewness | 1.863363842 |
| Sum | 1.541572329 × 1011 |
| Variance | 985731838.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 30000 | 278622 | 7.2% |
| 40000 | 272284 | 7.0% |
| 50000 | 230602 | 5.9% |
| 60000 | 184523 | 4.7% |
| 20000 | 145935 | 3.8% |
| 24000 | 124888 | 3.2% |
| 70000 | 114524 | 2.9% |
| 36000 | 113091 | 2.9% |
| 16000 | 74780 | 1.9% |
| 12000 | 69980 | 1.8% |
| Other values (21947) | 2241843 |
| Value | Count | Frequency (%) |
| 0 | 47 | |
| 0.038 | 1 | < 0.1% |
| 0.2 | 78 | |
| 0.4 | 6 | < 0.1% |
| 0.6 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 196000 | 19426 | |
| 195800 | 4 | < 0.1% |
| 195600 | 16 | < 0.1% |
| 195540 | 1 | < 0.1% |
| 195400 | 1 | < 0.1% |
maxdpdtolerance_577P
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 3199 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1817378 |
| Missing (%) | 46.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.26581674 |
| Minimum | 0 |
|---|---|
| Maximum | 4058 |
| Zeros | 1527003 |
| Zeros (%) | 39.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 15 |
| Maximum | 4058 |
| Range | 4058 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 134.8893241 |
|---|---|
| Coefficient of variation (CV) | 10.16818841 |
| Kurtosis | 328.0205761 |
| Mean | 13.26581674 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.64564203 |
| Sum | 27464300 |
| Variance | 18195.12975 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1527003 | |
| 1 | 283391 | 7.3% |
| 5 | 35934 | 0.9% |
| 6 | 28382 | 0.7% |
| 10 | 21438 | 0.6% |
| 4 | 11330 | 0.3% |
| 9 | 11187 | 0.3% |
| 14 | 10079 | 0.3% |
| 7 | 9775 | 0.3% |
| 18 | 9442 | 0.2% |
| Other values (3189) | 122345 | 3.1% |
| (Missing) | 1817378 |
| Value | Count | Frequency (%) |
| 0 | 1527003 | |
| 1 | 283391 | 7.3% |
| 2 | 8816 | 0.2% |
| 3 | 2658 | 0.1% |
| 4 | 11330 | 0.3% |
| Value | Count | Frequency (%) |
| 4058 | 1 | |
| 4025 | 1 | |
| 4024 | 1 | |
| 4000 | 2 | |
| 3999 | 1 |
num_group1
Real number (ℝ)
ZEROS 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.910262254 |
| Minimum | 0 |
|---|---|
| Maximum | 19 |
| Zeros | 782997 |
| Zeros (%) | 20.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 6 |
| 95-th percentile | 13 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.101353263 |
|---|---|
| Coefficient of variation (CV) | 1.048869103 |
| Kurtosis | 1.58474303 |
| Mean | 3.910262254 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.393787504 |
| Sum | 15201864 |
| Variance | 16.82109859 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 782997 | |
| 1 | 622546 | |
| 2 | 494850 | |
| 3 | 394066 | |
| 4 | 314724 | |
| 5 | 251725 | 6.5% |
| 6 | 202039 | 5.2% |
| 7 | 162746 | 4.2% |
| 8 | 131605 | 3.4% |
| 9 | 106507 | 2.7% |
| Other values (10) | 423879 |
| Value | Count | Frequency (%) |
| 0 | 782997 | |
| 1 | 622546 | |
| 2 | 494850 | |
| 3 | 394066 | |
| 4 | 314724 |
| Value | Count | Frequency (%) |
| 19 | 16520 | |
| 18 | 19472 | |
| 17 | 23111 | |
| 16 | 27575 | |
| 15 | 33102 |
outstandingdebt_522A
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 269095 |
|---|---|
| Distinct (%) | 10.3% |
| Missing | 1277922 |
| Missing (%) | 32.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6994.587846 |
| Minimum | 0 |
|---|---|
| Maximum | 1210629.1 |
| Zeros | 2229538 |
| Zeros (%) | 57.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 41496.3719 |
| Maximum | 1210629.1 |
| Range | 1210629.1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 29162.48428 |
|---|---|
| Coefficient of variation (CV) | 4.169292734 |
| Kurtosis | 88.30188385 |
| Mean | 6994.587846 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.63303055 |
| Sum | 1.825420957 × 1010 |
| Variance | 850450489.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2229538 | |
| 10 | 188 | < 0.1% |
| 19998 | 82 | < 0.1% |
| 17998 | 75 | < 0.1% |
| 11998 | 73 | < 0.1% |
| 7998 | 72 | < 0.1% |
| 15998 | 66 | < 0.1% |
| 14998 | 66 | < 0.1% |
| 9978 | 64 | < 0.1% |
| 13998 | 64 | < 0.1% |
| Other values (269085) | 379474 | 9.8% |
| (Missing) | 1277922 |
| Value | Count | Frequency (%) |
| 0 | 2229538 | |
| 0.002 | 3 | < 0.1% |
| 0.004 | 1 | < 0.1% |
| 0.008 | 1 | < 0.1% |
| 0.010000001 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1210629.1 | 1 | |
| 1192100.9 | 1 | |
| 1092393 | 1 | |
| 1085048.1 | 1 | |
| 1071760.9 | 1 |
pmtnum_8L
Real number (ℝ)
MISSING 
| Distinct | 57 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 312833 |
| Missing (%) | 8.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.78210253 |
| Minimum | 3 |
|---|---|
| Maximum | 62 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 6 |
| median | 12 |
| Q3 | 24 |
| 95-th percentile | 36 |
| Maximum | 62 |
| Range | 59 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 10.46206069 |
|---|---|
| Coefficient of variation (CV) | 0.662906648 |
| Kurtosis | 1.294429223 |
| Mean | 15.78210253 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.219734053 |
| Sum | 56418665 |
| Variance | 109.4547138 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 883553 | |
| 6 | 589676 | |
| 24 | 562114 | |
| 18 | 296861 | 7.6% |
| 36 | 190785 | 4.9% |
| 3 | 188014 | 4.8% |
| 16 | 133452 | 3.4% |
| 48 | 102226 | 2.6% |
| 9 | 78979 | 2.0% |
| 10 | 75231 | 1.9% |
| Other values (47) | 473960 | |
| (Missing) | 312833 | 8.0% |
| Value | Count | Frequency (%) |
| 3 | 188014 | 4.8% |
| 4 | 73219 | 1.9% |
| 5 | 43429 | 1.1% |
| 6 | 589676 | |
| 7 | 7541 | 0.2% |
| Value | Count | Frequency (%) |
| 62 | 1 | < 0.1% |
| 61 | 3 | < 0.1% |
| 60 | 3148 | |
| 58 | 46 | < 0.1% |
| 56 | 23 | < 0.1% |
postype_4733339M
Text
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.7 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 8 |
| Mean length | 8.092561535 |
| Min length | 8 |
Characters and Unicode
| Total characters | 31461322 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | a55475b1 |
|---|---|
| 2nd row | a55475b1 |
| 3rd row | a55475b1 |
| 4th row | a55475b1 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 3779480 | |
| p177_117_192 | 57712 | 1.5% |
| p46_145_78 | 24034 | 0.6% |
| p149_40_170 | 11308 | 0.3% |
| p60_146_156 | 8054 | 0.2% |
| p67_102_161 | 4470 | 0.1% |
| p217_110_186 | 1560 | < 0.1% |
| p169_115_83 | 848 | < 0.1% |
| p140_48_169 | 218 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 11371376 | |
| 1 | 4095716 | 13.0% |
| 7 | 3993988 | 12.7% |
| 4 | 3858654 | 12.3% |
| a | 3779480 | 12.0% |
| b | 3779480 | 12.0% |
| _ | 216408 | 0.7% |
| P | 108204 | 0.3% |
| 9 | 70086 | 0.2% |
| 2 | 63742 | 0.2% |
| Other values (4) | 124188 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 23577750 | |
| Lowercase Letter | 7558960 | 24.0% |
| Connector Punctuation | 216408 | 0.7% |
| Uppercase Letter | 108204 | 0.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 11371376 | |
| 1 | 4095716 | 17.4% |
| 7 | 3993988 | 16.9% |
| 4 | 3858654 | 16.4% |
| 9 | 70086 | 0.3% |
| 2 | 63742 | 0.3% |
| 6 | 59762 | 0.3% |
| 0 | 36918 | 0.2% |
| 8 | 26660 | 0.1% |
| 3 | 848 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3779480 | |
| b | 3779480 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 216408 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 108204 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23794158 | |
| Latin | 7667164 | 24.4% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 11371376 | |
| 1 | 4095716 | 17.2% |
| 7 | 3993988 | 16.8% |
| 4 | 3858654 | 16.2% |
| _ | 216408 | 0.9% |
| 9 | 70086 | 0.3% |
| 2 | 63742 | 0.3% |
| 6 | 59762 | 0.3% |
| 0 | 36918 | 0.2% |
| 8 | 26660 | 0.1% |
Latin
| Value | Count | Frequency (%) |
| a | 3779480 | |
| b | 3779480 | |
| P | 108204 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31461322 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 11371376 | |
| 1 | 4095716 | 13.0% |
| 7 | 3993988 | 12.7% |
| 4 | 3858654 | 12.3% |
| a | 3779480 | 12.0% |
| b | 3779480 | 12.0% |
| _ | 216408 | 0.7% |
| P | 108204 | 0.3% |
| 9 | 70086 | 0.2% |
| 2 | 63742 | 0.2% |
| Other values (4) | 124188 | 0.4% |
profession_152M
Text
| Distinct | 9028 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.7 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 8 |
| Mean length | 8.031521852 |
| Min length | 7 |
Characters and Unicode
| Total characters | 31224019 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5636 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | a55475b1 |
|---|---|
| 2nd row | a55475b1 |
| 3rd row | a55475b1 |
| 4th row | a55475b1 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 3839511 | |
| p46_72_80 | 1281 | < 0.1% |
| p104_137_180 | 959 | < 0.1% |
| p167_22_171 | 699 | < 0.1% |
| p139_125_64 | 671 | < 0.1% |
| p143_116_69 | 665 | < 0.1% |
| p25_111_112 | 640 | < 0.1% |
| p21_76_53 | 612 | < 0.1% |
| p116_59_165 | 532 | < 0.1% |
| p103_114_185 | 526 | < 0.1% |
| Other values (9018) | 41588 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 11548196 | |
| 1 | 3943241 | 12.6% |
| 7 | 3873726 | 12.4% |
| 4 | 3869256 | 12.4% |
| a | 3839535 | 12.3% |
| b | 3839515 | 12.3% |
| _ | 96346 | 0.3% |
| P | 48092 | 0.2% |
| 2 | 34187 | 0.1% |
| 6 | 33655 | 0.1% |
| Other values (26) | 98270 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 23400120 | |
| Lowercase Letter | 7679380 | 24.6% |
| Connector Punctuation | 96346 | 0.3% |
| Uppercase Letter | 48173 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3839535 | |
| b | 3839515 | |
| e | 46 | < 0.1% |
| o | 39 | < 0.1% |
| r | 37 | < 0.1% |
| t | 35 | < 0.1% |
| u | 22 | < 0.1% |
| s | 16 | < 0.1% |
| k | 16 | < 0.1% |
| h | 15 | < 0.1% |
| Other values (13) | 104 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 11548196 | |
| 1 | 3943241 | 16.9% |
| 7 | 3873726 | 16.6% |
| 4 | 3869256 | 16.5% |
| 2 | 34187 | 0.1% |
| 6 | 33655 | 0.1% |
| 3 | 25672 | 0.1% |
| 0 | 25385 | 0.1% |
| 8 | 24952 | 0.1% |
| 9 | 21850 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 48092 | |
| Q | 81 | 0.2% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 96346 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23496466 | |
| Latin | 7727553 | 24.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3839535 | |
| b | 3839515 | |
| P | 48092 | 0.6% |
| Q | 81 | < 0.1% |
| e | 46 | < 0.1% |
| o | 39 | < 0.1% |
| r | 37 | < 0.1% |
| t | 35 | < 0.1% |
| u | 22 | < 0.1% |
| s | 16 | < 0.1% |
| Other values (15) | 135 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 5 | 11548196 | |
| 1 | 3943241 | 16.8% |
| 7 | 3873726 | 16.5% |
| 4 | 3869256 | 16.5% |
| _ | 96346 | 0.4% |
| 2 | 34187 | 0.1% |
| 6 | 33655 | 0.1% |
| 3 | 25672 | 0.1% |
| 0 | 25385 | 0.1% |
| 8 | 24952 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31224019 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 11548196 | |
| 1 | 3943241 | 12.6% |
| 7 | 3873726 | 12.4% |
| 4 | 3869256 | 12.4% |
| a | 3839535 | 12.3% |
| b | 3839515 | 12.3% |
| _ | 96346 | 0.3% |
| P | 48092 | 0.2% |
| 2 | 34187 | 0.1% |
| 6 | 33655 | 0.1% |
| Other values (26) | 98270 | 0.3% |
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.7 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 8 |
| Mean length | 8.667440306 |
| Min length | 8 |
Characters and Unicode
| Total characters | 33696269 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | a55475b1 |
|---|---|
| 2nd row | a55475b1 |
| 3rd row | P94_109_143 |
| 4th row | a55475b1 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 2848762 | |
| p94_109_143 | 524764 | 13.5% |
| p99_56_166 | 358720 | 9.2% |
| p45_84_106 | 71489 | 1.8% |
| p198_131_9 | 65648 | 1.7% |
| p30_86_84 | 5808 | 0.1% |
| p52_67_90 | 3027 | 0.1% |
| p48_22_32 | 2793 | 0.1% |
| p196_88_176 | 2112 | 0.1% |
| p121_60_164 | 1636 | < 0.1% |
| Other values (8) | 2925 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 8980061 | |
| 1 | 4540393 | |
| 4 | 4053100 | |
| 7 | 2855155 | 8.5% |
| a | 2848762 | 8.5% |
| b | 2848762 | 8.5% |
| _ | 2077844 | 6.2% |
| 9 | 1905359 | 5.7% |
| 6 | 1167599 | 3.5% |
| P | 1038922 | 3.1% |
| Other values (4) | 1380312 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 24881979 | |
| Lowercase Letter | 5697524 | 16.9% |
| Connector Punctuation | 2077844 | 6.2% |
| Uppercase Letter | 1038922 | 3.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 8980061 | |
| 1 | 4540393 | |
| 4 | 4053100 | |
| 7 | 2855155 | 11.5% |
| 9 | 1905359 | 7.7% |
| 6 | 1167599 | 4.7% |
| 0 | 607651 | 2.4% |
| 3 | 599414 | 2.4% |
| 8 | 157338 | 0.6% |
| 2 | 15909 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2848762 | |
| b | 2848762 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2077844 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1038922 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26959823 | |
| Latin | 6736446 | 20.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 8980061 | |
| 1 | 4540393 | |
| 4 | 4053100 | |
| 7 | 2855155 | 10.6% |
| _ | 2077844 | 7.7% |
| 9 | 1905359 | 7.1% |
| 6 | 1167599 | 4.3% |
| 0 | 607651 | 2.3% |
| 3 | 599414 | 2.2% |
| 8 | 157338 | 0.6% |
Latin
| Value | Count | Frequency (%) |
| a | 2848762 | |
| b | 2848762 | |
| P | 1038922 | 15.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33696269 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 8980061 | |
| 1 | 4540393 | |
| 4 | 4053100 | |
| 7 | 2855155 | 8.5% |
| a | 2848762 | 8.5% |
| b | 2848762 | 8.5% |
| _ | 2077844 | 6.2% |
| 9 | 1905359 | 5.7% |
| 6 | 1167599 | 3.5% |
| P | 1038922 | 3.1% |
| Other values (4) | 1380312 | 4.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.7 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 8 |
| Mean length | 8.610769548 |
| Min length | 8 |
Characters and Unicode
| Total characters | 33475951 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | a55475b1 |
|---|---|
| 2nd row | a55475b1 |
| 3rd row | a55475b1 |
| 4th row | a55475b1 |
| 5th row | a55475b1 |
| Value | Count | Frequency (%) |
| a55475b1 | 3062831 | |
| p94_109_143 | 762661 | 19.6% |
| p30_86_84 | 29604 | 0.8% |
| p52_67_90 | 12511 | 0.3% |
| p69_72_116 | 7443 | 0.2% |
| p129_162_80 | 7216 | 0.2% |
| p84_14_61 | 2946 | 0.1% |
| p64_121_167 | 897 | < 0.1% |
| p5_143_178 | 635 | < 0.1% |
| p19_25_34 | 622 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 9202897 | |
| 1 | 4628582 | |
| 4 | 4625803 | |
| 7 | 3084317 | 9.2% |
| a | 3062831 | 9.1% |
| b | 3062831 | 9.1% |
| _ | 1649706 | 4.9% |
| 9 | 1553114 | 4.6% |
| P | 824853 | 2.5% |
| 0 | 812310 | 2.4% |
| Other values (4) | 968707 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 24875730 | |
| Lowercase Letter | 6125662 | 18.3% |
| Connector Punctuation | 1649706 | 4.9% |
| Uppercase Letter | 824853 | 2.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 9202897 | |
| 1 | 4628582 | |
| 4 | 4625803 | |
| 7 | 3084317 | 12.4% |
| 9 | 1553114 | 6.2% |
| 0 | 812310 | 3.3% |
| 3 | 793840 | 3.2% |
| 8 | 70005 | 0.3% |
| 6 | 68957 | 0.3% |
| 2 | 35905 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3062831 | |
| b | 3062831 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1649706 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 824853 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26525436 | |
| Latin | 6950515 | 20.8% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 9202897 | |
| 1 | 4628582 | |
| 4 | 4625803 | |
| 7 | 3084317 | 11.6% |
| _ | 1649706 | 6.2% |
| 9 | 1553114 | 5.9% |
| 0 | 812310 | 3.1% |
| 3 | 793840 | 3.0% |
| 8 | 70005 | 0.3% |
| 6 | 68957 | 0.3% |
Latin
| Value | Count | Frequency (%) |
| a | 3062831 | |
| b | 3062831 | |
| P | 824853 | 11.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33475951 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 9202897 | |
| 1 | 4628582 | |
| 4 | 4625803 | |
| 7 | 3084317 | 9.2% |
| a | 3062831 | 9.1% |
| b | 3062831 | 9.1% |
| _ | 1649706 | 4.9% |
| 9 | 1553114 | 4.6% |
| P | 824853 | 2.5% |
| 0 | 812310 | 2.4% |
| Other values (4) | 968707 | 2.9% |
revolvingaccount_394A
Real number (ℝ)
MISSING 
| Distinct | 60659 |
|---|---|
| Distinct (%) | 38.7% |
| Missing | 3731033 |
| Missing (%) | 96.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 740354832.4 |
| Minimum | 540342400 |
|---|---|
| Maximum | 780865400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 540342400 |
|---|---|
| 5-th percentile | 561029650 |
| Q1 | 740688450 |
| median | 760248700 |
| Q3 | 760724950 |
| 95-th percentile | 780564600 |
| Maximum | 780865400 |
| Range | 240523000 |
| Interquartile range (IQR) | 20036500 |
Descriptive statistics
| Standard deviation | 54986858.03 |
|---|---|
| Coefficient of variation (CV) | 0.07427095174 |
| Kurtosis | 5.600146409 |
| Mean | 740354832.4 |
| Median Absolute Deviation (MAD) | 19415740 |
| Skewness | -2.485268947 |
| Sum | 1.159773249 × 1014 |
| Variance | 3.023554556 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 760540100 | 35 | < 0.1% |
| 742529500 | 31 | < 0.1% |
| 760482240 | 30 | < 0.1% |
| 760434100 | 27 | < 0.1% |
| 760635840 | 26 | < 0.1% |
| 760467400 | 25 | < 0.1% |
| 760470700 | 25 | < 0.1% |
| 760540350 | 24 | < 0.1% |
| 760558800 | 24 | < 0.1% |
| 760447170 | 23 | < 0.1% |
| Other values (60649) | 156381 | 4.0% |
| (Missing) | 3731033 |
| Value | Count | Frequency (%) |
| 540342400 | 3 | |
| 540342460 | 7 | |
| 540342500 | 5 | |
| 540342600 | 4 | |
| 540342660 | 3 |
| Value | Count | Frequency (%) |
| 780865400 | 1 | |
| 780865200 | 1 | |
| 780864800 | 2 | |
| 780864700 | 2 | |
| 780864260 | 1 |
status_219L
Text
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 35 |
| Missing (%) | < 0.1% |
| Memory size | 29.7 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3887649 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | D |
|---|---|
| 2nd row | D |
| 3rd row | D |
| 4th row | T |
| 5th row | T |
| Value | Count | Frequency (%) |
| k | 1605077 | |
| d | 1563834 | |
| a | 431299 | 11.1% |
| t | 263947 | 6.8% |
| n | 15668 | 0.4% |
| q | 4766 | 0.1% |
| s | 2265 | 0.1% |
| l | 470 | < 0.1% |
| h | 276 | < 0.1% |
| p | 39 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| K | 1605077 | |
| D | 1563834 | |
| A | 431299 | 11.1% |
| T | 263947 | 6.8% |
| N | 15668 | 0.4% |
| Q | 4766 | 0.1% |
| S | 2265 | 0.1% |
| L | 470 | < 0.1% |
| H | 276 | < 0.1% |
| P | 39 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3887649 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 1605077 | |
| D | 1563834 | |
| A | 431299 | 11.1% |
| T | 263947 | 6.8% |
| N | 15668 | 0.4% |
| Q | 4766 | 0.1% |
| S | 2265 | 0.1% |
| L | 470 | < 0.1% |
| H | 276 | < 0.1% |
| P | 39 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3887649 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| K | 1605077 | |
| D | 1563834 | |
| A | 431299 | 11.1% |
| T | 263947 | 6.8% |
| N | 15668 | 0.4% |
| Q | 4766 | 0.1% |
| S | 2265 | 0.1% |
| L | 470 | < 0.1% |
| H | 276 | < 0.1% |
| P | 39 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3887649 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| K | 1605077 | |
| D | 1563834 | |
| A | 431299 | 11.1% |
| T | 263947 | 6.8% |
| N | 15668 | 0.4% |
| Q | 4766 | 0.1% |
| S | 2265 | 0.1% |
| L | 470 | < 0.1% |
| H | 276 | < 0.1% |
| P | 39 | < 0.1% |
tenor_203L
Real number (ℝ)
MISSING 
| Distinct | 57 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 312833 |
| Missing (%) | 8.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.78210253 |
| Minimum | 3 |
|---|---|
| Maximum | 62 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.7 MiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 6 |
| median | 12 |
| Q3 | 24 |
| 95-th percentile | 36 |
| Maximum | 62 |
| Range | 59 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 10.46206069 |
|---|---|
| Coefficient of variation (CV) | 0.662906648 |
| Kurtosis | 1.294429223 |
| Mean | 15.78210253 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.219734053 |
| Sum | 56418665 |
| Variance | 109.4547138 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 883553 | |
| 6 | 589676 | |
| 24 | 562114 | |
| 18 | 296861 | 7.6% |
| 36 | 190785 | 4.9% |
| 3 | 188014 | 4.8% |
| 16 | 133452 | 3.4% |
| 48 | 102226 | 2.6% |
| 9 | 78979 | 2.0% |
| 10 | 75231 | 1.9% |
| Other values (47) | 473960 | |
| (Missing) | 312833 | 8.0% |
| Value | Count | Frequency (%) |
| 3 | 188014 | 4.8% |
| 4 | 73219 | 1.9% |
| 5 | 43429 | 1.1% |
| 6 | 589676 | |
| 7 | 7541 | 0.2% |
| Value | Count | Frequency (%) |
| 62 | 1 | < 0.1% |
| 61 | 3 | < 0.1% |
| 60 | 3148 | |
| 58 | 46 | < 0.1% |
| 56 | 23 | < 0.1% |