Dataset statistics
Number of variables | 41 |
---|---|
Number of observations | 3887684 |
Missing cells | 49311059 |
Missing cells (%) | 30.9% |
Total size in memory | 1.2 GiB |
Average record size in memory | 328.0 B |
Variable types
Numeric | 20 |
---|---|
Text | 19 |
Boolean | 2 |
isbidproduct_390L is highly imbalanced (68.7%) | Imbalance |
annuity_853A has 155851 (4.0%) missing values | Missing |
approvaldate_319D has 1766021 (45.4%) missing values | Missing |
byoccupationinc_3656910L has 2896024 (74.5%) missing values | Missing |
childnum_21L has 1953893 (50.3%) missing values | Missing |
credacc_actualbalance_314A has 3719506 (95.7%) missing values | Missing |
credacc_credlmt_575A has 119070 (3.1%) missing values | Missing |
credacc_maxhisbal_375A has 3719506 (95.7%) missing values | Missing |
credacc_minhisbal_90A has 3719506 (95.7%) missing values | Missing |
credacc_status_367L has 3719506 (95.7%) missing values | Missing |
credacc_transactions_402L has 3719506 (95.7%) missing values | Missing |
credamount_590A has 123329 (3.2%) missing values | Missing |
credtype_587L has 123329 (3.2%) missing values | Missing |
currdebt_94A has 1270377 (32.7%) missing values | Missing |
dateactivated_425D has 1844702 (47.4%) missing values | Missing |
downpmt_134A has 123329 (3.2%) missing values | Missing |
dtlastpmt_581D has 2860375 (73.6%) missing values | Missing |
dtlastpmtallstes_3545839D has 2434155 (62.6%) missing values | Missing |
employedfrom_700D has 2180869 (56.1%) missing values | Missing |
familystate_726L has 1245201 (32.0%) missing values | Missing |
firstnonzeroinstldate_307D has 365175 (9.4%) missing values | Missing |
inittransactioncode_279L has 123329 (3.2%) missing values | Missing |
isdebitcard_527L has 3637550 (93.6%) missing values | Missing |
maxdpdtolerance_577P has 1817378 (46.7%) missing values | Missing |
outstandingdebt_522A has 1277922 (32.9%) missing values | Missing |
pmtnum_8L has 312833 (8.0%) missing values | Missing |
revolvingaccount_394A has 3731033 (96.0%) missing values | Missing |
tenor_203L has 312833 (8.0%) missing values | Missing |
actualdpd_943P is highly skewed (γ1 = 716.1410421) | Skewed |
credacc_maxhisbal_375A is highly skewed (γ1 = 154.6093224) | Skewed |
actualdpd_943P has 3882797 (99.9%) zeros | Zeros |
annuity_853A has 225443 (5.8%) zeros | Zeros |
byoccupationinc_3656910L has 63137 (1.6%) zeros | Zeros |
childnum_21L has 1054010 (27.1%) zeros | Zeros |
credacc_credlmt_575A has 3470494 (89.3%) zeros | Zeros |
credacc_maxhisbal_375A has 98392 (2.5%) zeros | Zeros |
credacc_minhisbal_90A has 100911 (2.6%) zeros | Zeros |
credacc_transactions_402L has 150233 (3.9%) zeros | Zeros |
credamount_590A has 42592 (1.1%) zeros | Zeros |
currdebt_94A has 2238778 (57.6%) zeros | Zeros |
downpmt_134A has 3381858 (87.0%) zeros | Zeros |
maxdpdtolerance_577P has 1527003 (39.3%) zeros | Zeros |
num_group1 has 782997 (20.1%) zeros | Zeros |
outstandingdebt_522A has 2229538 (57.3%) zeros | Zeros |
Reproduction
Analysis started | 2024-02-13 19:36:08.396202 |
---|---|
Analysis finished | 2024-02-13 19:36:34.918128 |
Duration | 26.52 seconds |
Software version | ydata-profiling vv4.6.4 |
Download configuration | config.json |
case_id
Real number (ℝ)
Distinct | 782997 |
---|---|
Distinct (%) | 20.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1397916.185 |
Minimum | 2 |
---|---|
Maximum | 2651092 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 123162 |
Q1 | 1251506 |
median | 1451959 |
Q3 | 1641585 |
95-th percentile | 2617890 |
Maximum | 2651092 |
Range | 2651090 |
Interquartile range (IQR) | 390079 |
Descriptive statistics
Standard deviation | 760159.4198 |
---|---|
Coefficient of variation (CV) | 0.5437803984 |
Kurtosis | -0.5110858461 |
Mean | 1397916.185 |
Median Absolute Deviation (MAD) | 194660 |
Skewness | -0.1345953203 |
Sum | 5.434656384 × 1012 |
Variance | 5.778423435 × 1011 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
1451156 | 20 | < 0.1% |
149924 | 20 | < 0.1% |
1617749 | 20 | < 0.1% |
149841 | 20 | < 0.1% |
149843 | 20 | < 0.1% |
2538422 | 20 | < 0.1% |
177936 | 20 | < 0.1% |
2538424 | 20 | < 0.1% |
111866 | 20 | < 0.1% |
2588419 | 20 | < 0.1% |
Other values (782987) | 3887484 |
Value | Count | Frequency (%) |
2 | 2 | |
3 | 1 | < 0.1% |
4 | 1 | < 0.1% |
5 | 1 | < 0.1% |
6 | 3 |
Value | Count | Frequency (%) |
2651092 | 8 | |
2651091 | 3 | < 0.1% |
2651090 | 2 | < 0.1% |
2651089 | 12 | |
2651088 | 13 |
actualdpd_943P
Real number (ℝ)
SKEWED
  ZEROS
 
Distinct | 101 |
---|---|
Distinct (%) | < 0.1% |
Missing | 2234 |
Missing (%) | 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.01056299785 |
Minimum | 0 |
---|---|
Maximum | 3676 |
Zeros | 3882797 |
Zeros (%) | 99.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 3676 |
Range | 3676 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 3.75427732 |
---|---|
Coefficient of variation (CV) | 355.417787 |
Kurtosis | 586099.5401 |
Mean | 0.01056299785 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 716.1410421 |
Sum | 41042 |
Variance | 14.0945982 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 3882797 | |
1 | 870 | < 0.1% |
2 | 530 | < 0.1% |
3 | 338 | < 0.1% |
4 | 174 | < 0.1% |
5 | 118 | < 0.1% |
6 | 85 | < 0.1% |
7 | 64 | < 0.1% |
8 | 58 | < 0.1% |
9 | 46 | < 0.1% |
Other values (91) | 370 | < 0.1% |
(Missing) | 2234 | 0.1% |
Value | Count | Frequency (%) |
0 | 3882797 | |
1 | 870 | < 0.1% |
2 | 530 | < 0.1% |
3 | 338 | < 0.1% |
4 | 174 | < 0.1% |
Value | Count | Frequency (%) |
3676 | 1 | |
3661 | 1 | |
2119 | 1 | |
2107 | 1 | |
1957 | 1 |
annuity_853A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 80309 |
---|---|
Distinct (%) | 2.2% |
Missing | 155851 |
Missing (%) | 4.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3413.166466 |
Minimum | 0 |
---|---|
Maximum | 105130.2 |
Zeros | 225443 |
Zeros (%) | 5.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1710.6 |
median | 2761.2 |
Q3 | 4396.4 |
95-th percentile | 8325.2 |
Maximum | 105130.2 |
Range | 105130.2 |
Interquartile range (IQR) | 2685.8 |
Descriptive statistics
Standard deviation | 2828.269 |
---|---|
Coefficient of variation (CV) | 0.8286349432 |
Kurtosis | 32.4088215 |
Mean | 3413.166466 |
Median Absolute Deviation (MAD) | 1247.4 |
Skewness | 3.37853393 |
Sum | 1.273736725 × 1010 |
Variance | 7999105.539 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 225443 | 5.8% |
1580 | 3773 | 0.1% |
1508 | 3156 | 0.1% |
2716 | 2434 | 0.1% |
2558.4001 | 1736 | < 0.1% |
2103 | 1685 | < 0.1% |
1668 | 1676 | < 0.1% |
2000 | 1666 | < 0.1% |
3837.4001 | 1625 | < 0.1% |
3840 | 1609 | < 0.1% |
Other values (80299) | 3487030 | |
(Missing) | 155851 | 4.0% |
Value | Count | Frequency (%) |
0 | 225443 | |
1.8000001 | 1 | < 0.1% |
7.6 | 2 | < 0.1% |
8.2 | 2 | < 0.1% |
10.400001 | 1 | < 0.1% |
Value | Count | Frequency (%) |
105130.2 | 1 | |
103000 | 1 | |
100061.4 | 1 | |
99837.4 | 2 | |
99646.6 | 1 |
MISSING
 
Distinct | 5105 |
---|---|
Distinct (%) | 0.2% |
Missing | 1766021 |
Missing (%) | 45.4% |
Memory size | 29.7 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 21216630 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 3 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2019-01-11 |
---|---|
2nd row | 2018-10-11 |
3rd row | 2018-12-31 |
4th row | 2018-11-02 |
5th row | 2018-12-11 |
Value | Count | Frequency (%) |
2018-12-07 | 2982 | 0.1% |
2018-01-12 | 2867 | 0.1% |
2018-01-13 | 2720 | 0.1% |
2018-12-08 | 2567 | 0.1% |
2019-01-13 | 2387 | 0.1% |
2018-12-29 | 2273 | 0.1% |
2018-07-27 | 2215 | 0.1% |
2018-12-28 | 2193 | 0.1% |
2018-11-24 | 2161 | 0.1% |
2017-12-02 | 2126 | 0.1% |
Other values (5095) | 2097172 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 4872977 | |
- | 4243326 | |
1 | 3932682 | |
2 | 3489665 | |
8 | 964186 | 4.5% |
7 | 774209 | 3.6% |
9 | 687236 | 3.2% |
3 | 618144 | 2.9% |
6 | 606097 | 2.9% |
5 | 522667 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 16973304 | |
Dash Punctuation | 4243326 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 4872977 | |
1 | 3932682 | |
2 | 3489665 | |
8 | 964186 | 5.7% |
7 | 774209 | 4.6% |
9 | 687236 | 4.0% |
3 | 618144 | 3.6% |
6 | 606097 | 3.6% |
5 | 522667 | 3.1% |
4 | 505441 | 3.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4243326 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 21216630 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 4872977 | |
- | 4243326 | |
1 | 3932682 | |
2 | 3489665 | |
8 | 964186 | 4.5% |
7 | 774209 | 3.6% |
9 | 687236 | 3.2% |
3 | 618144 | 2.9% |
6 | 606097 | 2.9% |
5 | 522667 | 2.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 21216630 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 4872977 | |
- | 4243326 | |
1 | 3932682 | |
2 | 3489665 | |
8 | 964186 | 4.5% |
7 | 774209 | 3.6% |
9 | 687236 | 3.2% |
3 | 618144 | 2.9% |
6 | 606097 | 2.9% |
5 | 522667 | 2.5% |
byoccupationinc_3656910L
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 24298 |
---|---|
Distinct (%) | 2.5% |
Missing | 2896024 |
Missing (%) | 74.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 19796.48403 |
Minimum | 0 |
---|---|
Maximum | 200000 |
Zeros | 63137 |
Zeros (%) | 1.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 5000 |
Q3 | 30000 |
95-th percentile | 71316.25 |
Maximum | 200000 |
Range | 200000 |
Interquartile range (IQR) | 29999 |
Descriptive statistics
Standard deviation | 30687.65251 |
---|---|
Coefficient of variation (CV) | 1.550156708 |
Kurtosis | 10.76351177 |
Mean | 19796.48403 |
Median Absolute Deviation (MAD) | 5000 |
Skewness | 2.799320872 |
Sum | 1.963138135 × 1010 |
Variance | 941732016.4 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 424404 | 10.9% |
0 | 63137 | 1.6% |
15000 | 50750 | 1.3% |
20000 | 42942 | 1.1% |
30000 | 38439 | 1.0% |
25000 | 34911 | 0.9% |
50000 | 33424 | 0.9% |
10000 | 23527 | 0.6% |
40000 | 19665 | 0.5% |
35000 | 19252 | 0.5% |
Other values (24288) | 241209 | 6.2% |
(Missing) | 2896024 |
Value | Count | Frequency (%) |
0 | 63137 | 1.6% |
1 | 424404 | |
2 | 9 | < 0.1% |
3 | 1 | < 0.1% |
4 | 3 | < 0.1% |
Value | Count | Frequency (%) |
200000 | 7073 | |
199000 | 12 | < 0.1% |
198600 | 3 | < 0.1% |
198000 | 15 | < 0.1% |
197000 | 9 | < 0.1% |
Distinct | 71 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 29.7 MiB |
Length
Max length | 12 |
---|---|
Median length | 8 |
Mean length | 8.84928379 |
Min length | 8 |
Characters and Unicode
Total characters | 34403219 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 4 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | a55475b1 |
---|---|
2nd row | a55475b1 |
3rd row | P94_109_143 |
4th row | P24_27_36 |
5th row | P85_114_140 |
Value | Count | Frequency (%) |
a55475b1 | 2729942 | |
p94_109_143 | 848995 | 21.8% |
p180_60_137 | 43051 | 1.1% |
p73_130_169 | 42479 | 1.1% |
p198_89_166 | 37300 | 1.0% |
p85_114_140 | 34868 | 0.9% |
p30_86_84 | 31802 | 0.8% |
p24_27_36 | 16040 | 0.4% |
p141_135_146 | 15512 | 0.4% |
p52_67_90 | 13724 | 0.4% |
Other values (61) | 73971 | 1.9% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 8288778 | |
1 | 5002986 | |
4 | 4590459 | |
7 | 2870275 | 8.3% |
a | 2729942 | 7.9% |
b | 2729942 | 7.9% |
_ | 2315484 | 6.7% |
9 | 1874613 | 5.4% |
P | 1157742 | 3.4% |
0 | 1109335 | 3.2% |
Other values (4) | 1733663 | 5.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 25470109 | |
Lowercase Letter | 5459884 | 15.9% |
Connector Punctuation | 2315484 | 6.7% |
Uppercase Letter | 1157742 | 3.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 8288778 | |
1 | 5002986 | |
4 | 4590459 | |
7 | 2870275 | 11.3% |
9 | 1874613 | 7.4% |
0 | 1109335 | 4.4% |
3 | 1084801 | 4.3% |
6 | 302589 | 1.2% |
8 | 252522 | 1.0% |
2 | 93751 | 0.4% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 2729942 | |
b | 2729942 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2315484 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 1157742 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 27785593 | |
Latin | 6617626 | 19.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 8288778 | |
1 | 5002986 | |
4 | 4590459 | |
7 | 2870275 | 10.3% |
_ | 2315484 | 8.3% |
9 | 1874613 | 6.7% |
0 | 1109335 | 4.0% |
3 | 1084801 | 3.9% |
6 | 302589 | 1.1% |
8 | 252522 | 0.9% |
Latin
Value | Count | Frequency (%) |
a | 2729942 | |
b | 2729942 | |
P | 1157742 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 34403219 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 8288778 | |
1 | 5002986 | |
4 | 4590459 | |
7 | 2870275 | 8.3% |
a | 2729942 | 7.9% |
b | 2729942 | 7.9% |
_ | 2315484 | 6.7% |
9 | 1874613 | 5.4% |
P | 1157742 | 3.4% |
0 | 1109335 | 3.2% |
Other values (4) | 1733663 | 5.0% |
childnum_21L
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 20 |
---|---|
Distinct (%) | < 0.1% |
Missing | 1953893 |
Missing (%) | 50.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.8434701578 |
Minimum | 0 |
---|---|
Maximum | 20 |
Zeros | 1054010 |
Zeros (%) | 27.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 3 |
Maximum | 20 |
Range | 20 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 1.209425285 |
---|---|
Coefficient of variation (CV) | 1.433868494 |
Kurtosis | 6.155115887 |
Mean | 0.8434701578 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.989613502 |
Sum | 1631095 |
Variance | 1.46270952 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1054010 | |
1 | 430613 | 11.1% |
2 | 277280 | 7.1% |
3 | 99088 | 2.5% |
4 | 39691 | 1.0% |
5 | 19363 | 0.5% |
6 | 8144 | 0.2% |
7 | 3028 | 0.1% |
8 | 1401 | < 0.1% |
9 | 539 | < 0.1% |
Other values (10) | 634 | < 0.1% |
(Missing) | 1953893 |
Value | Count | Frequency (%) |
0 | 1054010 | |
1 | 430613 | |
2 | 277280 | 7.1% |
3 | 99088 | 2.5% |
4 | 39691 | 1.0% |
Value | Count | Frequency (%) |
20 | 12 | |
18 | 5 | < 0.1% |
17 | 4 | < 0.1% |
16 | 2 | < 0.1% |
15 | 18 |
Distinct | 5107 |
---|---|
Distinct (%) | 0.1% |
Missing | 35 |
Missing (%) | < 0.1% |
Memory size | 29.7 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 38876490 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 2 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2013-04-03 |
---|---|
2nd row | 2013-04-03 |
3rd row | 2019-01-07 |
4th row | 2019-01-08 |
5th row | 2019-01-16 |
Value | Count | Frequency (%) |
2018-12-07 | 4961 | 0.1% |
2018-01-12 | 4168 | 0.1% |
2018-12-08 | 3986 | 0.1% |
2018-12-28 | 3940 | 0.1% |
2019-01-02 | 3859 | 0.1% |
2018-01-13 | 3799 | 0.1% |
2019-01-04 | 3796 | 0.1% |
2018-07-27 | 3692 | 0.1% |
2019-01-13 | 3646 | 0.1% |
2019-01-11 | 3618 | 0.1% |
Other values (5097) | 3848184 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 8914984 | |
- | 7775298 | |
1 | 7156075 | |
2 | 6398126 | |
8 | 1706927 | 4.4% |
7 | 1362854 | 3.5% |
9 | 1346334 | 3.5% |
3 | 1148470 | 3.0% |
6 | 1091746 | 2.8% |
4 | 994262 | 2.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 31101192 | |
Dash Punctuation | 7775298 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 8914984 | |
1 | 7156075 | |
2 | 6398126 | |
8 | 1706927 | 5.5% |
7 | 1362854 | 4.4% |
9 | 1346334 | 4.3% |
3 | 1148470 | 3.7% |
6 | 1091746 | 3.5% |
4 | 994262 | 3.2% |
5 | 981414 | 3.2% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 7775298 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 38876490 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 8914984 | |
- | 7775298 | |
1 | 7156075 | |
2 | 6398126 | |
8 | 1706927 | 4.4% |
7 | 1362854 | 3.5% |
9 | 1346334 | 3.5% |
3 | 1148470 | 3.0% |
6 | 1091746 | 2.8% |
4 | 994262 | 2.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 38876490 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8914984 | |
- | 7775298 | |
1 | 7156075 | |
2 | 6398126 | |
8 | 1706927 | 4.4% |
7 | 1362854 | 3.5% |
9 | 1346334 | 3.5% |
3 | 1148470 | 3.0% |
6 | 1091746 | 2.8% |
4 | 994262 | 2.6% |
credacc_actualbalance_314A
Real number (ℝ)
MISSING
 
Distinct | 57143 |
---|---|
Distinct (%) | 34.0% |
Missing | 3719506 |
Missing (%) | 95.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20269.5803 |
Minimum | -114086 |
---|---|
Maximum | 2540730 |
Zeros | 37132 |
Zeros (%) | 1.0% |
Negative | 365 |
Negative (%) | < 0.1% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | -114086 |
---|---|
5-th percentile | 0 |
Q1 | 3.0305 |
median | 12585 |
Q3 | 30446 |
95-th percentile | 76163.788 |
Maximum | 2540730 |
Range | 2654816 |
Interquartile range (IQR) | 30442.9695 |
Descriptive statistics
Standard deviation | 26002.78165 |
---|---|
Coefficient of variation (CV) | 1.282847561 |
Kurtosis | 528.4045432 |
Mean | 20269.5803 |
Median Absolute Deviation (MAD) | 12585 |
Skewness | 6.9851974 |
Sum | 3408897475 |
Variance | 676144653.7 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 37132 | 1.0% |
100000 | 2944 | 0.1% |
2 | 896 | < 0.1% |
12000 | 808 | < 0.1% |
42640 | 748 | < 0.1% |
0.2 | 735 | < 0.1% |
30000 | 642 | < 0.1% |
20300 | 539 | < 0.1% |
4 | 520 | < 0.1% |
40600 | 499 | < 0.1% |
Other values (57133) | 122715 | 3.2% |
(Missing) | 3719506 |
Value | Count | Frequency (%) |
-114086 | 1 | |
-57634.06 | 1 | |
-52432.37 | 1 | |
-47822.348 | 1 | |
-36996.402 | 1 |
Value | Count | Frequency (%) |
2540730 | 1 | |
519966 | 1 | |
428026.25 | 1 | |
300000 | 1 | |
264004.8 | 2 |
credacc_credlmt_575A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 37849 |
---|---|
Distinct (%) | 1.0% |
Missing | 119070 |
Missing (%) | 3.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3254.676597 |
Minimum | 0 |
---|---|
Maximum | 400000 |
Zeros | 3470494 |
Zeros (%) | 89.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 23294 |
Maximum | 400000 |
Range | 400000 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 14061.34948 |
---|---|
Coefficient of variation (CV) | 4.320352288 |
Kurtosis | 36.98869737 |
Mean | 3254.676597 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.584726264 |
Sum | 1.226561979 × 1010 |
Variance | 197721549.3 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 3470494 | |
100000 | 32568 | 0.8% |
12000 | 13694 | 0.4% |
20000 | 6333 | 0.2% |
40000 | 6175 | 0.2% |
60000 | 4714 | 0.1% |
24000 | 3837 | 0.1% |
30000 | 3538 | 0.1% |
10000 | 3177 | 0.1% |
42640 | 2828 | 0.1% |
Other values (37839) | 221256 | 5.7% |
(Missing) | 119070 | 3.1% |
Value | Count | Frequency (%) |
0 | 3470494 | |
0.2 | 1275 | < 0.1% |
0.6 | 2 | < 0.1% |
3.6000001 | 2 | < 0.1% |
20 | 2 | < 0.1% |
Value | Count | Frequency (%) |
400000 | 14 | |
300000 | 14 | |
270400 | 1 | < 0.1% |
245200 | 1 | < 0.1% |
240000 | 1 | < 0.1% |
credacc_maxhisbal_375A
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 43046 |
---|---|
Distinct (%) | 25.6% |
Missing | 3719506 |
Missing (%) | 95.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -3288.887246 |
Minimum | -196108.17 |
---|---|
Maximum | 7988198.5 |
Zeros | 98392 |
Zeros (%) | 2.5% |
Negative | 32587 |
Negative (%) | 0.8% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | -196108.17 |
---|---|
5-th percentile | -29255.55 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 588.9267 |
Maximum | 7988198.5 |
Range | 8184306.67 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 28086.11335 |
---|---|
Coefficient of variation (CV) | -8.539700891 |
Kurtosis | 40843.88705 |
Mean | -3288.887246 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 154.6093224 |
Sum | -553118479.3 |
Variance | 788829762.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 98392 | 2.5% |
2 | 2042 | 0.1% |
4 | 1072 | < 0.1% |
6 | 560 | < 0.1% |
22 | 500 | < 0.1% |
10 | 343 | < 0.1% |
24 | 298 | < 0.1% |
8 | 285 | < 0.1% |
20 | 238 | < 0.1% |
42 | 213 | < 0.1% |
Other values (43036) | 64235 | 1.7% |
(Missing) | 3719506 |
Value | Count | Frequency (%) |
-196108.17 | 2 | |
-192894.4 | 1 | |
-185260 | 1 | |
-183642.02 | 1 | |
-181545.2 | 1 |
Value | Count | Frequency (%) |
7988198.5 | 1 | |
3556000 | 1 | |
1900000 | 2 | |
1600000 | 1 | |
940000 | 1 |
credacc_minhisbal_90A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 42878 |
---|---|
Distinct (%) | 25.5% |
Missing | 3719506 |
Missing (%) | 95.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -6554.784203 |
Minimum | -206808.17 |
---|---|
Maximum | 199567 |
Zeros | 100911 |
Zeros (%) | 2.6% |
Negative | 43646 |
Negative (%) | 1.1% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | -206808.17 |
---|---|
5-th percentile | -40000 |
Q1 | -1042.079025 |
median | 0 |
Q3 | 0 |
95-th percentile | 55.8066 |
Maximum | 199567 |
Range | 406375.17 |
Interquartile range (IQR) | 1042.079025 |
Descriptive statistics
Standard deviation | 16888.86244 |
---|---|
Coefficient of variation (CV) | -2.576570321 |
Kurtosis | 17.9638962 |
Mean | -6554.784203 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -3.715506656 |
Sum | -1102370498 |
Variance | 285233674.5 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 100911 | 2.6% |
2 | 1383 | < 0.1% |
4 | 778 | < 0.1% |
6 | 427 | < 0.1% |
10 | 315 | < 0.1% |
22 | 309 | < 0.1% |
-10 | 242 | < 0.1% |
24 | 216 | < 0.1% |
8 | 210 | < 0.1% |
20 | 208 | < 0.1% |
Other values (42868) | 63179 | 1.6% |
(Missing) | 3719506 |
Value | Count | Frequency (%) |
-206808.17 | 2 | |
-200000 | 1 | |
-199998 | 1 | |
-199996 | 1 | |
-199994.4 | 1 |
Value | Count | Frequency (%) |
199567 | 2 | |
100000 | 1 | |
89859.59 | 1 | |
79000 | 1 | |
70000 | 1 |
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 3719506 |
Missing (%) | 95.7% |
Memory size | 29.7 MiB |
Value | Count | Frequency (%) |
ac | 90216 | |
cl | 61855 | |
ca | 14052 | 8.4% |
pcl | 1726 | 1.0% |
po | 282 | 0.2% |
cr | 47 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
C | 167896 | |
A | 104268 | |
L | 63581 | 18.8% |
P | 2008 | 0.6% |
O | 282 | 0.1% |
R | 47 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 338082 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
C | 167896 | |
A | 104268 | |
L | 63581 | 18.8% |
P | 2008 | 0.6% |
O | 282 | 0.1% |
R | 47 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 338082 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
C | 167896 | |
A | 104268 | |
L | 63581 | 18.8% |
P | 2008 | 0.6% |
O | 282 | 0.1% |
R | 47 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 338082 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
C | 167896 | |
A | 104268 | |
L | 63581 | 18.8% |
P | 2008 | 0.6% |
O | 282 | 0.1% |
R | 47 | < 0.1% |
credacc_transactions_402L
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 86 |
---|---|
Distinct (%) | 0.1% |
Missing | 3719506 |
Missing (%) | 95.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.5221253672 |
Minimum | 0 |
---|---|
Maximum | 155 |
Zeros | 150233 |
Zeros (%) | 3.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 2 |
Maximum | 155 |
Range | 155 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2.951672329 |
---|---|
Coefficient of variation (CV) | 5.653186982 |
Kurtosis | 344.9259013 |
Mean | 0.5221253672 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 14.1469766 |
Sum | 87810 |
Variance | 8.712369536 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 150233 | 3.9% |
1 | 6693 | 0.2% |
2 | 2853 | 0.1% |
3 | 1893 | < 0.1% |
4 | 1252 | < 0.1% |
5 | 931 | < 0.1% |
6 | 730 | < 0.1% |
7 | 552 | < 0.1% |
8 | 411 | < 0.1% |
9 | 387 | < 0.1% |
Other values (76) | 2243 | 0.1% |
(Missing) | 3719506 |
Value | Count | Frequency (%) |
0 | 150233 | |
1 | 6693 | 0.2% |
2 | 2853 | 0.1% |
3 | 1893 | < 0.1% |
4 | 1252 | < 0.1% |
Value | Count | Frequency (%) |
155 | 2 | |
135 | 1 | |
126 | 1 | |
123 | 1 | |
110 | 1 |
credamount_590A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 201793 |
---|---|
Distinct (%) | 5.4% |
Missing | 123329 |
Missing (%) | 3.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38657.85509 |
Minimum | 0 |
---|---|
Maximum | 715392 |
Zeros | 42592 |
Zeros (%) | 1.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 5022 |
Q1 | 13998 |
median | 27000 |
Q3 | 50000 |
95-th percentile | 100000 |
Maximum | 715392 |
Range | 715392 |
Interquartile range (IQR) | 36002 |
Descriptive statistics
Standard deviation | 37544.33619 |
---|---|
Coefficient of variation (CV) | 0.9711955334 |
Kurtosis | 10.0483708 |
Mean | 38657.85509 |
Median Absolute Deviation (MAD) | 15020 |
Skewness | 2.478930874 |
Sum | 1.455218901 × 1011 |
Variance | 1409577180 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
60000 | 186103 | 4.8% |
100000 | 175966 | 4.5% |
40000 | 174743 | 4.5% |
20000 | 170088 | 4.4% |
30000 | 140432 | 3.6% |
50000 | 63461 | 1.6% |
10000 | 52190 | 1.3% |
24000 | 47919 | 1.2% |
80000 | 43464 | 1.1% |
0 | 42592 | 1.1% |
Other values (201783) | 2667397 | |
(Missing) | 123329 | 3.2% |
Value | Count | Frequency (%) |
0 | 42592 | |
0.2 | 675 | < 0.1% |
0.6 | 1 | < 0.1% |
3.6000001 | 1 | < 0.1% |
20 | 1 | < 0.1% |
Value | Count | Frequency (%) |
715392 | 2 | |
550000 | 1 | |
501422.22 | 2 | |
493000 | 1 | |
480665.1 | 2 |
credtype_587L
Text
MISSING
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 123329 |
Missing (%) | 3.2% |
Memory size | 29.7 MiB |
Value | Count | Frequency (%) |
col | 2035876 | |
cal | 1478343 | |
rel | 250136 | 6.6% |
Most occurring characters
Value | Count | Frequency (%) |
L | 3764355 | |
C | 3514219 | |
O | 2035876 | |
A | 1478343 | 13.1% |
R | 250136 | 2.2% |
E | 250136 | 2.2% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 11293065 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
L | 3764355 | |
C | 3514219 | |
O | 2035876 | |
A | 1478343 | 13.1% |
R | 250136 | 2.2% |
E | 250136 | 2.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 11293065 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
L | 3764355 | |
C | 3514219 | |
O | 2035876 | |
A | 1478343 | 13.1% |
R | 250136 | 2.2% |
E | 250136 | 2.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 11293065 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
L | 3764355 | |
C | 3514219 | |
O | 2035876 | |
A | 1478343 | 13.1% |
R | 250136 | 2.2% |
E | 250136 | 2.2% |
currdebt_94A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 346082 |
---|---|
Distinct (%) | 13.2% |
Missing | 1270377 |
Missing (%) | 32.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5229.100749 |
Minimum | 0 |
---|---|
Maximum | 507429.72 |
Zeros | 2238778 |
Zeros (%) | 57.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 35505.071 |
Maximum | 507429.72 |
Range | 507429.72 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 19278.42258 |
---|---|
Coefficient of variation (CV) | 3.686756769 |
Kurtosis | 47.65318341 |
Mean | 5229.100749 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.85585581 |
Sum | 1.368616199 × 1010 |
Variance | 371657577.3 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 2238778 | |
19998 | 91 | < 0.1% |
100000 | 91 | < 0.1% |
20000 | 85 | < 0.1% |
11998 | 84 | < 0.1% |
17998 | 82 | < 0.1% |
7998 | 80 | < 0.1% |
40000 | 77 | < 0.1% |
15998 | 75 | < 0.1% |
13998 | 75 | < 0.1% |
Other values (346072) | 377789 | 9.7% |
(Missing) | 1270377 |
Value | Count | Frequency (%) |
0 | 2238778 | |
0.002 | 1 | < 0.1% |
0.010000001 | 1 | < 0.1% |
0.020000001 | 1 | < 0.1% |
0.048 | 2 | < 0.1% |
Value | Count | Frequency (%) |
507429.72 | 1 | |
507040.06 | 1 | |
491492.1 | 1 | |
490718.6 | 1 | |
489944.25 | 1 |
MISSING
 
Distinct | 3939 |
---|---|
Distinct (%) | 0.2% |
Missing | 1844702 |
Missing (%) | 47.4% |
Memory size | 29.7 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 20429820 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 53 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2018-10-19 |
---|---|
2nd row | 2018-11-07 |
3rd row | 2018-12-28 |
4th row | 2018-10-09 |
5th row | 2018-11-16 |
Value | Count | Frequency (%) |
2019-01-31 | 3811 | 0.2% |
2018-10-17 | 3053 | 0.1% |
2018-10-04 | 3022 | 0.1% |
2019-01-11 | 2915 | 0.1% |
2018-05-29 | 2894 | 0.1% |
2019-01-09 | 2885 | 0.1% |
2019-01-10 | 2842 | 0.1% |
2018-11-14 | 2817 | 0.1% |
2019-01-30 | 2805 | 0.1% |
2018-09-12 | 2780 | 0.1% |
Other values (3929) | 2013158 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 4702739 | |
- | 4085964 | |
1 | 3822098 | |
2 | 3324977 | |
8 | 938945 | 4.6% |
7 | 742863 | 3.6% |
9 | 664418 | 3.3% |
6 | 587810 | 2.9% |
3 | 569050 | 2.8% |
5 | 503414 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 16343856 | |
Dash Punctuation | 4085964 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 4702739 | |
1 | 3822098 | |
2 | 3324977 | |
8 | 938945 | 5.7% |
7 | 742863 | 4.5% |
9 | 664418 | 4.1% |
6 | 587810 | 3.6% |
3 | 569050 | 3.5% |
5 | 503414 | 3.1% |
4 | 487542 | 3.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4085964 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 20429820 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 4702739 | |
- | 4085964 | |
1 | 3822098 | |
2 | 3324977 | |
8 | 938945 | 4.6% |
7 | 742863 | 3.6% |
9 | 664418 | 3.3% |
6 | 587810 | 2.9% |
3 | 569050 | 2.8% |
5 | 503414 | 2.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 20429820 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 4702739 | |
- | 4085964 | |
1 | 3822098 | |
2 | 3324977 | |
8 | 938945 | 4.6% |
7 | 742863 | 3.6% |
9 | 664418 | 3.3% |
6 | 587810 | 2.9% |
3 | 569050 | 2.8% |
5 | 503414 | 2.5% |
district_544M
Text
Distinct | 479 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 29.7 MiB |
Length
Max length | 12 |
---|---|
Median length | 11 |
Mean length | 10.43327107 |
Min length | 8 |
Characters and Unicode
Total characters | 40561261 |
---|---|
Distinct characters | 18 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 54 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | P136_108_173 |
---|---|
2nd row | P136_108_173 |
3rd row | P131_33_167 |
4th row | P194_82_174 |
5th row | P54_133_26 |
Value | Count | Frequency (%) |
a55475b1 | 388961 | 10.0% |
p131_33_167 | 190215 | 4.9% |
p123_6_84 | 160709 | 4.1% |
p197_47_166 | 137010 | 3.5% |
p204_99_158 | 114381 | 2.9% |
p98_137_111 | 94298 | 2.4% |
p62_144_102 | 86540 | 2.2% |
p159_143_123 | 85401 | 2.2% |
p147_21_170 | 80957 | 2.1% |
p178_112_160 | 71042 | 1.8% |
Other values (469) | 2478170 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 8872392 | |
_ | 6997446 | |
P | 3498722 | 8.6% |
7 | 3069372 | 7.6% |
5 | 3022413 | 7.5% |
4 | 2644775 | 6.5% |
6 | 2218257 | 5.5% |
3 | 2204920 | 5.4% |
2 | 2070115 | 5.1% |
9 | 1904803 | 4.7% |
Other values (8) | 4058046 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 29287166 | |
Connector Punctuation | 6997446 | 17.3% |
Uppercase Letter | 3498723 | 8.6% |
Lowercase Letter | 777926 | 1.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 8872392 | |
7 | 3069372 | 10.5% |
5 | 3022413 | 10.3% |
4 | 2644775 | 9.0% |
6 | 2218257 | 7.6% |
3 | 2204920 | 7.5% |
2 | 2070115 | 7.1% |
9 | 1904803 | 6.5% |
8 | 1893444 | 6.5% |
0 | 1386675 | 4.7% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 388961 | |
b | 388961 | |
e | 2 | < 0.1% |
m | 1 | < 0.1% |
t | 1 | < 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 3498722 | |
Q | 1 | < 0.1% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 6997446 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 36284612 | |
Latin | 4276649 | 10.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 8872392 | |
_ | 6997446 | |
7 | 3069372 | 8.5% |
5 | 3022413 | 8.3% |
4 | 2644775 | 7.3% |
6 | 2218257 | 6.1% |
3 | 2204920 | 6.1% |
2 | 2070115 | 5.7% |
9 | 1904803 | 5.2% |
8 | 1893444 | 5.2% |
Latin
Value | Count | Frequency (%) |
P | 3498722 | |
a | 388961 | 9.1% |
b | 388961 | 9.1% |
e | 2 | < 0.1% |
Q | 1 | < 0.1% |
m | 1 | < 0.1% |
t | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 40561261 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 8872392 | |
_ | 6997446 | |
P | 3498722 | 8.6% |
7 | 3069372 | 7.6% |
5 | 3022413 | 7.5% |
4 | 2644775 | 6.5% |
6 | 2218257 | 5.5% |
3 | 2204920 | 5.4% |
2 | 2070115 | 5.1% |
9 | 1904803 | 4.7% |
Other values (8) | 4058046 |
downpmt_134A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 18720 |
---|---|
Distinct (%) | 0.5% |
Missing | 123329 |
Missing (%) | 3.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 457.8898882 |
Minimum | 0 |
---|---|
Maximum | 320400 |
Zeros | 3381858 |
Zeros (%) | 87.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 2000 |
Maximum | 320400 |
Range | 320400 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2697.225764 |
---|---|
Coefficient of variation (CV) | 5.890555423 |
Kurtosis | 776.6842527 |
Mean | 457.8898882 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 17.93615961 |
Sum | 1723660090 |
Variance | 7275026.82 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 3381858 | |
2000 | 44574 | 1.1% |
4000 | 29703 | 0.8% |
1000 | 28357 | 0.7% |
6000 | 16262 | 0.4% |
200 | 13639 | 0.4% |
10000 | 13472 | 0.3% |
400 | 12662 | 0.3% |
3000 | 11986 | 0.3% |
8000 | 8796 | 0.2% |
Other values (18710) | 203046 | 5.2% |
(Missing) | 123329 | 3.2% |
Value | Count | Frequency (%) |
0 | 3381858 | |
0.2 | 390 | < 0.1% |
0.4 | 48 | < 0.1% |
0.6 | 124 | < 0.1% |
0.8 | 33 | < 0.1% |
Value | Count | Frequency (%) |
320400 | 1 | < 0.1% |
305200 | 1 | < 0.1% |
300000 | 2 | |
274998 | 3 | |
268134.4 | 1 | < 0.1% |
dtlastpmt_581D
Text
MISSING
 
Distinct | 2150 |
---|---|
Distinct (%) | 0.2% |
Missing | 2860375 |
Missing (%) | 73.6% |
Memory size | 29.7 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 10273090 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 214 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2019-01-10 |
---|---|
2nd row | 2019-01-03 |
3rd row | 2019-01-08 |
4th row | 2018-12-05 |
5th row | 2018-12-26 |
Value | Count | Frequency (%) |
2019-09-16 | 22398 | 2.2% |
2018-12-24 | 2210 | 0.2% |
2019-01-11 | 2176 | 0.2% |
2018-07-23 | 2052 | 0.2% |
2018-12-20 | 2012 | 0.2% |
2019-01-02 | 1982 | 0.2% |
2019-02-25 | 1939 | 0.2% |
2018-05-24 | 1929 | 0.2% |
2019-01-22 | 1922 | 0.2% |
2018-12-25 | 1916 | 0.2% |
Other values (2140) | 986773 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 2267061 | |
- | 2054618 | |
1 | 1871415 | |
2 | 1671458 | |
8 | 544478 | 5.3% |
9 | 486902 | 4.7% |
7 | 418414 | 4.1% |
6 | 361158 | 3.5% |
3 | 229774 | 2.2% |
5 | 196197 | 1.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 8218472 | |
Dash Punctuation | 2054618 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 2267061 | |
1 | 1871415 | |
2 | 1671458 | |
8 | 544478 | 6.6% |
9 | 486902 | 5.9% |
7 | 418414 | 5.1% |
6 | 361158 | 4.4% |
3 | 229774 | 2.8% |
5 | 196197 | 2.4% |
4 | 171615 | 2.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2054618 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 10273090 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 2267061 | |
- | 2054618 | |
1 | 1871415 | |
2 | 1671458 | |
8 | 544478 | 5.3% |
9 | 486902 | 4.7% |
7 | 418414 | 4.1% |
6 | 361158 | 3.5% |
3 | 229774 | 2.2% |
5 | 196197 | 1.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10273090 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 2267061 | |
- | 2054618 | |
1 | 1871415 | |
2 | 1671458 | |
8 | 544478 | 5.3% |
9 | 486902 | 4.7% |
7 | 418414 | 4.1% |
6 | 361158 | 3.5% |
3 | 229774 | 2.2% |
5 | 196197 | 1.9% |
MISSING
 
Distinct | 2168 |
---|---|
Distinct (%) | 0.1% |
Missing | 2434155 |
Missing (%) | 62.6% |
Memory size | 29.7 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 14535290 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 224 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2019-01-10 |
---|---|
2nd row | 2018-12-29 |
3rd row | 2019-01-03 |
4th row | 2019-01-08 |
5th row | 2019-01-05 |
Value | Count | Frequency (%) |
2019-09-16 | 24576 | 1.7% |
2019-05-22 | 4054 | 0.3% |
2019-05-20 | 3990 | 0.3% |
2019-03-25 | 3786 | 0.3% |
2019-01-22 | 3759 | 0.3% |
2019-05-27 | 3685 | 0.3% |
2019-07-19 | 3601 | 0.2% |
2019-02-25 | 3591 | 0.2% |
2019-03-19 | 3582 | 0.2% |
2019-03-20 | 3565 | 0.2% |
Other values (2158) | 1395340 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 3193320 | |
- | 2907058 | |
1 | 2662034 | |
2 | 2367467 | |
9 | 969878 | 6.7% |
8 | 655291 | 4.5% |
7 | 507252 | 3.5% |
6 | 443031 | 3.0% |
3 | 318089 | 2.2% |
5 | 272558 | 1.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 11628232 | |
Dash Punctuation | 2907058 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 3193320 | |
1 | 2662034 | |
2 | 2367467 | |
9 | 969878 | 8.3% |
8 | 655291 | 5.6% |
7 | 507252 | 4.4% |
6 | 443031 | 3.8% |
3 | 318089 | 2.7% |
5 | 272558 | 2.3% |
4 | 239312 | 2.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2907058 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 14535290 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 3193320 | |
- | 2907058 | |
1 | 2662034 | |
2 | 2367467 | |
9 | 969878 | 6.7% |
8 | 655291 | 4.5% |
7 | 507252 | 3.5% |
6 | 443031 | 3.0% |
3 | 318089 | 2.2% |
5 | 272558 | 1.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 14535290 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 3193320 | |
- | 2907058 | |
1 | 2662034 | |
2 | 2367467 | |
9 | 969878 | 6.7% |
8 | 655291 | 4.5% |
7 | 507252 | 3.5% |
6 | 443031 | 3.0% |
3 | 318089 | 2.2% |
5 | 272558 | 1.9% |
education_1138M
Text
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 29.7 MiB |
Length
Max length | 11 |
---|---|
Median length | 10 |
Mean length | 9.311229771 |
Min length | 8 |
Characters and Unicode
Total characters | 36199119 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | P97_36_170 |
---|---|
2nd row | P97_36_170 |
3rd row | P97_36_170 |
4th row | a55475b1 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 1693666 | |
p97_36_170 | 1458441 | |
p33_146_175 | 680819 | |
p106_81_188 | 26860 | 0.7% |
p17_36_170 | 25966 | 0.7% |
p157_18_172 | 1932 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 5763749 | |
7 | 5347163 | |
1 | 4652053 | |
_ | 4388036 | |
3 | 2846045 | |
4 | 2374485 | |
P | 2194018 | 6.1% |
6 | 2192086 | 6.1% |
a | 1693666 | 4.7% |
b | 1693666 | 4.7% |
Other values (4) | 3054152 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 26229733 | |
Connector Punctuation | 4388036 | 12.1% |
Lowercase Letter | 3387332 | 9.4% |
Uppercase Letter | 2194018 | 6.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 5763749 | |
7 | 5347163 | |
1 | 4652053 | |
3 | 2846045 | |
4 | 2374485 | |
6 | 2192086 | 8.4% |
0 | 1511267 | 5.8% |
9 | 1458441 | 5.6% |
8 | 82512 | 0.3% |
2 | 1932 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 1693666 | |
b | 1693666 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 4388036 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 2194018 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 30617769 | |
Latin | 5581350 | 15.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 5763749 | |
7 | 5347163 | |
1 | 4652053 | |
_ | 4388036 | |
3 | 2846045 | |
4 | 2374485 | |
6 | 2192086 | 7.2% |
0 | 1511267 | 4.9% |
9 | 1458441 | 4.8% |
8 | 82512 | 0.3% |
Latin
Value | Count | Frequency (%) |
P | 2194018 | |
a | 1693666 | |
b | 1693666 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 36199119 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 5763749 | |
7 | 5347163 | |
1 | 4652053 | |
_ | 4388036 | |
3 | 2846045 | |
4 | 2374485 | |
P | 2194018 | 6.1% |
6 | 2192086 | 6.1% |
a | 1693666 | 4.7% |
b | 1693666 | 4.7% |
Other values (4) | 3054152 |
MISSING
 
Distinct | 9285 |
---|---|
Distinct (%) | 0.5% |
Missing | 2180869 |
Missing (%) | 56.1% |
Memory size | 29.7 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 17068150 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1902 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 2010-02-15 |
---|---|
2nd row | 2010-02-15 |
3rd row | 2018-05-15 |
4th row | 2013-09-15 |
5th row | 2012-09-15 |
Value | Count | Frequency (%) |
2015-01-15 | 29305 | 1.7% |
2016-01-15 | 28713 | 1.7% |
2014-01-15 | 28642 | 1.7% |
2017-01-15 | 28477 | 1.7% |
2013-01-15 | 28312 | 1.7% |
2012-01-15 | 24185 | 1.4% |
2010-01-15 | 18632 | 1.1% |
2011-09-15 | 17503 | 1.0% |
2010-09-15 | 17495 | 1.0% |
2012-09-15 | 17356 | 1.0% |
Other values (9275) | 1468195 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 3897011 | |
1 | 3611856 | |
- | 3413630 | |
2 | 1980481 | |
5 | 1922389 | |
9 | 677460 | 4.0% |
8 | 341301 | 2.0% |
6 | 318819 | 1.9% |
3 | 311076 | 1.8% |
4 | 305810 | 1.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 13654520 | |
Dash Punctuation | 3413630 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 3897011 | |
1 | 3611856 | |
2 | 1980481 | |
5 | 1922389 | |
9 | 677460 | 5.0% |
8 | 341301 | 2.5% |
6 | 318819 | 2.3% |
3 | 311076 | 2.3% |
4 | 305810 | 2.2% |
7 | 288317 | 2.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3413630 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 17068150 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 3897011 | |
1 | 3611856 | |
- | 3413630 | |
2 | 1980481 | |
5 | 1922389 | |
9 | 677460 | 4.0% |
8 | 341301 | 2.0% |
6 | 318819 | 1.9% |
3 | 311076 | 1.8% |
4 | 305810 | 1.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 17068150 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 3897011 | |
1 | 3611856 | |
- | 3413630 | |
2 | 1980481 | |
5 | 1922389 | |
9 | 677460 | 4.0% |
8 | 341301 | 2.0% |
6 | 318819 | 1.9% |
3 | 311076 | 1.8% |
4 | 305810 | 1.8% |
familystate_726L
Text
MISSING
 
Distinct | 5 |
---|---|
Distinct (%) | < 0.1% |
Missing | 1245201 |
Missing (%) | 32.0% |
Memory size | 29.7 MiB |
Value | Count | Frequency (%) |
married | 1904175 | |
single | 391421 | 14.8% |
widowed | 220519 | 8.3% |
divorced | 85733 | 3.2% |
living_with_partner | 40635 | 1.5% |
Most occurring characters
Value | Count | Frequency (%) |
R | 3975353 | |
I | 2723753 | |
E | 2642483 | |
D | 2516679 | |
A | 1944810 | |
M | 1904175 | |
W | 481673 | 2.6% |
N | 472691 | 2.5% |
L | 432056 | 2.3% |
G | 432056 | 2.3% |
Other values (8) | 1153584 | 6.2% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 18598043 | |
Connector Punctuation | 81270 | 0.4% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
R | 3975353 | |
I | 2723753 | |
E | 2642483 | |
D | 2516679 | |
A | 1944810 | |
M | 1904175 | |
W | 481673 | 2.6% |
N | 472691 | 2.5% |
L | 432056 | 2.3% |
G | 432056 | 2.3% |
Other values (7) | 1072314 | 5.8% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 81270 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 18598043 | |
Common | 81270 | 0.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
R | 3975353 | |
I | 2723753 | |
E | 2642483 | |
D | 2516679 | |
A | 1944810 | |
M | 1904175 | |
W | 481673 | 2.6% |
N | 472691 | 2.5% |
L | 432056 | 2.3% |
G | 432056 | 2.3% |
Other values (7) | 1072314 | 5.8% |
Common
Value | Count | Frequency (%) |
_ | 81270 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 18679313 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
R | 3975353 | |
I | 2723753 | |
E | 2642483 | |
D | 2516679 | |
A | 1944810 | |
M | 1904175 | |
W | 481673 | 2.6% |
N | 472691 | 2.5% |
L | 432056 | 2.3% |
G | 432056 | 2.3% |
Other values (8) | 1153584 | 6.2% |
firstnonzeroinstldate_307D
Text
MISSING
 
Distinct | 4886 |
---|---|
Distinct (%) | 0.1% |
Missing | 365175 |
Missing (%) | 9.4% |
Memory size | 29.7 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 35225090 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 14 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2013-05-04 |
---|---|
2nd row | 2013-05-04 |
3rd row | 2019-02-07 |
4th row | 2019-02-08 |
5th row | 2018-10-12 |
Value | Count | Frequency (%) |
2019-03-14 | 8057 | 0.2% |
2018-12-15 | 6898 | 0.2% |
2018-09-15 | 6596 | 0.2% |
2018-02-15 | 6117 | 0.2% |
2019-02-11 | 6078 | 0.2% |
2018-03-14 | 5897 | 0.2% |
2018-02-11 | 5858 | 0.2% |
2018-07-15 | 5657 | 0.2% |
2019-02-15 | 5602 | 0.2% |
2017-12-15 | 5221 | 0.1% |
Other values (4876) | 3460528 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 8125548 | |
- | 7045018 | |
1 | 6501171 | |
2 | 5801762 | |
8 | 1472398 | 4.2% |
9 | 1257862 | 3.6% |
7 | 1174381 | 3.3% |
5 | 1090650 | 3.1% |
3 | 972274 | 2.8% |
6 | 949158 | 2.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 28180072 | |
Dash Punctuation | 7045018 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 8125548 | |
1 | 6501171 | |
2 | 5801762 | |
8 | 1472398 | 5.2% |
9 | 1257862 | 4.5% |
7 | 1174381 | 4.2% |
5 | 1090650 | 3.9% |
3 | 972274 | 3.5% |
6 | 949158 | 3.4% |
4 | 834868 | 3.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 7045018 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 35225090 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 8125548 | |
- | 7045018 | |
1 | 6501171 | |
2 | 5801762 | |
8 | 1472398 | 4.2% |
9 | 1257862 | 3.6% |
7 | 1174381 | 3.3% |
5 | 1090650 | 3.1% |
3 | 972274 | 2.8% |
6 | 949158 | 2.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35225090 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8125548 | |
- | 7045018 | |
1 | 6501171 | |
2 | 5801762 | |
8 | 1472398 | 4.2% |
9 | 1257862 | 3.6% |
7 | 1174381 | 3.3% |
5 | 1090650 | 3.1% |
3 | 972274 | 2.8% |
6 | 949158 | 2.7% |
MISSING
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 123329 |
Missing (%) | 3.2% |
Memory size | 29.7 MiB |
Value | Count | Frequency (%) |
pos | 2167053 | |
cash | 1478997 | |
ndf | 118305 | 3.1% |
Most occurring characters
Value | Count | Frequency (%) |
S | 3646050 | |
P | 2167053 | |
O | 2167053 | |
C | 1478997 | |
A | 1478997 | |
H | 1478997 | |
N | 118305 | 0.9% |
D | 118305 | 0.9% |
F | 118305 | 0.9% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 12772062 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
S | 3646050 | |
P | 2167053 | |
O | 2167053 | |
C | 1478997 | |
A | 1478997 | |
H | 1478997 | |
N | 118305 | 0.9% |
D | 118305 | 0.9% |
F | 118305 | 0.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 12772062 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
S | 3646050 | |
P | 2167053 | |
O | 2167053 | |
C | 1478997 | |
A | 1478997 | |
H | 1478997 | |
N | 118305 | 0.9% |
D | 118305 | 0.9% |
F | 118305 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 12772062 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
S | 3646050 | |
P | 2167053 | |
O | 2167053 | |
C | 1478997 | |
A | 1478997 | |
H | 1478997 | |
N | 118305 | 0.9% |
D | 118305 | 0.9% |
F | 118305 | 0.9% |
isbidproduct_390L
Boolean
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 35 |
Missing (%) | < 0.1% |
Memory size | 29.7 MiB |
False | |
---|---|
True | 218833 |
(Missing) | 35 |
Value | Count | Frequency (%) |
False | 3668816 | |
True | 218833 | 5.6% |
(Missing) | 35 | < 0.1% |
isdebitcard_527L
Boolean
MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 3637550 |
Missing (%) | 93.6% |
Memory size | 29.7 MiB |
False | 212706 |
---|---|
True | 37428 |
(Missing) |
Value | Count | Frequency (%) |
False | 212706 | 5.5% |
True | 37428 | 1.0% |
(Missing) | 3637550 |
mainoccupationinc_437A
Real number (ℝ)
Distinct | 21957 |
---|---|
Distinct (%) | 0.6% |
Missing | 36612 |
Missing (%) | 0.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40029.69379 |
Minimum | 0 |
---|---|
Maximum | 196000 |
Zeros | 47 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 6000 |
Q1 | 18000 |
median | 34000 |
Q3 | 52000 |
95-th percentile | 100000 |
Maximum | 196000 |
Range | 196000 |
Interquartile range (IQR) | 34000 |
Descriptive statistics
Standard deviation | 31396.36665 |
---|---|
Coefficient of variation (CV) | 0.7843269254 |
Kurtosis | 5.201732139 |
Mean | 40029.69379 |
Median Absolute Deviation (MAD) | 17000 |
Skewness | 1.863363842 |
Sum | 1.541572329 × 1011 |
Variance | 985731838.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
30000 | 278622 | 7.2% |
40000 | 272284 | 7.0% |
50000 | 230602 | 5.9% |
60000 | 184523 | 4.7% |
20000 | 145935 | 3.8% |
24000 | 124888 | 3.2% |
70000 | 114524 | 2.9% |
36000 | 113091 | 2.9% |
16000 | 74780 | 1.9% |
12000 | 69980 | 1.8% |
Other values (21947) | 2241843 |
Value | Count | Frequency (%) |
0 | 47 | |
0.038 | 1 | < 0.1% |
0.2 | 78 | |
0.4 | 6 | < 0.1% |
0.6 | 6 | < 0.1% |
Value | Count | Frequency (%) |
196000 | 19426 | |
195800 | 4 | < 0.1% |
195600 | 16 | < 0.1% |
195540 | 1 | < 0.1% |
195400 | 1 | < 0.1% |
maxdpdtolerance_577P
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 3199 |
---|---|
Distinct (%) | 0.2% |
Missing | 1817378 |
Missing (%) | 46.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.26581674 |
Minimum | 0 |
---|---|
Maximum | 4058 |
Zeros | 1527003 |
Zeros (%) | 39.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 15 |
Maximum | 4058 |
Range | 4058 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 134.8893241 |
---|---|
Coefficient of variation (CV) | 10.16818841 |
Kurtosis | 328.0205761 |
Mean | 13.26581674 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 16.64564203 |
Sum | 27464300 |
Variance | 18195.12975 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1527003 | |
1 | 283391 | 7.3% |
5 | 35934 | 0.9% |
6 | 28382 | 0.7% |
10 | 21438 | 0.6% |
4 | 11330 | 0.3% |
9 | 11187 | 0.3% |
14 | 10079 | 0.3% |
7 | 9775 | 0.3% |
18 | 9442 | 0.2% |
Other values (3189) | 122345 | 3.1% |
(Missing) | 1817378 |
Value | Count | Frequency (%) |
0 | 1527003 | |
1 | 283391 | 7.3% |
2 | 8816 | 0.2% |
3 | 2658 | 0.1% |
4 | 11330 | 0.3% |
Value | Count | Frequency (%) |
4058 | 1 | |
4025 | 1 | |
4024 | 1 | |
4000 | 2 | |
3999 | 1 |
num_group1
Real number (ℝ)
ZEROS
 
Distinct | 20 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.910262254 |
Minimum | 0 |
---|---|
Maximum | 19 |
Zeros | 782997 |
Zeros (%) | 20.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 3 |
Q3 | 6 |
95-th percentile | 13 |
Maximum | 19 |
Range | 19 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 4.101353263 |
---|---|
Coefficient of variation (CV) | 1.048869103 |
Kurtosis | 1.58474303 |
Mean | 3.910262254 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 1.393787504 |
Sum | 15201864 |
Variance | 16.82109859 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 782997 | |
1 | 622546 | |
2 | 494850 | |
3 | 394066 | |
4 | 314724 | |
5 | 251725 | 6.5% |
6 | 202039 | 5.2% |
7 | 162746 | 4.2% |
8 | 131605 | 3.4% |
9 | 106507 | 2.7% |
Other values (10) | 423879 |
Value | Count | Frequency (%) |
0 | 782997 | |
1 | 622546 | |
2 | 494850 | |
3 | 394066 | |
4 | 314724 |
Value | Count | Frequency (%) |
19 | 16520 | |
18 | 19472 | |
17 | 23111 | |
16 | 27575 | |
15 | 33102 |
outstandingdebt_522A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 269095 |
---|---|
Distinct (%) | 10.3% |
Missing | 1277922 |
Missing (%) | 32.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6994.587846 |
Minimum | 0 |
---|---|
Maximum | 1210629.1 |
Zeros | 2229538 |
Zeros (%) | 57.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 41496.3719 |
Maximum | 1210629.1 |
Range | 1210629.1 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 29162.48428 |
---|---|
Coefficient of variation (CV) | 4.169292734 |
Kurtosis | 88.30188385 |
Mean | 6994.587846 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 7.63303055 |
Sum | 1.825420957 × 1010 |
Variance | 850450489.7 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 2229538 | |
10 | 188 | < 0.1% |
19998 | 82 | < 0.1% |
17998 | 75 | < 0.1% |
11998 | 73 | < 0.1% |
7998 | 72 | < 0.1% |
15998 | 66 | < 0.1% |
14998 | 66 | < 0.1% |
9978 | 64 | < 0.1% |
13998 | 64 | < 0.1% |
Other values (269085) | 379474 | 9.8% |
(Missing) | 1277922 |
Value | Count | Frequency (%) |
0 | 2229538 | |
0.002 | 3 | < 0.1% |
0.004 | 1 | < 0.1% |
0.008 | 1 | < 0.1% |
0.010000001 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1210629.1 | 1 | |
1192100.9 | 1 | |
1092393 | 1 | |
1085048.1 | 1 | |
1071760.9 | 1 |
pmtnum_8L
Real number (ℝ)
MISSING
 
Distinct | 57 |
---|---|
Distinct (%) | < 0.1% |
Missing | 312833 |
Missing (%) | 8.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.78210253 |
Minimum | 3 |
---|---|
Maximum | 62 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 3 |
---|---|
5-th percentile | 3 |
Q1 | 6 |
median | 12 |
Q3 | 24 |
95-th percentile | 36 |
Maximum | 62 |
Range | 59 |
Interquartile range (IQR) | 18 |
Descriptive statistics
Standard deviation | 10.46206069 |
---|---|
Coefficient of variation (CV) | 0.662906648 |
Kurtosis | 1.294429223 |
Mean | 15.78210253 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 1.219734053 |
Sum | 56418665 |
Variance | 109.4547138 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
12 | 883553 | |
6 | 589676 | |
24 | 562114 | |
18 | 296861 | 7.6% |
36 | 190785 | 4.9% |
3 | 188014 | 4.8% |
16 | 133452 | 3.4% |
48 | 102226 | 2.6% |
9 | 78979 | 2.0% |
10 | 75231 | 1.9% |
Other values (47) | 473960 | |
(Missing) | 312833 | 8.0% |
Value | Count | Frequency (%) |
3 | 188014 | 4.8% |
4 | 73219 | 1.9% |
5 | 43429 | 1.1% |
6 | 589676 | |
7 | 7541 | 0.2% |
Value | Count | Frequency (%) |
62 | 1 | < 0.1% |
61 | 3 | < 0.1% |
60 | 3148 | |
58 | 46 | < 0.1% |
56 | 23 | < 0.1% |
postype_4733339M
Text
Distinct | 9 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 29.7 MiB |
Length
Max length | 12 |
---|---|
Median length | 8 |
Mean length | 8.092561535 |
Min length | 8 |
Characters and Unicode
Total characters | 31461322 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | a55475b1 |
---|---|
2nd row | a55475b1 |
3rd row | a55475b1 |
4th row | a55475b1 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 3779480 | |
p177_117_192 | 57712 | 1.5% |
p46_145_78 | 24034 | 0.6% |
p149_40_170 | 11308 | 0.3% |
p60_146_156 | 8054 | 0.2% |
p67_102_161 | 4470 | 0.1% |
p217_110_186 | 1560 | < 0.1% |
p169_115_83 | 848 | < 0.1% |
p140_48_169 | 218 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 11371376 | |
1 | 4095716 | 13.0% |
7 | 3993988 | 12.7% |
4 | 3858654 | 12.3% |
a | 3779480 | 12.0% |
b | 3779480 | 12.0% |
_ | 216408 | 0.7% |
P | 108204 | 0.3% |
9 | 70086 | 0.2% |
2 | 63742 | 0.2% |
Other values (4) | 124188 | 0.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 23577750 | |
Lowercase Letter | 7558960 | 24.0% |
Connector Punctuation | 216408 | 0.7% |
Uppercase Letter | 108204 | 0.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 11371376 | |
1 | 4095716 | 17.4% |
7 | 3993988 | 16.9% |
4 | 3858654 | 16.4% |
9 | 70086 | 0.3% |
2 | 63742 | 0.3% |
6 | 59762 | 0.3% |
0 | 36918 | 0.2% |
8 | 26660 | 0.1% |
3 | 848 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 3779480 | |
b | 3779480 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 216408 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 108204 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 23794158 | |
Latin | 7667164 | 24.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 11371376 | |
1 | 4095716 | 17.2% |
7 | 3993988 | 16.8% |
4 | 3858654 | 16.2% |
_ | 216408 | 0.9% |
9 | 70086 | 0.3% |
2 | 63742 | 0.3% |
6 | 59762 | 0.3% |
0 | 36918 | 0.2% |
8 | 26660 | 0.1% |
Latin
Value | Count | Frequency (%) |
a | 3779480 | |
b | 3779480 | |
P | 108204 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 31461322 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 11371376 | |
1 | 4095716 | 13.0% |
7 | 3993988 | 12.7% |
4 | 3858654 | 12.3% |
a | 3779480 | 12.0% |
b | 3779480 | 12.0% |
_ | 216408 | 0.7% |
P | 108204 | 0.3% |
9 | 70086 | 0.2% |
2 | 63742 | 0.2% |
Other values (4) | 124188 | 0.4% |
profession_152M
Text
Distinct | 9028 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 29.7 MiB |
Length
Max length | 17 |
---|---|
Median length | 8 |
Mean length | 8.031521852 |
Min length | 7 |
Characters and Unicode
Total characters | 31224019 |
---|---|
Distinct characters | 36 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 5636 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | a55475b1 |
---|---|
2nd row | a55475b1 |
3rd row | a55475b1 |
4th row | a55475b1 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 3839511 | |
p46_72_80 | 1281 | < 0.1% |
p104_137_180 | 959 | < 0.1% |
p167_22_171 | 699 | < 0.1% |
p139_125_64 | 671 | < 0.1% |
p143_116_69 | 665 | < 0.1% |
p25_111_112 | 640 | < 0.1% |
p21_76_53 | 612 | < 0.1% |
p116_59_165 | 532 | < 0.1% |
p103_114_185 | 526 | < 0.1% |
Other values (9018) | 41588 | 1.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 11548196 | |
1 | 3943241 | 12.6% |
7 | 3873726 | 12.4% |
4 | 3869256 | 12.4% |
a | 3839535 | 12.3% |
b | 3839515 | 12.3% |
_ | 96346 | 0.3% |
P | 48092 | 0.2% |
2 | 34187 | 0.1% |
6 | 33655 | 0.1% |
Other values (26) | 98270 | 0.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 23400120 | |
Lowercase Letter | 7679380 | 24.6% |
Connector Punctuation | 96346 | 0.3% |
Uppercase Letter | 48173 | 0.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 3839535 | |
b | 3839515 | |
e | 46 | < 0.1% |
o | 39 | < 0.1% |
r | 37 | < 0.1% |
t | 35 | < 0.1% |
u | 22 | < 0.1% |
s | 16 | < 0.1% |
k | 16 | < 0.1% |
h | 15 | < 0.1% |
Other values (13) | 104 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
5 | 11548196 | |
1 | 3943241 | 16.9% |
7 | 3873726 | 16.6% |
4 | 3869256 | 16.5% |
2 | 34187 | 0.1% |
6 | 33655 | 0.1% |
3 | 25672 | 0.1% |
0 | 25385 | 0.1% |
8 | 24952 | 0.1% |
9 | 21850 | 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 48092 | |
Q | 81 | 0.2% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 96346 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 23496466 | |
Latin | 7727553 | 24.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 3839535 | |
b | 3839515 | |
P | 48092 | 0.6% |
Q | 81 | < 0.1% |
e | 46 | < 0.1% |
o | 39 | < 0.1% |
r | 37 | < 0.1% |
t | 35 | < 0.1% |
u | 22 | < 0.1% |
s | 16 | < 0.1% |
Other values (15) | 135 | < 0.1% |
Common
Value | Count | Frequency (%) |
5 | 11548196 | |
1 | 3943241 | 16.8% |
7 | 3873726 | 16.5% |
4 | 3869256 | 16.5% |
_ | 96346 | 0.4% |
2 | 34187 | 0.1% |
6 | 33655 | 0.1% |
3 | 25672 | 0.1% |
0 | 25385 | 0.1% |
8 | 24952 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 31224019 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 11548196 | |
1 | 3943241 | 12.6% |
7 | 3873726 | 12.4% |
4 | 3869256 | 12.4% |
a | 3839535 | 12.3% |
b | 3839515 | 12.3% |
_ | 96346 | 0.3% |
P | 48092 | 0.2% |
2 | 34187 | 0.1% |
6 | 33655 | 0.1% |
Other values (26) | 98270 | 0.3% |
Distinct | 18 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 29.7 MiB |
Length
Max length | 11 |
---|---|
Median length | 8 |
Mean length | 8.667440306 |
Min length | 8 |
Characters and Unicode
Total characters | 33696269 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | a55475b1 |
---|---|
2nd row | a55475b1 |
3rd row | P94_109_143 |
4th row | a55475b1 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 2848762 | |
p94_109_143 | 524764 | 13.5% |
p99_56_166 | 358720 | 9.2% |
p45_84_106 | 71489 | 1.8% |
p198_131_9 | 65648 | 1.7% |
p30_86_84 | 5808 | 0.1% |
p52_67_90 | 3027 | 0.1% |
p48_22_32 | 2793 | 0.1% |
p196_88_176 | 2112 | 0.1% |
p121_60_164 | 1636 | < 0.1% |
Other values (8) | 2925 | 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 8980061 | |
1 | 4540393 | |
4 | 4053100 | |
7 | 2855155 | 8.5% |
a | 2848762 | 8.5% |
b | 2848762 | 8.5% |
_ | 2077844 | 6.2% |
9 | 1905359 | 5.7% |
6 | 1167599 | 3.5% |
P | 1038922 | 3.1% |
Other values (4) | 1380312 | 4.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 24881979 | |
Lowercase Letter | 5697524 | 16.9% |
Connector Punctuation | 2077844 | 6.2% |
Uppercase Letter | 1038922 | 3.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 8980061 | |
1 | 4540393 | |
4 | 4053100 | |
7 | 2855155 | 11.5% |
9 | 1905359 | 7.7% |
6 | 1167599 | 4.7% |
0 | 607651 | 2.4% |
3 | 599414 | 2.4% |
8 | 157338 | 0.6% |
2 | 15909 | 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 2848762 | |
b | 2848762 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2077844 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 1038922 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 26959823 | |
Latin | 6736446 | 20.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 8980061 | |
1 | 4540393 | |
4 | 4053100 | |
7 | 2855155 | 10.6% |
_ | 2077844 | 7.7% |
9 | 1905359 | 7.1% |
6 | 1167599 | 4.3% |
0 | 607651 | 2.3% |
3 | 599414 | 2.2% |
8 | 157338 | 0.6% |
Latin
Value | Count | Frequency (%) |
a | 2848762 | |
b | 2848762 | |
P | 1038922 | 15.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 33696269 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 8980061 | |
1 | 4540393 | |
4 | 4053100 | |
7 | 2855155 | 8.5% |
a | 2848762 | 8.5% |
b | 2848762 | 8.5% |
_ | 2077844 | 6.2% |
9 | 1905359 | 5.7% |
6 | 1167599 | 3.5% |
P | 1038922 | 3.1% |
Other values (4) | 1380312 | 4.1% |
Distinct | 11 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 29.7 MiB |
Length
Max length | 11 |
---|---|
Median length | 8 |
Mean length | 8.610769548 |
Min length | 8 |
Characters and Unicode
Total characters | 33475951 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | a55475b1 |
---|---|
2nd row | a55475b1 |
3rd row | a55475b1 |
4th row | a55475b1 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 3062831 | |
p94_109_143 | 762661 | 19.6% |
p30_86_84 | 29604 | 0.8% |
p52_67_90 | 12511 | 0.3% |
p69_72_116 | 7443 | 0.2% |
p129_162_80 | 7216 | 0.2% |
p84_14_61 | 2946 | 0.1% |
p64_121_167 | 897 | < 0.1% |
p5_143_178 | 635 | < 0.1% |
p19_25_34 | 622 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 9202897 | |
1 | 4628582 | |
4 | 4625803 | |
7 | 3084317 | 9.2% |
a | 3062831 | 9.1% |
b | 3062831 | 9.1% |
_ | 1649706 | 4.9% |
9 | 1553114 | 4.6% |
P | 824853 | 2.5% |
0 | 812310 | 2.4% |
Other values (4) | 968707 | 2.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 24875730 | |
Lowercase Letter | 6125662 | 18.3% |
Connector Punctuation | 1649706 | 4.9% |
Uppercase Letter | 824853 | 2.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 9202897 | |
1 | 4628582 | |
4 | 4625803 | |
7 | 3084317 | 12.4% |
9 | 1553114 | 6.2% |
0 | 812310 | 3.3% |
3 | 793840 | 3.2% |
8 | 70005 | 0.3% |
6 | 68957 | 0.3% |
2 | 35905 | 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 3062831 | |
b | 3062831 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1649706 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 824853 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 26525436 | |
Latin | 6950515 | 20.8% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 9202897 | |
1 | 4628582 | |
4 | 4625803 | |
7 | 3084317 | 11.6% |
_ | 1649706 | 6.2% |
9 | 1553114 | 5.9% |
0 | 812310 | 3.1% |
3 | 793840 | 3.0% |
8 | 70005 | 0.3% |
6 | 68957 | 0.3% |
Latin
Value | Count | Frequency (%) |
a | 3062831 | |
b | 3062831 | |
P | 824853 | 11.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 33475951 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 9202897 | |
1 | 4628582 | |
4 | 4625803 | |
7 | 3084317 | 9.2% |
a | 3062831 | 9.1% |
b | 3062831 | 9.1% |
_ | 1649706 | 4.9% |
9 | 1553114 | 4.6% |
P | 824853 | 2.5% |
0 | 812310 | 2.4% |
Other values (4) | 968707 | 2.9% |
revolvingaccount_394A
Real number (ℝ)
MISSING
 
Distinct | 60659 |
---|---|
Distinct (%) | 38.7% |
Missing | 3731033 |
Missing (%) | 96.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 740354832.4 |
Minimum | 540342400 |
---|---|
Maximum | 780865400 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 540342400 |
---|---|
5-th percentile | 561029650 |
Q1 | 740688450 |
median | 760248700 |
Q3 | 760724950 |
95-th percentile | 780564600 |
Maximum | 780865400 |
Range | 240523000 |
Interquartile range (IQR) | 20036500 |
Descriptive statistics
Standard deviation | 54986858.03 |
---|---|
Coefficient of variation (CV) | 0.07427095174 |
Kurtosis | 5.600146409 |
Mean | 740354832.4 |
Median Absolute Deviation (MAD) | 19415740 |
Skewness | -2.485268947 |
Sum | 1.159773249 × 1014 |
Variance | 3.023554556 × 1015 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
760540100 | 35 | < 0.1% |
742529500 | 31 | < 0.1% |
760482240 | 30 | < 0.1% |
760434100 | 27 | < 0.1% |
760635840 | 26 | < 0.1% |
760467400 | 25 | < 0.1% |
760470700 | 25 | < 0.1% |
760540350 | 24 | < 0.1% |
760558800 | 24 | < 0.1% |
760447170 | 23 | < 0.1% |
Other values (60649) | 156381 | 4.0% |
(Missing) | 3731033 |
Value | Count | Frequency (%) |
540342400 | 3 | |
540342460 | 7 | |
540342500 | 5 | |
540342600 | 4 | |
540342660 | 3 |
Value | Count | Frequency (%) |
780865400 | 1 | |
780865200 | 1 | |
780864800 | 2 | |
780864700 | 2 | |
780864260 | 1 |
status_219L
Text
Distinct | 11 |
---|---|
Distinct (%) | < 0.1% |
Missing | 35 |
Missing (%) | < 0.1% |
Memory size | 29.7 MiB |
Value | Count | Frequency (%) |
k | 1605077 | |
d | 1563834 | |
a | 431299 | 11.1% |
t | 263947 | 6.8% |
n | 15668 | 0.4% |
q | 4766 | 0.1% |
s | 2265 | 0.1% |
l | 470 | < 0.1% |
h | 276 | < 0.1% |
p | 39 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
K | 1605077 | |
D | 1563834 | |
A | 431299 | 11.1% |
T | 263947 | 6.8% |
N | 15668 | 0.4% |
Q | 4766 | 0.1% |
S | 2265 | 0.1% |
L | 470 | < 0.1% |
H | 276 | < 0.1% |
P | 39 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 3887649 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
K | 1605077 | |
D | 1563834 | |
A | 431299 | 11.1% |
T | 263947 | 6.8% |
N | 15668 | 0.4% |
Q | 4766 | 0.1% |
S | 2265 | 0.1% |
L | 470 | < 0.1% |
H | 276 | < 0.1% |
P | 39 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 3887649 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
K | 1605077 | |
D | 1563834 | |
A | 431299 | 11.1% |
T | 263947 | 6.8% |
N | 15668 | 0.4% |
Q | 4766 | 0.1% |
S | 2265 | 0.1% |
L | 470 | < 0.1% |
H | 276 | < 0.1% |
P | 39 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3887649 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
K | 1605077 | |
D | 1563834 | |
A | 431299 | 11.1% |
T | 263947 | 6.8% |
N | 15668 | 0.4% |
Q | 4766 | 0.1% |
S | 2265 | 0.1% |
L | 470 | < 0.1% |
H | 276 | < 0.1% |
P | 39 | < 0.1% |
tenor_203L
Real number (ℝ)
MISSING
 
Distinct | 57 |
---|---|
Distinct (%) | < 0.1% |
Missing | 312833 |
Missing (%) | 8.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.78210253 |
Minimum | 3 |
---|---|
Maximum | 62 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.7 MiB |
Quantile statistics
Minimum | 3 |
---|---|
5-th percentile | 3 |
Q1 | 6 |
median | 12 |
Q3 | 24 |
95-th percentile | 36 |
Maximum | 62 |
Range | 59 |
Interquartile range (IQR) | 18 |
Descriptive statistics
Standard deviation | 10.46206069 |
---|---|
Coefficient of variation (CV) | 0.662906648 |
Kurtosis | 1.294429223 |
Mean | 15.78210253 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 1.219734053 |
Sum | 56418665 |
Variance | 109.4547138 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
12 | 883553 | |
6 | 589676 | |
24 | 562114 | |
18 | 296861 | 7.6% |
36 | 190785 | 4.9% |
3 | 188014 | 4.8% |
16 | 133452 | 3.4% |
48 | 102226 | 2.6% |
9 | 78979 | 2.0% |
10 | 75231 | 1.9% |
Other values (47) | 473960 | |
(Missing) | 312833 | 8.0% |
Value | Count | Frequency (%) |
3 | 188014 | 4.8% |
4 | 73219 | 1.9% |
5 | 43429 | 1.1% |
6 | 589676 | |
7 | 7541 | 0.2% |
Value | Count | Frequency (%) |
62 | 1 | < 0.1% |
61 | 3 | < 0.1% |
60 | 3148 | |
58 | 46 | < 0.1% |
56 | 23 | < 0.1% |