Dataset statistics
Number of variables | 41 |
---|---|
Number of observations | 2638295 |
Missing cells | 34503185 |
Missing cells (%) | 31.9% |
Total size in memory | 825.3 MiB |
Average record size in memory | 328.0 B |
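The derived figures in this table follow from the raw counts. The following is a minimal sanity check that uses only the numbers reported above; the variable names are illustrative and not part of the report.

```python
# Values copied from the Dataset statistics table above.
n_variables = 41
n_observations = 2_638_295
missing_cells = 34_503_185
total_size_mib = 825.3

# Share of missing cells = missing cells / (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size = total bytes / rows (1 MiB = 1024 * 1024 bytes).
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results agree with the reported 31.9% and 328.0 B once rounded to one decimal place.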
Variable types
Numeric | 20 |
---|---|
Text | 19 |
Boolean | 2 |
Alerts
isbidproduct_390L is highly imbalanced (69.3%) | Imbalance |
annuity_853A has 94885 (3.6%) missing values | Missing |
approvaldate_319D has 1244273 (47.2%) missing values | Missing |
byoccupationinc_3656910L has 2095507 (79.4%) missing values | Missing |
childnum_21L has 1605531 (60.9%) missing values | Missing |
credacc_actualbalance_314A has 2486439 (94.2%) missing values | Missing |
credacc_credlmt_575A has 75062 (2.8%) missing values | Missing |
credacc_maxhisbal_375A has 2486439 (94.2%) missing values | Missing |
credacc_minhisbal_90A has 2486439 (94.2%) missing values | Missing |
credacc_status_367L has 2486439 (94.2%) missing values | Missing |
credacc_transactions_402L has 2486439 (94.2%) missing values | Missing |
credamount_590A has 78869 (3.0%) missing values | Missing |
credtype_587L has 78869 (3.0%) missing values | Missing |
currdebt_94A has 976135 (37.0%) missing values | Missing |
dateactivated_425D has 1297051 (49.2%) missing values | Missing |
downpmt_134A has 78869 (3.0%) missing values | Missing |
dtlastpmt_581D has 1890009 (71.6%) missing values | Missing |
dtlastpmtallstes_3545839D has 1609466 (61.0%) missing values | Missing |
employedfrom_700D has 1705609 (64.6%) missing values | Missing |
familystate_726L has 1148691 (43.5%) missing values | Missing |
firstnonzeroinstldate_307D has 287307 (10.9%) missing values | Missing |
inittransactioncode_279L has 78869 (3.0%) missing values | Missing |
isdebitcard_527L has 2425416 (91.9%) missing values | Missing |
mainoccupationinc_437A has 65371 (2.5%) missing values | Missing |
maxdpdtolerance_577P has 1278326 (48.5%) missing values | Missing |
outstandingdebt_522A has 980346 (37.2%) missing values | Missing |
pmtnum_8L has 238987 (9.1%) missing values | Missing |
revolvingaccount_394A has 2498196 (94.7%) missing values | Missing |
tenor_203L has 238987 (9.1%) missing values | Missing |
actualdpd_943P is highly skewed (γ₁ = 530.2918463) | Skewed |
credacc_maxhisbal_375A is highly skewed (γ₁ = 43.42768751) | Skewed |
downpmt_134A is highly skewed (γ₁ = 21.07123108) | Skewed |
actualdpd_943P has 2636655 (99.9%) zeros | Zeros |
annuity_853A has 200456 (7.6%) zeros | Zeros |
byoccupationinc_3656910L has 34448 (1.3%) zeros | Zeros |
childnum_21L has 574109 (21.8%) zeros | Zeros |
credacc_actualbalance_314A has 47706 (1.8%) zeros | Zeros |
credacc_credlmt_575A has 2369092 (89.8%) zeros | Zeros |
credacc_maxhisbal_375A has 84335 (3.2%) zeros | Zeros |
credacc_minhisbal_90A has 89232 (3.4%) zeros | Zeros |
credacc_transactions_402L has 134552 (5.1%) zeros | Zeros |
credamount_590A has 73846 (2.8%) zeros | Zeros |
currdebt_94A has 1441963 (54.7%) zeros | Zeros |
downpmt_134A has 2348164 (89.0%) zeros | Zeros |
maxdpdtolerance_577P has 986233 (37.4%) zeros | Zeros |
num_group1 has 438525 (16.6%) zeros | Zeros |
outstandingdebt_522A has 1436446 (54.4%) zeros | Zeros |
Reproduction
Analysis started | 2024-02-13 19:36:58.685528 |
---|---|
Analysis finished | 2024-02-13 19:37:23.604870 |
Duration | 24.92 seconds |
Software version | ydata-profiling v4.6.4 |
Download configuration | config.json |
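The reported duration is the difference between the two timestamps. A small sketch using only the standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction table above.
started = datetime.fromisoformat("2024-02-13 19:36:58.685528")
finished = datetime.fromisoformat("2024-02-13 19:37:23.604870")

# Subtracting two datetimes gives a timedelta; convert it to seconds.
duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")
```

Rounded to two decimals, this reproduces the 24.92 seconds shown in the table.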
case_id
Real number (ℝ)
Distinct | 438525 |
---|---|
Distinct (%) | 16.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1482078.441 |
Minimum | 40704 |
---|---|
Maximum | 2703454 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 40704 |
---|---|
5-th percentile | 198771 |
Q1 | 257431 |
median | 1788760 |
Q3 | 1895740 |
95-th percentile | 2683686.3 |
Maximum | 2703454 |
Range | 2662750 |
Interquartile range (IQR) | 1638309 |
Descriptive statistics
Standard deviation | 822852.6011 |
---|---|
Coefficient of variation (CV) | 0.5552017887 |
Kurtosis | -0.9550918823 |
Mean | 1482078.441 |
Median Absolute Deviation (MAD) | 129385 |
Skewness | -0.5094453682 |
Sum | 3.910160139 × 10¹² |
Variance | 6.770864032 × 10¹¹ |
Monotonicity | Increasing |
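Several of the case_id statistics are simple functions of the others. The sketch below recomputes them from the reported values as a consistency check; it assumes nothing beyond the numbers in the tables above, and the variable names are illustrative.

```python
# Values copied from the case_id tables above.
mean = 1482078.441
std = 822852.6011
q1, q3 = 257431, 1895740
minimum, maximum = 40704, 2703454
n_observations = 2_638_295  # case_id has no missing values

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance is the squared standard deviation
total = mean * n_observations     # Sum = mean x number of non-missing values

print(value_range, iqr)
print(f"CV = {cv:.6f}, variance = {variance:.4e}, sum = {total:.4e}")
```

The recomputed values match the reported range, IQR, CV, variance, and sum up to rounding of the displayed digits.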
Value | Count | Frequency (%) |
2660368 | 20 | < 0.1% |
229396 | 20 | < 0.1% |
229411 | 20 | < 0.1% |
195358 | 20 | < 0.1% |
250069 | 20 | < 0.1% |
1809155 | 20 | < 0.1% |
1906631 | 20 | < 0.1% |
250081 | 20 | < 0.1% |
1715312 | 20 | < 0.1% |
250087 | 20 | < 0.1% |
Other values (438515) | 2638095 |
Value | Count | Frequency (%) |
40704 | 1 | < 0.1% |
40734 | 1 | < 0.1% |
40737 | 1 | < 0.1% |
40791 | 3 | < 0.1% |
40821 | 2 | < 0.1% |
Value | Count | Frequency (%) |
2703454 | 2 | < 0.1% |
2703453 | 9 | < 0.1% |
2703452 | 3 | < 0.1% |
2703451 | 6 | < 0.1% |
2703450 | 13 | < 0.1% |
actualdpd_943P
Real number (ℝ)
SKEWED
  ZEROS
 
Distinct | 157 |
---|---|
Distinct (%) | < 0.1% |
Missing | 266 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.02098536445 |
Minimum | 0 |
---|---|
Maximum | 4206 |
Zeros | 2636655 |
Zeros (%) | 99.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 4206 |
Range | 4206 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 5.707747487 |
---|---|
Coefficient of variation (CV) | 271.9870555 |
Kurtosis | 320201.8546 |
Mean | 0.02098536445 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 530.2918463 |
Sum | 55360 |
Variance | 32.57838137 |
Monotonicity | Not monotonic |
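Note that the mean here is taken over non-missing values only, which matters for columns with missing entries. A short check from the reported figures (variable names are illustrative):

```python
# Values copied from the actualdpd_943P tables above.
n_observations = 2_638_295
n_missing = 266
value_sum = 55_360
std = 5.707747487

# Missing values are excluded from the denominator of the mean.
n_valid = n_observations - n_missing
mean = value_sum / n_valid

cv = std / mean          # Coefficient of variation
variance = std ** 2      # Variance is the squared standard deviation

print(f"mean = {mean:.10f}, CV = {cv:.4f}, variance = {variance:.6f}")
```

With 99.9% of values equal to zero, the mean is close to zero, so the coefficient of variation becomes very large even though the standard deviation is modest.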
Value | Count | Frequency (%) |
0 | 2636655 | 99.9% |
1 | 520 | < 0.1% |
2 | 220 | < 0.1% |
3 | 153 | < 0.1% |
4 | 53 | < 0.1% |
6 | 38 | < 0.1% |
5 | 32 | < 0.1% |
7 | 21 | < 0.1% |
8 | 20 | < 0.1% |
9 | 14 | < 0.1% |
Other values (147) | 303 | < 0.1% |
(Missing) | 266 | < 0.1% |
Value | Count | Frequency (%) |
0 | 2636655 | 99.9% |
1 | 520 | < 0.1% |
2 | 220 | < 0.1% |
3 | 153 | < 0.1% |
4 | 53 | < 0.1% |
Value | Count | Frequency (%) |
4206 | 1 | < 0.1% |
3980 | 1 | < 0.1% |
3623 | 1 | < 0.1% |
2617 | 1 | < 0.1% |
2505 | 1 | < 0.1% |
annuity_853A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 76848 |
---|---|
Distinct (%) | 3.0% |
Missing | 94885 |
Missing (%) | 3.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3502.854599 |
Minimum | 0 |
---|---|
Maximum | 103000 |
Zeros | 200456 |
Zeros (%) | 7.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1711.6 |
median | 2825.8 |
Q3 | 4595.6 |
95-th percentile | 8829.601 |
Maximum | 103000 |
Range | 103000 |
Interquartile range (IQR) | 2884 |
Descriptive statistics
Standard deviation | 2963.25686 |
---|---|
Coefficient of variation (CV) | 0.8459548564 |
Kurtosis | 28.49244386 |
Mean | 3502.854599 |
Median Absolute Deviation (MAD) | 1324.8 |
Skewness | 3.117039065 |
Sum | 8909195416 |
Variance | 8780891.216 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 200456 | 7.6% |
1580 | 2372 | 0.1% |
1508 | 1974 | 0.1% |
2716 | 1633 | 0.1% |
3820 | 1096 | < 0.1% |
2000 | 1083 | < 0.1% |
2103 | 1015 | < 0.1% |
2558.4001 | 1001 | < 0.1% |
3837.4001 | 997 | < 0.1% |
1668 | 990 | < 0.1% |
Other values (76838) | 2330793 | |
(Missing) | 94885 | 3.6% |
Value | Count | Frequency (%) |
0 | 200456 | 7.6% |
2 | 1 | < 0.1% |
2.2 | 1 | < 0.1% |
2.4 | 1 | < 0.1% |
2.6000001 | 1 | < 0.1% |
Value | Count | Frequency (%) |
103000 | 1 | < 0.1% |
99646.6 | 2 | < 0.1% |
96987.9 | 1 | < 0.1% |
95685.2 | 1 | < 0.1% |
94012.2 | 1 | < 0.1% |
approvaldate_319D
Text
MISSING
 
Distinct | 5398 |
---|---|
Distinct (%) | 0.4% |
Missing | 1244273 |
Missing (%) | 47.2% |
Memory size | 20.1 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 13940220 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 6 |
---|---|
Unique (%) | < 0.1% |
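Because every non-missing value in this column is a 10-character date string of the form YYYY-MM-DD, the character totals can be derived from the row counts alone. A brief sketch of that arithmetic (variable names are illustrative):

```python
# Values copied from the tables above.
n_observations = 2_638_295
n_missing = 1_244_273
string_length = 10  # min, median, mean and max length are all 10

n_valid = n_observations - n_missing

# Each date contributes 10 characters: 8 digits and 2 dashes.
total_characters = n_valid * string_length
digits = n_valid * 8
dashes = n_valid * 2
dash_share_pct = 100 * dashes / total_characters

print(total_characters, digits, dashes, f"{dash_share_pct:.1f}%")
```

These reproduce the reported total of 13940220 characters, the 11152176 decimal digits, and the 2788044 dashes (20.0%).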
Sample
1st row | 2019-10-28 |
---|---|
2nd row | 2019-09-13 |
3rd row | 2019-10-09 |
4th row | 2019-12-01 |
5th row | 2019-10-27 |
Value | Count | Frequency (%) |
2019-12-14 | 1795 | 0.1% |
2019-12-13 | 1717 | 0.1% |
2019-09-21 | 1506 | 0.1% |
2019-08-30 | 1503 | 0.1% |
2018-12-07 | 1474 | 0.1% |
2019-12-27 | 1454 | 0.1% |
2019-09-20 | 1449 | 0.1% |
2019-11-30 | 1432 | 0.1% |
2019-11-29 | 1419 | 0.1% |
2019-06-28 | 1403 | 0.1% |
Other values (5388) | 1378870 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 3265444 | |
- | 2788044 | |
1 | 2513445 | |
2 | 2385626 | |
9 | 614752 | 4.4% |
8 | 543699 | 3.9% |
7 | 448386 | 3.2% |
3 | 393048 | 2.8% |
6 | 367360 | 2.6% |
5 | 313079 | 2.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 11152176 | |
Dash Punctuation | 2788044 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 3265444 | |
1 | 2513445 | |
2 | 2385626 | |
9 | 614752 | 5.5% |
8 | 543699 | 4.9% |
7 | 448386 | 4.0% |
3 | 393048 | 3.5% |
6 | 367360 | 3.3% |
5 | 313079 | 2.8% |
4 | 307337 | 2.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2788044 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 13940220 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 3265444 | |
- | 2788044 | |
1 | 2513445 | |
2 | 2385626 | |
9 | 614752 | 4.4% |
8 | 543699 | 3.9% |
7 | 448386 | 3.2% |
3 | 393048 | 2.8% |
6 | 367360 | 2.6% |
5 | 313079 | 2.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 13940220 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 3265444 | |
- | 2788044 | |
1 | 2513445 | |
2 | 2385626 | |
9 | 614752 | 4.4% |
8 | 543699 | 3.9% |
7 | 448386 | 3.2% |
3 | 393048 | 2.8% |
6 | 367360 | 2.6% |
5 | 313079 | 2.2% |
byoccupationinc_3656910L
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 17995 |
---|---|
Distinct (%) | 3.3% |
Missing | 2095507 |
Missing (%) | 79.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20411.15626 |
Minimum | 0 |
---|---|
Maximum | 200000 |
Zeros | 34448 |
Zeros (%) | 1.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 10000 |
Q3 | 30000 |
95-th percentile | 75000 |
Maximum | 200000 |
Range | 200000 |
Interquartile range (IQR) | 29999 |
Descriptive statistics
Standard deviation | 30931.99075 |
---|---|
Coefficient of variation (CV) | 1.515445296 |
Kurtosis | 10.36416476 |
Mean | 20411.15626 |
Median Absolute Deviation (MAD) | 9999 |
Skewness | 2.743120942 |
Sum | 1.107893068 × 10¹⁰ |
Variance | 956788051.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 224806 | 8.5% |
0 | 34448 | 1.3% |
15000 | 27344 | 1.0% |
20000 | 23247 | 0.9% |
30000 | 21042 | 0.8% |
25000 | 19112 | 0.7% |
50000 | 17923 | 0.7% |
10000 | 13261 | 0.5% |
35000 | 10526 | 0.4% |
40000 | 10433 | 0.4% |
Other values (17985) | 140646 | 5.3% |
(Missing) | 2095507 | 79.4% |
Value | Count | Frequency (%) |
0 | 34448 | 1.3% |
1 | 224806 | 8.5% |
2 | 3 | < 0.1% |
3 | 1 | < 0.1% |
4 | 2 | < 0.1% |
Value | Count | Frequency (%) |
200000 | 3856 | 0.1% |
199000 | 13 | < 0.1% |
198000 | 12 | < 0.1% |
197000 | 6 | < 0.1% |
196300 | 3 | < 0.1% |
Distinct | 73 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 20.1 MiB |
Length
Max length | 12 |
---|---|
Median length | 8 |
Mean length | 8.961243909 |
Min length | 8 |
Characters and Unicode
Total characters | 23642405 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 4 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | P94_109_143 |
---|---|
2nd row | P94_109_143 |
3rd row | a55475b1 |
4th row | P94_109_143 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 1723449 | |
p94_109_143 | 654286 | 24.8% |
p30_86_84 | 48402 | 1.8% |
p180_60_137 | 31915 | 1.2% |
p198_89_166 | 26012 | 1.0% |
p73_130_169 | 25691 | 1.0% |
p85_114_140 | 23658 | 0.9% |
p52_67_90 | 18020 | 0.7% |
p24_27_36 | 14627 | 0.6% |
p69_72_116 | 12954 | 0.5% |
Other values (63) | 59281 | 2.2% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 5244804 | |
1 | 3440737 | |
4 | 3170300 | |
7 | 1838516 | 7.8% |
_ | 1829692 | 7.7% |
a | 1723449 | 7.3% |
b | 1723449 | 7.3% |
9 | 1447957 | 6.1% |
P | 914846 | 3.9% |
0 | 873198 | 3.7% |
Other values (4) | 1435457 | 6.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 17450969 | |
Lowercase Letter | 3446898 | 14.6% |
Connector Punctuation | 1829692 | 7.7% |
Uppercase Letter | 914846 | 3.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 5244804 | |
1 | 3440737 | |
4 | 3170300 | |
7 | 1838516 | 10.5% |
9 | 1447957 | 8.3% |
0 | 873198 | 5.0% |
3 | 840337 | 4.8% |
6 | 265561 | 1.5% |
8 | 233431 | 1.3% |
2 | 96128 | 0.6% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 1723449 | |
b | 1723449 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1829692 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 914846 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 19280661 | |
Latin | 4361744 | 18.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 5244804 | |
1 | 3440737 | |
4 | 3170300 | |
7 | 1838516 | 9.5% |
_ | 1829692 | 9.5% |
9 | 1447957 | 7.5% |
0 | 873198 | 4.5% |
3 | 840337 | 4.4% |
6 | 265561 | 1.4% |
8 | 233431 | 1.2% |
Latin
Value | Count | Frequency (%) |
a | 1723449 | |
b | 1723449 | |
P | 914846 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 23642405 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 5244804 | |
1 | 3440737 | |
4 | 3170300 | |
7 | 1838516 | 7.8% |
_ | 1829692 | 7.7% |
a | 1723449 | 7.3% |
b | 1723449 | 7.3% |
9 | 1447957 | 6.1% |
P | 914846 | 3.9% |
0 | 873198 | 3.7% |
Other values (4) | 1435457 | 6.1% |
childnum_21L
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 19 |
---|---|
Distinct (%) | < 0.1% |
Missing | 1605531 |
Missing (%) | 60.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.8416530785 |
Minimum | 0 |
---|---|
Maximum | 20 |
Zeros | 574109 |
Zeros (%) | 21.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 3 |
Maximum | 20 |
Range | 20 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 1.238723149 |
---|---|
Coefficient of variation (CV) | 1.471774037 |
Kurtosis | 6.280759591 |
Mean | 0.8416530785 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.047090682 |
Sum | 869229 |
Variance | 1.534435041 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 574109 | 21.8% |
1 | 220608 | 8.4% |
2 | 142948 | 5.4% |
3 | 52869 | 2.0% |
4 | 22176 | 0.8% |
5 | 11446 | 0.4% |
6 | 5079 | 0.2% |
7 | 1959 | 0.1% |
8 | 813 | < 0.1% |
9 | 406 | < 0.1% |
Other values (9) | 351 | < 0.1% |
(Missing) | 1605531 | 60.9% |
Value | Count | Frequency (%) |
0 | 574109 | 21.8% |
1 | 220608 | 8.4% |
2 | 142948 | 5.4% |
3 | 52869 | 2.0% |
4 | 22176 | 0.8% |
Value | Count | Frequency (%) |
20 | 10 | < 0.1% |
17 | 1 | < 0.1% |
16 | 1 | < 0.1% |
15 | 8 | < 0.1% |
14 | 7 | < 0.1% |
Distinct | 5402 |
---|---|
Distinct (%) | 0.2% |
Missing | 31 |
Missing (%) | < 0.1% |
Memory size | 20.1 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 26382640 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 5 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2018-11-20 |
---|---|
2nd row | 2019-12-26 |
3rd row | 2014-07-17 |
4th row | 2017-08-21 |
5th row | 2014-12-28 |
Value | Count | Frequency (%) |
2019-12-13 | 3083 | 0.1% |
2019-12-27 | 2862 | 0.1% |
2019-12-14 | 2856 | 0.1% |
2020-01-01 | 2836 | 0.1% |
2019-12-02 | 2740 | 0.1% |
2019-08-30 | 2716 | 0.1% |
2019-09-30 | 2693 | 0.1% |
2019-09-27 | 2674 | 0.1% |
2019-11-29 | 2672 | 0.1% |
2020-01-10 | 2637 | 0.1% |
Other values (5392) | 2610495 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 6241503 | |
- | 5276528 | |
1 | 4657136 | |
2 | 4569753 | |
9 | 1160245 | 4.4% |
8 | 996039 | 3.8% |
7 | 821862 | 3.1% |
3 | 749668 | 2.8% |
6 | 685941 | 2.6% |
4 | 617725 | 2.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 21106112 | |
Dash Punctuation | 5276528 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 6241503 | |
1 | 4657136 | |
2 | 4569753 | |
9 | 1160245 | 5.5% |
8 | 996039 | 4.7% |
7 | 821862 | 3.9% |
3 | 749668 | 3.6% |
6 | 685941 | 3.2% |
4 | 617725 | 2.9% |
5 | 606240 | 2.9% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 5276528 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 26382640 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 6241503 | |
- | 5276528 | |
1 | 4657136 | |
2 | 4569753 | |
9 | 1160245 | 4.4% |
8 | 996039 | 3.8% |
7 | 821862 | 3.1% |
3 | 749668 | 2.8% |
6 | 685941 | 2.6% |
4 | 617725 | 2.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 26382640 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 6241503 | |
- | 5276528 | |
1 | 4657136 | |
2 | 4569753 | |
9 | 1160245 | 4.4% |
8 | 996039 | 3.8% |
7 | 821862 | 3.1% |
3 | 749668 | 2.8% |
6 | 685941 | 2.6% |
4 | 617725 | 2.3% |
credacc_actualbalance_314A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 48131 |
---|---|
Distinct (%) | 31.7% |
Missing | 2486439 |
Missing (%) | 94.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16055.30345 |
Minimum | -134008.42 |
---|---|
Maximum | 1600000 |
Zeros | 47706 |
Zeros (%) | 1.8% |
Negative | 526 |
Negative (%) | < 0.1% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | -134008.42 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 182 |
Q3 | 23136 |
95-th percentile | 75998 |
Maximum | 1600000 |
Range | 1734008.42 |
Interquartile range (IQR) | 23136 |
Descriptive statistics
Standard deviation | 27948.64962 |
---|---|
Coefficient of variation (CV) | 1.740773677 |
Kurtosis | 88.97889429 |
Mean | 16055.30345 |
Median Absolute Deviation (MAD) | 182 |
Skewness | 4.150777153 |
Sum | 2438094161 |
Variance | 781127015.3 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 47706 | 1.8% |
100000 | 2563 | 0.1% |
2 | 785 | < 0.1% |
0.2 | 509 | < 0.1% |
190 | 462 | < 0.1% |
10 | 462 | < 0.1% |
4 | 450 | < 0.1% |
42640 | 437 | < 0.1% |
12000 | 435 | < 0.1% |
20300 | 427 | < 0.1% |
Other values (48121) | 97620 | 3.7% |
(Missing) | 2486439 | 94.2% |
Value | Count | Frequency (%) |
-134008.42 | 1 | < 0.1% |
-99800 | 1 | < 0.1% |
-94996 | 1 | < 0.1% |
-83473.94 | 1 | < 0.1% |
-70909.414 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1600000 | 1 | < 0.1% |
952181.6 | 1 | < 0.1% |
459241.6 | 1 | < 0.1% |
419952.4 | 1 | < 0.1% |
400000.28 | 2 | < 0.1% |
credacc_credlmt_575A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 30376 |
---|---|
Distinct (%) | 1.2% |
Missing | 75062 |
Missing (%) | 2.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3608.607928 |
Minimum | 0 |
---|---|
Maximum | 400000 |
Zeros | 2369092 |
Zeros (%) | 89.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 24000 |
Maximum | 400000 |
Range | 400000 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 16507.4851 |
---|---|
Coefficient of variation (CV) | 4.574474543 |
Kurtosis | 68.64487772 |
Mean | 3608.607928 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 6.800101576 |
Sum | 9249702926 |
Variance | 272497064.5 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 2369092 | 89.8% |
100000 | 21455 | 0.8% |
12000 | 6982 | 0.3% |
40000 | 4799 | 0.2% |
20000 | 4736 | 0.2% |
60000 | 3721 | 0.1% |
150000 | 2546 | 0.1% |
30000 | 2469 | 0.1% |
10000 | 2161 | 0.1% |
24000 | 1817 | 0.1% |
Other values (30366) | 143455 | 5.4% |
(Missing) | 75062 | 2.8% |
Value | Count | Frequency (%) |
0 | 2369092 | 89.8% |
0.2 | 773 | < 0.1% |
0.8 | 4 | < 0.1% |
1.2 | 2 | < 0.1% |
3 | 2 | < 0.1% |
Value | Count | Frequency (%) |
400000 | 195 | < 0.1% |
394600 | 2 | < 0.1% |
391400 | 2 | < 0.1% |
382200 | 2 | < 0.1% |
377400 | 1 | < 0.1% |
credacc_maxhisbal_375A
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 39326 |
---|---|
Distinct (%) | 25.9% |
Missing | 2486439 |
Missing (%) | 94.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -1878.262158 |
Minimum | -290265.1 |
---|---|
Maximum | 3800000 |
Zeros | 84335 |
Zeros (%) | 3.2% |
Negative | 21622 |
Negative (%) | 0.8% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | -290265.1 |
---|---|
5-th percentile | -29680.941 |
Q1 | 0 |
median | 0 |
Q3 | 7.651499925 |
95-th percentile | 3544.169 |
Maximum | 3800000 |
Range | 4090265.1 |
Interquartile range (IQR) | 7.651499925 |
Descriptive statistics
Standard deviation | 29606.77848 |
---|---|
Coefficient of variation (CV) | -15.76285736 |
Kurtosis | 4283.776058 |
Mean | -1878.262158 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 43.42768751 |
Sum | -285225378.2 |
Variance | 876561331.7 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 84335 | 3.2% |
2 | 1352 | 0.1% |
4 | 683 | < 0.1% |
10 | 444 | < 0.1% |
190 | 435 | < 0.1% |
22 | 374 | < 0.1% |
6 | 367 | < 0.1% |
180 | 328 | < 0.1% |
90 | 307 | < 0.1% |
80 | 282 | < 0.1% |
Other values (39316) | 62949 | 2.4% |
(Missing) | 2486439 | 94.2% |
Value | Count | Frequency (%) |
-290265.1 | 1 | < 0.1% |
-199950 | 1 | < 0.1% |
-198762 | 1 | < 0.1% |
-197850.3 | 1 | < 0.1% |
-196450 | 1 | < 0.1% |
Value | Count | Frequency (%) |
3800000 | 1 | < 0.1% |
3640000 | 1 | < 0.1% |
2400200 | 1 | < 0.1% |
2000000 | 2 | < 0.1% |
1999990 | 1 | < 0.1% |
credacc_minhisbal_90A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 37539 |
---|---|
Distinct (%) | 24.7% |
Missing | 2486439 |
Missing (%) | 94.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -5670.541866 |
Minimum | -350532.6 |
---|---|
Maximum | 239000 |
Zeros | 89232 |
Zeros (%) | 3.4% |
Negative | 29161 |
Negative (%) | 1.1% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | -350532.6 |
---|---|
5-th percentile | -39975 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 157.7795 |
Maximum | 239000 |
Range | 589532.6 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 18143.37868 |
---|---|
Coefficient of variation (CV) | -3.199584644 |
Kurtosis | 27.84215621 |
Mean | -5670.541866 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -4.501135816 |
Sum | -861105805.7 |
Variance | 329182189.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 89232 | 3.4% |
2 | 998 | < 0.1% |
4 | 564 | < 0.1% |
10 | 444 | < 0.1% |
190 | 430 | < 0.1% |
-10 | 366 | < 0.1% |
6 | 325 | < 0.1% |
180 | 324 | < 0.1% |
90 | 310 | < 0.1% |
80 | 269 | < 0.1% |
Other values (37529) | 58594 | 2.2% |
(Missing) | 2486439 | 94.2% |
Value | Count | Frequency (%) |
-350532.6 | 1 | < 0.1% |
-319998.6 | 1 | < 0.1% |
-319856 | 1 | < 0.1% |
-309628.03 | 1 | < 0.1% |
-299717.5 | 1 | < 0.1% |
Value | Count | Frequency (%) |
239000 | 1 | < 0.1% |
120000 | 1 | < 0.1% |
101840 | 1 | < 0.1% |
100000 | 2 | < 0.1% |
99990 | 1 | < 0.1% |
credacc_status_367L
Text
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 2486439 |
Missing (%) | 94.2% |
Memory size | 20.1 MiB |
Value | Count | Frequency (%) |
ac | 100509 | |
cl | 43060 | |
ca | 7098 | 4.7% |
pcl | 916 | 0.6% |
po | 249 | 0.2% |
cr | 24 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
C | 151607 | |
A | 107607 | |
L | 43976 | 14.4% |
P | 1165 | 0.4% |
O | 249 | 0.1% |
R | 24 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 304628 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
C | 151607 | |
A | 107607 | |
L | 43976 | 14.4% |
P | 1165 | 0.4% |
O | 249 | 0.1% |
R | 24 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 304628 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
C | 151607 | |
A | 107607 | |
L | 43976 | 14.4% |
P | 1165 | 0.4% |
O | 249 | 0.1% |
R | 24 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 304628 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
C | 151607 | |
A | 107607 | |
L | 43976 | 14.4% |
P | 1165 | 0.4% |
O | 249 | 0.1% |
R | 24 | < 0.1% |
credacc_transactions_402L
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 93 |
---|---|
Distinct (%) | 0.1% |
Missing | 2486439 |
Missing (%) | 94.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.5760128016 |
Minimum | 0 |
---|---|
Maximum | 147 |
Zeros | 134552 |
Zeros (%) | 5.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 3 |
Maximum | 147 |
Range | 147 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 3.226973612 |
---|---|
Coefficient of variation (CV) | 5.602260233 |
Kurtosis | 273.1004518 |
Mean | 0.5760128016 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 13.22151442 |
Sum | 87471 |
Variance | 10.41335869 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 134552 | 5.1% |
1 | 6112 | 0.2% |
2 | 2671 | 0.1% |
3 | 2561 | 0.1% |
4 | 1221 | < 0.1% |
5 | 851 | < 0.1% |
6 | 560 | < 0.1% |
7 | 473 | < 0.1% |
8 | 368 | < 0.1% |
9 | 290 | < 0.1% |
Other values (83) | 2197 | 0.1% |
(Missing) | 2486439 | 94.2% |
Value | Count | Frequency (%) |
0 | 134552 | 5.1% |
1 | 6112 | 0.2% |
2 | 2671 | 0.1% |
3 | 2561 | 0.1% |
4 | 1221 | < 0.1% |
Value | Count | Frequency (%) |
147 | 1 | < 0.1% |
135 | 1 | < 0.1% |
119 | 1 | < 0.1% |
118 | 1 | < 0.1% |
117 | 1 | < 0.1% |
credamount_590A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 165672 |
---|---|
Distinct (%) | 6.5% |
Missing | 78869 |
Missing (%) | 3.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 42985.43599 |
Minimum | 0 |
---|---|
Maximum | 1000000 |
Zeros | 73846 |
Zeros (%) | 2.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 4178 |
Q1 | 14236 |
median | 29978 |
Q3 | 59087.7005 |
95-th percentile | 127199.85 |
Maximum | 1000000 |
Range | 1000000 |
Interquartile range (IQR) | 44851.7005 |
Descriptive statistics
Standard deviation | 45796.84328 |
---|---|
Coefficient of variation (CV) | 1.065403717 |
Kurtosis | 14.53568459 |
Mean | 42985.43599 |
Median Absolute Deviation (MAD) | 17980 |
Skewness | 2.97161691 |
Sum | 1.100180425 × 10¹¹ |
Variance | 2097350854 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
100000 | 137768 | 5.2% |
60000 | 134159 | 5.1% |
40000 | 127555 | 4.8% |
20000 | 122036 | 4.6% |
30000 | 98139 | 3.7% |
0 | 73846 | 2.8% |
50000 | 46978 | 1.8% |
10000 | 38675 | 1.5% |
200000 | 34783 | 1.3% |
80000 | 34107 | 1.3% |
Other values (165662) | 1711380 | |
(Missing) | 78869 | 3.0% |
Value | Count | Frequency (%) |
0 | 73846 | 2.8% |
0.2 | 410 | < 0.1% |
0.8 | 2 | < 0.1% |
1.2 | 1 | < 0.1% |
3 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1000000 | 7 | < 0.1% |
950000 | 1 | < 0.1% |
900000 | 1 | < 0.1% |
800000 | 3 | < 0.1% |
700000 | 1 | < 0.1% |
credtype_587L
Text
MISSING
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 78869 |
Missing (%) | 3.0% |
Memory size | 20.1 MiB |
Value | Count | Frequency (%) |
col | 1249129 | |
cal | 1097416 | |
rel | 212881 | 8.3% |
Most occurring characters
Value | Count | Frequency (%) |
L | 2559426 | |
C | 2346545 | |
O | 1249129 | |
A | 1097416 | |
R | 212881 | 2.8% |
E | 212881 | 2.8% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 7678278 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
L | 2559426 | |
C | 2346545 | |
O | 1249129 | |
A | 1097416 | |
R | 212881 | 2.8% |
E | 212881 | 2.8% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 7678278 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
L | 2559426 | |
C | 2346545 | |
O | 1249129 | |
A | 1097416 | |
R | 212881 | 2.8% |
E | 212881 | 2.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 7678278 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
L | 2559426 | |
C | 2346545 | |
O | 1249129 | |
A | 1097416 | |
R | 212881 | 2.8% |
E | 212881 | 2.8% |
currdebt_94A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 201408 |
---|---|
Distinct (%) | 12.1% |
Missing | 976135 |
Missing (%) | 37.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5301.262335 |
Minimum | 0 |
---|---|
Maximum | 482980.84 |
Zeros | 1441963 |
Zeros (%) | 54.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 35702.45575 |
Maximum | 482980.84 |
Range | 482980.84 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 20463.6821 |
---|---|
Coefficient of variation (CV) | 3.860152696 |
Kurtosis | 45.25006491 |
Mean | 5301.262335 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.880501606 |
Sum | 8811546203 |
Variance | 418762285 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1441963 | 54.7% |
100000 | 96 | < 0.1% |
9998 | 91 | < 0.1% |
11998 | 78 | < 0.1% |
17998 | 68 | < 0.1% |
7998 | 66 | < 0.1% |
19998 | 63 | < 0.1% |
5998 | 59 | < 0.1% |
60000 | 57 | < 0.1% |
13998 | 55 | < 0.1% |
Other values (201398) | 219564 | 8.3% |
(Missing) | 976135 | 37.0% |
Value | Count | Frequency (%) |
0 | 1441963 | 54.7% |
0.002 | 1 | < 0.1% |
0.006 | 1 | < 0.1% |
0.020000001 | 1 | < 0.1% |
0.06600001 | 1 | < 0.1% |
Value | Count | Frequency (%) |
482980.84 | 1 | < 0.1% |
476617.3 | 1 | < 0.1% |
473946.3 | 1 | < 0.1% |
459136.4 | 1 | < 0.1% |
458601.6 | 1 | < 0.1% |
dateactivated_425D
Text
MISSING
 
Distinct | 4211 |
---|---|
Distinct (%) | 0.3% |
Missing | 1297051 |
Missing (%) | 49.2% |
Memory size | 20.1 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 13412440 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 51 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2019-11-06 |
---|---|
2nd row | 2019-09-19 |
3rd row | 2019-10-21 |
4th row | 2019-12-06 |
5th row | 2019-11-05 |
Value | Count | Frequency (%) |
2019-01-31 | 2342 | 0.2% |
2019-10-23 | 2070 | 0.2% |
2019-10-29 | 1955 | 0.1% |
2019-10-21 | 1954 | 0.1% |
2020-01-08 | 1937 | 0.1% |
2019-12-11 | 1933 | 0.1% |
2019-10-28 | 1892 | 0.1% |
2020-01-02 | 1863 | 0.1% |
2020-01-03 | 1850 | 0.1% |
2019-11-01 | 1837 | 0.1% |
Other values (4201) | 1321611 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 3156125 | |
- | 2682488 | |
1 | 2442990 | |
2 | 2273853 | |
9 | 579514 | 4.3% |
8 | 527963 | 3.9% |
7 | 429076 | 3.2% |
3 | 364959 | 2.7% |
6 | 355623 | 2.7% |
5 | 301692 | 2.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 10729952 | |
Dash Punctuation | 2682488 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 3156125 | |
1 | 2442990 | |
2 | 2273853 | |
9 | 579514 | 5.4% |
8 | 527963 | 4.9% |
7 | 429076 | 4.0% |
3 | 364959 | 3.4% |
6 | 355623 | 3.3% |
5 | 301692 | 2.8% |
4 | 298157 | 2.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2682488 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 13412440 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 3156125 | |
- | 2682488 | |
1 | 2442990 | |
2 | 2273853 | |
9 | 579514 | 4.3% |
8 | 527963 | 3.9% |
7 | 429076 | 3.2% |
3 | 364959 | 2.7% |
6 | 355623 | 2.7% |
5 | 301692 | 2.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 13412440 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 3156125 | |
- | 2682488 | |
1 | 2442990 | |
2 | 2273853 | |
9 | 579514 | 4.3% |
8 | 527963 | 3.9% |
7 | 429076 | 3.2% |
3 | 364959 | 2.7% |
6 | 355623 | 2.7% |
5 | 301692 | 2.2% |
district_544M
Text
Distinct | 1030 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 20.1 MiB |
Length
Max length | 12 |
---|---|
Median length | 11 |
Mean length | 10.29188358 |
Min length | 8 |
Characters and Unicode
Total characters | 27153025 |
---|---|
Distinct characters | 18 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 134 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | P147_6_101 |
---|---|
2nd row | P111_148_100 |
3rd row | a55475b1 |
4th row | P19_11_176 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 391791 | 14.9% |
P131_33_167 | 101299 | 3.8% |
P123_6_84 | 92997 | 3.5% |
P197_47_166 | 70453 | 2.7% |
P204_99_158 | 66400 | 2.5% |
P98_137_111 | 53077 | 2.0% |
P62_144_102 | 49284 | 1.9% |
P159_143_123 | 48525 | 1.8% |
P111_135_181 | 48495 | 1.8% |
P147_21_170 | 47763 | 1.8% |
Other values (1020) | 1668211 | 63.2% |
Most occurring characters
Value | Count | Frequency (%) |
1 | 5833743 | |
_ | 4493008 | |
5 | 2412231 | |
P | 2246502 | 8.3% |
7 | 2104439 | 7.8% |
4 | 1805523 | 6.6% |
6 | 1388964 | 5.1% |
3 | 1362572 | 5.0% |
2 | 1336975 | 4.9% |
8 | 1244108 | 4.6% |
Other values (8) | 2924960 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 19629925 | |
Connector Punctuation | 4493008 | 16.5% |
Uppercase Letter | 2246504 | 8.3% |
Lowercase Letter | 783588 | 2.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 5833743 | |
5 | 2412231 | |
7 | 2104439 | 10.7% |
4 | 1805523 | 9.2% |
6 | 1388964 | 7.1% |
3 | 1362572 | 6.9% |
2 | 1336975 | 6.8% |
8 | 1244108 | 6.3% |
9 | 1231948 | 6.3% |
0 | 909422 | 4.6% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 391791 | |
b | 391791 | |
t | 2 | < 0.1% |
h | 2 | < 0.1% |
e | 2 | < 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 2246502 | |
Q | 2 | < 0.1% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 4493008 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 24122933 | |
Latin | 3030092 | 11.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 5833743 | |
_ | 4493008 | |
5 | 2412231 | |
7 | 2104439 | 8.7% |
4 | 1805523 | 7.5% |
6 | 1388964 | 5.8% |
3 | 1362572 | 5.6% |
2 | 1336975 | 5.5% |
8 | 1244108 | 5.2% |
9 | 1231948 | 5.1% |
Latin
Value | Count | Frequency (%) |
P | 2246502 | |
a | 391791 | 12.9% |
b | 391791 | 12.9% |
Q | 2 | < 0.1% |
t | 2 | < 0.1% |
h | 2 | < 0.1% |
e | 2 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 27153025 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 5833743 | |
_ | 4493008 | |
5 | 2412231 | |
P | 2246502 | 8.3% |
7 | 2104439 | 7.8% |
4 | 1805523 | 6.6% |
6 | 1388964 | 5.1% |
3 | 1362572 | 5.0% |
2 | 1336975 | 4.9% |
8 | 1244108 | 4.6% |
Other values (8) | 2924960 |
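The Frequency (%) column for district_544M can be reproduced from the counts alone. The following is a minimal sketch, assuming each percentage is the count divided by the number of observations (2638295, since this column has no missing values) and rounded to one decimal, with non-zero shares that round to 0.0 printed as "< 0.1%". The function name `frequency_pct` is hypothetical and is not part of the profiling tool.

```python
def frequency_pct(count: int, total: int) -> str:
    """Format count/total as a one-decimal percentage string."""
    pct = 100.0 * count / total
    # Non-zero shares that would round to 0.0 are shown as "< 0.1%".
    if count > 0 and round(pct, 1) < 0.1:
        return "< 0.1%"
    return f"{pct:.1f}%"


N_OBSERVATIONS = 2638295  # "Number of observations" in the dataset statistics

print(frequency_pct(391791, N_OBSERVATIONS))  # share of a55475b1
print(frequency_pct(101299, N_OBSERVATIONS))  # share of P131_33_167
print(frequency_pct(2, N_OBSERVATIONS))       # a value seen only twice
```

For columns that do have missing values and no "(Missing)" row in the table, the listed percentages appear to be relative to the non-missing count instead, so `total` would need to be adjusted accordingly.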
downpmt_134A
Real number (ℝ)
MISSING
  SKEWED
  ZEROS
 
Distinct | 13796 |
---|---|
Distinct (%) | 0.5% |
Missing | 78869 |
Missing (%) | 3.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 388.9884686 |
Minimum | 0 |
---|---|
Maximum | 420400 |
Zeros | 2348164 |
Zeros (%) | 89.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 1778 |
Maximum | 420400 |
Range | 420400 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2614.634679 |
---|---|
Coefficient of variation (CV) | 6.721625163 |
Kurtosis | 1085.921984 |
Mean | 388.9884686 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 21.07123108 |
Sum | 995587200.2 |
Variance | 6836314.502 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 2348164 | 89.0% |
2000 | 25129 | 1.0% |
4000 | 16373 | 0.6% |
1000 | 15180 | 0.6% |
6000 | 8923 | 0.3% |
10000 | 7809 | 0.3% |
200 | 7538 | 0.3% |
400 | 6910 | 0.3% |
3000 | 6440 | 0.2% |
8000 | 4928 | 0.2% |
Other values (13786) | 112032 | 4.2% |
(Missing) | 78869 | 3.0% |
Value | Count | Frequency (%) |
0 | 2348164 | 89.0% |
0.2 | 198 | < 0.1% |
0.4 | 27 | < 0.1% |
0.6 | 80 | < 0.1% |
0.8 | 17 | < 0.1% |
Value | Count | Frequency (%) |
420400 | 1 | < 0.1% |
320400 | 1 | < 0.1% |
275028 | 2 | < 0.1% |
230000 | 2 | < 0.1% |
222592.2 | 1 | < 0.1% |
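The descriptive statistics reported for downpmt_134A are mutually consistent, which can be checked from the summary values alone. The sketch below assumes the usual definitions (mean = sum / non-missing count, variance = standard deviation squared, CV = standard deviation / mean, IQR = Q3 - Q1); the variable names are illustrative and not taken from the profiling tool.

```python
import math

# Values copied from the downpmt_134A summary above.
n_observations = 2638295   # from the dataset statistics
n_missing = 78869
mean = 388.9884686
std = 2614.634679
total = 995587200.2        # reported "Sum"
q1, q3 = 0.0, 0.0

# Statistics are computed over non-missing rows only.
n_valid = n_observations - n_missing

derived = {
    "mean": total / n_valid,
    "variance": std ** 2,
    "cv": std / mean,
    "iqr": q3 - q1,
}

for name, value in derived.items():
    print(f"{name}: {value:.6f}")
```

An IQR and MAD of 0 alongside a standard deviation above 2600 reflect the zero-inflated shape flagged by the ZEROS and SKEWED alerts: 89.0% of rows are exactly 0, so the quartiles all collapse to 0 while a small number of large values drive the variance.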
dtlastpmt_581D
Text
MISSING
 
Distinct | 2353 |
---|---|
Distinct (%) | 0.3% |
Missing | 1890009 |
Missing (%) | 71.6% |
Memory size | 20.1 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 7482860 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 179 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2019-12-15 |
---|---|
2nd row | 2019-12-15 |
3rd row | 2019-12-26 |
4th row | 2019-12-05 |
5th row | 2019-12-28 |
Value | Count | Frequency (%) |
2019-09-16 | 25909 | 3.5% |
2019-12-17 | 2089 | 0.3% |
2019-12-13 | 1475 | 0.2% |
2019-12-25 | 1338 | 0.2% |
2019-09-19 | 1300 | 0.2% |
2019-12-24 | 1251 | 0.2% |
2019-12-23 | 1234 | 0.2% |
2019-11-15 | 1186 | 0.2% |
2020-01-20 | 1179 | 0.2% |
2019-12-26 | 1179 | 0.2% |
Other values (2343) | 710146 | 94.9% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 1747905 | |
- | 1496572 | |
2 | 1328877 | |
1 | 1267565 | |
9 | 415093 | 5.5% |
8 | 311251 | 4.2% |
7 | 255209 | 3.4% |
6 | 238229 | 3.2% |
3 | 164963 | 2.2% |
5 | 135974 | 1.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 5986288 | |
Dash Punctuation | 1496572 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 1747905 | |
2 | 1328877 | |
1 | 1267565 | |
9 | 415093 | 6.9% |
8 | 311251 | 5.2% |
7 | 255209 | 4.3% |
6 | 238229 | 4.0% |
3 | 164963 | 2.8% |
5 | 135974 | 2.3% |
4 | 121222 | 2.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1496572 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 7482860 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 1747905 | |
- | 1496572 | |
2 | 1328877 | |
1 | 1267565 | |
9 | 415093 | 5.5% |
8 | 311251 | 4.2% |
7 | 255209 | 3.4% |
6 | 238229 | 3.2% |
3 | 164963 | 2.2% |
5 | 135974 | 1.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 7482860 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 1747905 | |
- | 1496572 | |
2 | 1328877 | |
1 | 1267565 | |
9 | 415093 | 5.5% |
8 | 311251 | 4.2% |
7 | 255209 | 3.4% |
6 | 238229 | 3.2% |
3 | 164963 | 2.2% |
5 | 135974 | 1.8% |
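dtlastpmt_581D is typed as Text, yet every value is exactly 10 characters long and consists only of digits and dashes, which is the pattern of ISO `YYYY-MM-DD` dates. The sketch below shows one way such strings could be converted to real dates before analysis. It assumes missing entries arrive as `None`; the helper name `parse_iso_date` is hypothetical.

```python
from datetime import date
from typing import Optional


def parse_iso_date(value: Optional[str]) -> Optional[date]:
    """Parse a 'YYYY-MM-DD' string; return None for missing or malformed input."""
    if value is None:
        return None
    try:
        return date.fromisoformat(value)
    except ValueError:
        return None


# The first three sample rows shown above, plus one missing value.
raw = ["2019-12-15", "2019-12-15", "2019-12-26", None]
parsed = [parse_iso_date(v) for v in raw]
valid = [d for d in parsed if d is not None]

print(min(valid), max(valid), len(parsed) - len(valid))
```

Once parsed, the column can be summarized by range and by gaps between dates rather than by character counts, which carry little information for date strings.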
dtlastpmtallstes_3545839D
Text
MISSING
 
Distinct | 2365 |
---|---|
Distinct (%) | 0.2% |
Missing | 1609466 |
Missing (%) | 61.0% |
Memory size | 20.1 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 10288290 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 174 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2019-12-28 |
---|---|
2nd row | 2019-12-15 |
3rd row | 2019-12-19 |
4th row | 2019-12-30 |
5th row | 2019-12-27 |
Value | Count | Frequency (%) |
2019-09-16 | 26380 | 2.6% |
2020-01-20 | 3832 | 0.4% |
2019-12-25 | 3506 | 0.3% |
2019-12-27 | 3456 | 0.3% |
2020-01-01 | 3395 | 0.3% |
2019-12-24 | 3365 | 0.3% |
2019-12-26 | 3346 | 0.3% |
2020-01-24 | 3264 | 0.3% |
2019-12-17 | 3250 | 0.3% |
2019-12-23 | 3196 | 0.3% |
Other values (2355) | 971839 | 94.5% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 2591633 | |
- | 2057658 | |
2 | 2034311 | |
1 | 1541821 | |
9 | 523583 | 5.1% |
8 | 374790 | 3.6% |
7 | 306382 | 3.0% |
6 | 286842 | 2.8% |
3 | 234640 | 2.3% |
5 | 178561 | 1.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 8230632 | |
Dash Punctuation | 2057658 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 2591633 | |
2 | 2034311 | |
1 | 1541821 | |
9 | 523583 | 6.4% |
8 | 374790 | 4.6% |
7 | 306382 | 3.7% |
6 | 286842 | 3.5% |
3 | 234640 | 2.9% |
5 | 178561 | 2.2% |
4 | 158069 | 1.9% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2057658 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 10288290 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 2591633 | |
- | 2057658 | |
2 | 2034311 | |
1 | 1541821 | |
9 | 523583 | 5.1% |
8 | 374790 | 3.6% |
7 | 306382 | 3.0% |
6 | 286842 | 2.8% |
3 | 234640 | 2.3% |
5 | 178561 | 1.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10288290 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 2591633 | |
- | 2057658 | |
2 | 2034311 | |
1 | 1541821 | |
9 | 523583 | 5.1% |
8 | 374790 | 3.6% |
7 | 306382 | 3.0% |
6 | 286842 | 2.8% |
3 | 234640 | 2.3% |
5 | 178561 | 1.7% |
education_1138M
Text
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 20.1 MiB |
Length
Max length | 11 |
---|---|
Median length | 8 |
Mean length | 9.101808175 |
Min length | 8 |
Characters and Unicode
Total characters | 24013255 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | a55475b1 |
---|---|
2nd row | a55475b1 |
3rd row | P97_36_170 |
4th row | a55475b1 |
5th row | P97_36_170 |
Value | Count | Frequency (%) |
a55475b1 | 1379355 | 52.3% |
P97_36_170 | 852757 | 32.3% |
P33_146_175 | 370707 | 14.1% |
P106_81_188 | 17178 | 0.7% |
P17_36_170 | 17168 | 0.7% |
P157_18_172 | 1130 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 4509902 | |
7 | 3492172 | |
1 | 3062786 | |
_ | 2517880 | |
4 | 1750062 | 7.3% |
3 | 1611339 | 6.7% |
a | 1379355 | 5.7% |
b | 1379355 | 5.7% |
P | 1258940 | 5.2% |
6 | 1257810 | 5.2% |
Other values (4) | 1793654 | 7.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 17477725 | |
Lowercase Letter | 2758710 | 11.5% |
Connector Punctuation | 2517880 | 10.5% |
Uppercase Letter | 1258940 | 5.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 4509902 | |
7 | 3492172 | |
1 | 3062786 | |
4 | 1750062 | 10.0% |
3 | 1611339 | 9.2% |
6 | 1257810 | 7.2% |
0 | 887103 | 5.1% |
9 | 852757 | 4.9% |
8 | 52664 | 0.3% |
2 | 1130 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 1379355 | |
b | 1379355 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2517880 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 1258940 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 19995605 | |
Latin | 4017650 | 16.7% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 4509902 | |
7 | 3492172 | |
1 | 3062786 | |
_ | 2517880 | |
4 | 1750062 | 8.8% |
3 | 1611339 | 8.1% |
6 | 1257810 | 6.3% |
0 | 887103 | 4.4% |
9 | 852757 | 4.3% |
8 | 52664 | 0.3% |
Latin
Value | Count | Frequency (%) |
a | 1379355 | |
b | 1379355 | |
P | 1258940 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 24013255 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 4509902 | |
7 | 3492172 | |
1 | 3062786 | |
_ | 2517880 | |
4 | 1750062 | 7.3% |
3 | 1611339 | 6.7% |
a | 1379355 | 5.7% |
b | 1379355 | 5.7% |
P | 1258940 | 5.2% |
6 | 1257810 | 5.2% |
Other values (4) | 1793654 | 7.5% |
employedfrom_700D
Text
MISSING
 
Distinct | 8343 |
---|---|
Distinct (%) | 0.9% |
Missing | 1705609 |
Missing (%) | 64.6% |
Memory size | 20.1 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 9326860 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 1996 |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 2014-01-15 |
---|---|
2nd row | 2013-04-15 |
3rd row | 2013-04-15 |
4th row | 2012-02-15 |
5th row | 2018-01-15 |
Value | Count | Frequency (%) |
2017-01-15 | 16608 | 1.8% |
2015-01-15 | 15436 | 1.7% |
2013-01-15 | 15290 | 1.6% |
2014-01-15 | 15105 | 1.6% |
2016-01-15 | 15074 | 1.6% |
2012-01-15 | 13022 | 1.4% |
2018-01-15 | 11930 | 1.3% |
2010-01-15 | 10067 | 1.1% |
2010-09-15 | 9608 | 1.0% |
2011-01-15 | 9585 | 1.0% |
Other values (8333) | 800961 | 85.9% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 2123405 | |
1 | 1975825 | |
- | 1865372 | |
2 | 1075472 | |
5 | 1043024 | |
9 | 389836 | 4.2% |
8 | 191653 | 2.1% |
6 | 171491 | 1.8% |
3 | 166811 | 1.8% |
4 | 164425 | 1.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 7461488 | |
Dash Punctuation | 1865372 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 2123405 | |
1 | 1975825 | |
2 | 1075472 | |
5 | 1043024 | |
9 | 389836 | 5.2% |
8 | 191653 | 2.6% |
6 | 171491 | 2.3% |
3 | 166811 | 2.2% |
4 | 164425 | 2.2% |
7 | 159546 | 2.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1865372 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 9326860 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 2123405 | |
1 | 1975825 | |
- | 1865372 | |
2 | 1075472 | |
5 | 1043024 | |
9 | 389836 | 4.2% |
8 | 191653 | 2.1% |
6 | 171491 | 1.8% |
3 | 166811 | 1.8% |
4 | 164425 | 1.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 9326860 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 2123405 | |
1 | 1975825 | |
- | 1865372 | |
2 | 1075472 | |
5 | 1043024 | |
9 | 389836 | 4.2% |
8 | 191653 | 2.1% |
6 | 171491 | 1.8% |
3 | 166811 | 1.8% |
4 | 164425 | 1.8% |
familystate_726L
Text
MISSING
 
Distinct | 5 |
---|---|
Distinct (%) | < 0.1% |
Missing | 1148691 |
Missing (%) | 43.5% |
Memory size | 20.1 MiB |
Value | Count | Frequency (%) |
MARRIED | 1082575 | 72.7% |
SINGLE | 201743 | 13.5% |
WIDOWED | 138790 | 9.3% |
DIVORCED | 45087 | 3.0% |
LIVING_WITH_PARTNER | 21409 | 1.4% |
Most occurring characters
Value | Count | Frequency (%) |
R | 2253055 | |
I | 1532422 | |
E | 1489604 | |
D | 1450329 | |
A | 1103984 | |
M | 1082575 | |
W | 298989 | 2.8% |
N | 244561 | 2.3% |
L | 223152 | 2.1% |
G | 223152 | 2.1% |
Other values (8) | 625657 | 5.9% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 10484662 | |
Connector Punctuation | 42818 | 0.4% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
R | 2253055 | |
I | 1532422 | |
E | 1489604 | |
D | 1450329 | |
A | 1103984 | |
M | 1082575 | |
W | 298989 | 2.9% |
N | 244561 | 2.3% |
L | 223152 | 2.1% |
G | 223152 | 2.1% |
Other values (7) | 582839 | 5.6% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 42818 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 10484662 | |
Common | 42818 | 0.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
R | 2253055 | |
I | 1532422 | |
E | 1489604 | |
D | 1450329 | |
A | 1103984 | |
M | 1082575 | |
W | 298989 | 2.9% |
N | 244561 | 2.3% |
L | 223152 | 2.1% |
G | 223152 | 2.1% |
Other values (7) | 582839 | 5.6% |
Common
Value | Count | Frequency (%) |
_ | 42818 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10527480 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
R | 2253055 | |
I | 1532422 | |
E | 1489604 | |
D | 1450329 | |
A | 1103984 | |
M | 1082575 | |
W | 298989 | 2.8% |
N | 244561 | 2.3% |
L | 223152 | 2.1% |
G | 223152 | 2.1% |
Other values (8) | 625657 | 5.9% |
firstnonzeroinstldate_307D
Text
MISSING
 
Distinct | 5153 |
---|---|
Distinct (%) | 0.2% |
Missing | 287307 |
Missing (%) | 10.9% |
Memory size | 20.1 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 23509880 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 25 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2018-12-20 |
---|---|
2nd row | 2020-01-26 |
3rd row | 2014-08-17 |
4th row | 2017-09-21 |
5th row | 2015-01-28 |
Value | Count | Frequency (%) |
2019-12-15 | 5044 | 0.2% |
2019-03-14 | 4785 | 0.2% |
2019-09-15 | 4775 | 0.2% |
2020-03-15 | 4335 | 0.2% |
2019-10-15 | 4115 | 0.2% |
2020-01-15 | 3797 | 0.2% |
2020-02-15 | 3791 | 0.2% |
2019-10-12 | 3728 | 0.2% |
2019-07-15 | 3722 | 0.2% |
2020-01-11 | 3704 | 0.2% |
Other values (5143) | 2309192 | 98.2% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 5625459 | |
- | 4701976 | |
1 | 4129869 | |
2 | 4102127 | |
9 | 990127 | 4.2% |
8 | 855287 | 3.6% |
7 | 710419 | 3.0% |
5 | 680403 | 2.9% |
3 | 621076 | 2.6% |
6 | 578670 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 18807904 | |
Dash Punctuation | 4701976 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 5625459 | |
1 | 4129869 | |
2 | 4102127 | |
9 | 990127 | 5.3% |
8 | 855287 | 4.5% |
7 | 710419 | 3.8% |
5 | 680403 | 3.6% |
3 | 621076 | 3.3% |
6 | 578670 | 3.1% |
4 | 514467 | 2.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4701976 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 23509880 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 5625459 | |
- | 4701976 | |
1 | 4129869 | |
2 | 4102127 | |
9 | 990127 | 4.2% |
8 | 855287 | 3.6% |
7 | 710419 | 3.0% |
5 | 680403 | 2.9% |
3 | 621076 | 2.6% |
6 | 578670 | 2.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 23509880 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 5625459 | |
- | 4701976 | |
1 | 4129869 | |
2 | 4102127 | |
9 | 990127 | 4.2% |
8 | 855287 | 3.6% |
7 | 710419 | 3.0% |
5 | 680403 | 2.9% |
3 | 621076 | 2.6% |
6 | 578670 | 2.5% |
inittransactioncode_279L
Text
MISSING
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 78869 |
Missing (%) | 3.0% |
Memory size | 20.1 MiB |
Value | Count | Frequency (%) |
POS | 1334600 | 52.1% |
CASH | 1097773 | 42.9% |
NDF | 127053 | 5.0% |
Most occurring characters
Value | Count | Frequency (%) |
S | 2432373 | |
P | 1334600 | |
O | 1334600 | |
C | 1097773 | |
A | 1097773 | |
H | 1097773 | |
N | 127053 | 1.4% |
D | 127053 | 1.4% |
F | 127053 | 1.4% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 8776051 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
S | 2432373 | |
P | 1334600 | |
O | 1334600 | |
C | 1097773 | |
A | 1097773 | |
H | 1097773 | |
N | 127053 | 1.4% |
D | 127053 | 1.4% |
F | 127053 | 1.4% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 8776051 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
S | 2432373 | |
P | 1334600 | |
O | 1334600 | |
C | 1097773 | |
A | 1097773 | |
H | 1097773 | |
N | 127053 | 1.4% |
D | 127053 | 1.4% |
F | 127053 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 8776051 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
S | 2432373 | |
P | 1334600 | |
O | 1334600 | |
C | 1097773 | |
A | 1097773 | |
H | 1097773 | |
N | 127053 | 1.4% |
D | 127053 | 1.4% |
F | 127053 | 1.4% |
isbidproduct_390L
Boolean
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 31 |
Missing (%) | < 0.1% |
Memory size | 20.1 MiB |
False | 2493462 |
---|---|
True | 144802 |
(Missing) | 31 |
Value | Count | Frequency (%) |
False | 2493462 | 94.5% |
True | 144802 | 5.5% |
(Missing) | 31 | < 0.1% |
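The dataset overview flags isbidproduct_390L as highly imbalanced (69.3%). The report does not state how that score is computed. The sketch below assumes it is one minus the Shannon entropy of the class shares, normalized by the logarithm of the number of classes and computed over non-missing values; under that assumption the counts above reproduce the reported figure. The function name `imbalance_score` is illustrative.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k), with H the Shannon entropy of the class shares.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is scored as 0 in this sketch
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# False and True counts for isbidproduct_390L; the 31 missing rows are excluded.
score = imbalance_score([2493462, 144802])
print(f"{100 * score:.1f}%")
```

A perfectly balanced Boolean column would score 0 under this definition, so a value near 0.69 signals that a classifier using this feature sees the True case in only about one row in eighteen.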
isdebitcard_527L
Boolean
MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 2425416 |
Missing (%) | 91.9% |
Memory size | 20.1 MiB |
False | 146098 |
---|---|
True | 66781 |
(Missing) | 2425416 |
Value | Count | Frequency (%) |
False | 146098 | 5.5% |
True | 66781 | 2.5% |
(Missing) | 2425416 | 91.9% |
mainoccupationinc_437A
Real number (ℝ)
MISSING
 
Distinct | 17340 |
---|---|
Distinct (%) | 0.7% |
Missing | 65371 |
Missing (%) | 2.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 43046.11571 |
Minimum | 0 |
---|---|
Maximum | 199600 |
Zeros | 32 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 6315.8003 |
Q1 | 20000 |
median | 37000 |
Q3 | 58000 |
95-th percentile | 100000 |
Maximum | 199600 |
Range | 199600 |
Interquartile range (IQR) | 38000 |
Descriptive statistics
Standard deviation | 32550.06984 |
---|---|
Coefficient of variation (CV) | 0.7561674102 |
Kurtosis | 4.670832521 |
Mean | 43046.11571 |
Median Absolute Deviation (MAD) | 18000 |
Skewness | 1.769823701 |
Sum | 1.107543842 × 10^11 |
Variance | 1059507046 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
40000 | 186417 | 7.1% |
30000 | 183448 | 7.0% |
50000 | 162276 | 6.2% |
60000 | 135382 | 5.1% |
20000 | 91145 | 3.5% |
70000 | 87299 | 3.3% |
24000 | 79960 | 3.0% |
36000 | 76607 | 2.9% |
80000 | 47766 | 1.8% |
100000 | 45645 | 1.7% |
Other values (17330) | 1476979 | 56.0% |
(Missing) | 65371 | 2.5% |
Value | Count | Frequency (%) |
0 | 32 | < 0.1% |
0.2 | 51 | < 0.1% |
0.4 | 6 | < 0.1% |
0.6 | 10 | < 0.1% |
0.8 | 1 | < 0.1% |
Value | Count | Frequency (%) |
199600 | 12883 | 0.5% |
199400 | 8 | < 0.1% |
199200 | 5 | < 0.1% |
199120 | 1 | < 0.1% |
199000 | 20 | < 0.1% |
maxdpdtolerance_577P
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 3209 |
---|---|
Distinct (%) | 0.2% |
Missing | 1278326 |
Missing (%) | 48.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.78874518 |
Minimum | 0 |
---|---|
Maximum | 4362 |
Zeros | 986233 |
Zeros (%) | 37.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 18 |
Maximum | 4362 |
Range | 4362 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 158.135784 |
---|---|
Coefficient of variation (CV) | 9.419154454 |
Kurtosis | 263.3173287 |
Mean | 16.78874518 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 14.91851137 |
Sum | 22832173 |
Variance | 25006.92618 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 986233 | 37.4% |
1 | 191671 | 7.3% |
5 | 25367 | 1.0% |
6 | 19116 | 0.7% |
10 | 14207 | 0.5% |
4 | 8844 | 0.3% |
9 | 8395 | 0.3% |
14 | 7146 | 0.3% |
18 | 6781 | 0.3% |
7 | 6239 | 0.2% |
Other values (3199) | 85970 | 3.3% |
(Missing) | 1278326 | 48.5% |
Value | Count | Frequency (%) |
0 | 986233 | 37.4% |
1 | 191671 | 7.3% |
2 | 6033 | 0.2% |
3 | 1935 | 0.1% |
4 | 8844 | 0.3% |
Value | Count | Frequency (%) |
4362 | 1 | < 0.1% |
4245 | 2 | < 0.1% |
4222 | 1 | < 0.1% |
4206 | 1 | < 0.1% |
4185 | 1 | < 0.1% |
num_group1
Real number (ℝ)
ZEROS
 
Distinct | 20 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.613015603 |
Minimum | 0 |
---|---|
Maximum | 19 |
Zeros | 438525 |
Zeros (%) | 16.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 3 |
Q3 | 7 |
95-th percentile | 14 |
Maximum | 19 |
Range | 19 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 4.485514586 |
---|---|
Coefficient of variation (CV) | 0.9723605927 |
Kurtosis | 0.7244665374 |
Mean | 4.613015603 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 1.163952685 |
Sum | 12170496 |
Variance | 20.1198411 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 438525 | 16.6% |
1 | 369409 | 14.0% |
2 | 309947 | 11.7% |
3 | 259106 | 9.8% |
4 | 216484 | 8.2% |
5 | 180462 | 6.8% |
6 | 150620 | 5.7% |
7 | 125780 | 4.8% |
8 | 105118 | 4.0% |
9 | 88418 | 3.4% |
Other values (10) | 394426 | 15.0% |
Value | Count | Frequency (%) |
0 | 438525 | 16.6% |
1 | 369409 | 14.0% |
2 | 309947 | 11.7% |
3 | 259106 | 9.8% |
4 | 216484 | 8.2% |
Value | Count | Frequency (%) |
19 | 17541 | 0.7% |
18 | 20372 | 0.8% |
17 | 23687 | 0.9% |
16 | 27647 | 1.0% |
15 | 32524 | 1.2% |
outstandingdebt_522A
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 179739 |
---|---|
Distinct (%) | 10.8% |
Missing | 980346 |
Missing (%) | 37.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7097.727649 |
Minimum | 0 |
---|---|
Maximum | 1029392.8 |
Zeros | 1436446 |
Zeros (%) | 54.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 40964.368 |
Maximum | 1029392.8 |
Range | 1029392.8 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 30871.45589 |
---|---|
Coefficient of variation (CV) | 4.349484429 |
Kurtosis | 79.30366253 |
Mean | 7097.727649 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 7.504083064 |
Sum | 1.176767046 × 10^10 |
Variance | 953046788.5 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1436446 | 54.4% |
10 | 286 | < 0.1% |
9998 | 92 | < 0.1% |
11998 | 72 | < 0.1% |
17998 | 64 | < 0.1% |
20 | 61 | < 0.1% |
7998 | 60 | < 0.1% |
19998 | 59 | < 0.1% |
5998 | 58 | < 0.1% |
8998 | 52 | < 0.1% |
Other values (179729) | 220699 | 8.4% |
(Missing) | 980346 | 37.2% |
Value | Count | Frequency (%) |
0 | 1436446 | 54.4% |
0.002 | 1 | < 0.1% |
0.004 | 1 | < 0.1% |
0.006 | 1 | < 0.1% |
0.008 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1029392.8 | 1 | < 0.1% |
987535 | 1 | < 0.1% |
984399 | 1 | < 0.1% |
978072.2 | 1 | < 0.1% |
910766 | 1 | < 0.1% |
pmtnum_8L
Real number (ℝ)
MISSING
 
Distinct | 60 |
---|---|
Distinct (%) | < 0.1% |
Missing | 238987 |
Missing (%) | 9.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.85803407 |
Minimum | 3 |
---|---|
Maximum | 63 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 3 |
---|---|
5-th percentile | 4 |
Q1 | 9 |
median | 12 |
Q3 | 24 |
95-th percentile | 36 |
Maximum | 63 |
Range | 60 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 11.25158885 |
---|---|
Coefficient of variation (CV) | 0.6674318494 |
Kurtosis | 1.295980682 |
Mean | 16.85803407 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 1.211121972 |
Sum | 40447616 |
Variance | 126.5982517 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
12 | 594080 | 22.5% |
24 | 414664 | 15.7% |
6 | 355258 | 13.5% |
18 | 200921 | 7.6% |
36 | 156551 | 5.9% |
3 | 117252 | 4.4% |
48 | 81251 | 3.1% |
16 | 77062 | 2.9% |
9 | 51005 | 1.9% |
4 | 47083 | 1.8% |
Other values (50) | 304181 | 11.5% |
(Missing) | 238987 | 9.1% |
Value | Count | Frequency (%) |
3 | 117252 | 4.4% |
4 | 47083 | 1.8% |
5 | 23564 | 0.9% |
6 | 355258 | 13.5% |
7 | 4135 | 0.2% |
Value | Count | Frequency (%) |
63 | 3 | < 0.1% |
62 | 12 | < 0.1% |
61 | 18 | < 0.1% |
60 | 12352 | 0.5% |
59 | 1 | < 0.1% |
postype_4733339M
Text
Distinct | 9 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 20.1 MiB |
Length
Max length | 12 |
---|---|
Median length | 11 |
Mean length | 11.24311951 |
Min length | 8 |
Characters and Unicode
Total characters | 29662666 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | P46_145_78 |
---|---|
2nd row | P149_40_170 |
3rd row | P46_145_78 |
4th row | P177_117_192 |
5th row | P60_146_156 |
Value | Count | Frequency (%) |
P177_117_192 | 1283096 | 48.6% |
P46_145_78 | 671053 | 25.4% |
P149_40_170 | 260036 | 9.9% |
P60_146_156 | 200271 | 7.6% |
P67_102_161 | 175416 | 6.6% |
P217_110_186 | 30413 | 1.2% |
P169_115_83 | 13766 | 0.5% |
P140_48_169 | 3899 | 0.1% |
a55475b1 | 345 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
1 | 7421392 | |
_ | 5275900 | |
7 | 4986551 | |
P | 2637950 | 8.9% |
4 | 2070592 | 7.0% |
6 | 1670776 | 5.6% |
9 | 1560797 | 5.3% |
2 | 1488925 | 5.0% |
0 | 930071 | 3.1% |
5 | 886125 | 3.0% |
Other values (4) | 733587 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 21748126 | |
Connector Punctuation | 5275900 | 17.8% |
Uppercase Letter | 2637950 | 8.9% |
Lowercase Letter | 690 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 7421392 | |
7 | 4986551 | |
4 | 2070592 | 9.5% |
6 | 1670776 | 7.7% |
9 | 1560797 | 7.2% |
2 | 1488925 | 6.8% |
0 | 930071 | 4.3% |
5 | 886125 | 4.1% |
8 | 719131 | 3.3% |
3 | 13766 | 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 345 | |
b | 345 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 5275900 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 2637950 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 27024026 | |
Latin | 2638640 | 8.9% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 7421392 | |
_ | 5275900 | |
7 | 4986551 | |
4 | 2070592 | 7.7% |
6 | 1670776 | 6.2% |
9 | 1560797 | 5.8% |
2 | 1488925 | 5.5% |
0 | 930071 | 3.4% |
5 | 886125 | 3.3% |
8 | 719131 | 2.7% |
Latin
Value | Count | Frequency (%) |
P | 2637950 | |
a | 345 | < 0.1% |
b | 345 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 29662666 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 7421392 | |
_ | 5275900 | |
7 | 4986551 | |
P | 2637950 | 8.9% |
4 | 2070592 | 7.0% |
6 | 1670776 | 5.6% |
9 | 1560797 | 5.3% |
2 | 1488925 | 5.0% |
0 | 930071 | 3.1% |
5 | 886125 | 3.0% |
Other values (4) | 733587 | 2.5% |
profession_152M
Text
Distinct | 5799 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 20.1 MiB |
Length
Max length | 13 |
---|---|
Median length | 8 |
Mean length | 8.024117091 |
Min length | 7 |
Characters and Unicode
Total characters | 21169988 |
---|---|
Distinct characters | 34 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 3891 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | a55475b1 |
---|---|
2nd row | a55475b1 |
3rd row | a55475b1 |
4th row | a55475b1 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 2613108 | 99.0% |
P46_72_80 | 682 | < 0.1% |
P104_137_180 | 436 | < 0.1% |
P167_22_171 | 374 | < 0.1% |
P21_76_53 | 372 | < 0.1% |
P143_116_69 | 342 | < 0.1% |
P25_111_112 | 335 | < 0.1% |
P139_125_64 | 322 | < 0.1% |
P121_114_58 | 283 | < 0.1% |
P103_114_185 | 279 | < 0.1% |
Other values (5789) | 21762 | 0.8% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 7854981 | |
1 | 2667040 | 12.6% |
7 | 2630879 | 12.4% |
4 | 2628479 | 12.4% |
a | 2613130 | 12.3% |
b | 2613109 | 12.3% |
_ | 50374 | 0.2% |
P | 25146 | 0.1% |
2 | 17853 | 0.1% |
6 | 17554 | 0.1% |
Other values (24) | 51443 | 0.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 15868035 | |
Lowercase Letter | 5226392 | 24.7% |
Connector Punctuation | 50374 | 0.2% |
Uppercase Letter | 25187 | 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 2613130 | |
b | 2613109 | |
e | 24 | < 0.1% |
r | 19 | < 0.1% |
o | 14 | < 0.1% |
t | 12 | < 0.1% |
d | 12 | < 0.1% |
k | 10 | < 0.1% |
y | 10 | < 0.1% |
c | 10 | < 0.1% |
Other values (11) | 42 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
5 | 7854981 | |
1 | 2667040 | 16.8% |
7 | 2630879 | 16.6% |
4 | 2628479 | 16.6% |
2 | 17853 | 0.1% |
6 | 17554 | 0.1% |
3 | 13264 | 0.1% |
8 | 13257 | 0.1% |
0 | 13197 | 0.1% |
9 | 11531 | 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 25146 | |
Q | 41 | 0.2% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 50374 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 15918409 | |
Latin | 5251579 | 24.8% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 2613130 | |
b | 2613109 | |
P | 25146 | 0.5% |
Q | 41 | < 0.1% |
e | 24 | < 0.1% |
r | 19 | < 0.1% |
o | 14 | < 0.1% |
t | 12 | < 0.1% |
d | 12 | < 0.1% |
k | 10 | < 0.1% |
Other values (13) | 62 | < 0.1% |
Common
Value | Count | Frequency (%) |
5 | 7854981 | |
1 | 2667040 | 16.8% |
7 | 2630879 | 16.5% |
4 | 2628479 | 16.5% |
_ | 50374 | 0.3% |
2 | 17853 | 0.1% |
6 | 17554 | 0.1% |
3 | 13264 | 0.1% |
8 | 13257 | 0.1% |
0 | 13197 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 21169988 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 7854981 | |
1 | 2667040 | 12.6% |
7 | 2630879 | 12.4% |
4 | 2628479 | 12.4% |
a | 2613130 | 12.3% |
b | 2613109 | 12.3% |
_ | 50374 | 0.2% |
P | 25146 | 0.1% |
2 | 17853 | 0.1% |
6 | 17554 | 0.1% |
Other values (24) | 51443 | 0.2% |
Distinct | 18 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 20.1 MiB |
Length
Max length | 11 |
---|---|
Median length | 8 |
Mean length | 8.730558561 |
Min length | 8 |
Characters and Unicode
Total characters | 23033789 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
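The length and character figures in this block can be reproduced in spirit with the standard library. The sketch below is not the profiler's own code: `profile_text` and the five-element `values` list (copied from the Sample rows) are illustrative assumptions. It relies on `unicodedata.category`, whose two-letter codes `Nd`, `Ll`, `Lu` and `Pc` correspond to the Decimal Number, Lowercase Letter, Uppercase Letter and Connector Punctuation categories listed in the tables.

```python
import unicodedata
from collections import Counter
from statistics import mean, median

# Illustrative input: the five Sample rows shown for this variable.
values = ["P198_131_9", "P45_84_106", "a55475b1", "P99_56_166", "a55475b1"]


def profile_text(values):
    """Return length statistics and per-category character counts."""
    lengths = [len(v) for v in values]
    # Count every character across all values.
    chars = Counter(ch for v in values for ch in v)
    # Aggregate character counts by Unicode general category code.
    categories = Counter()
    for ch, n in chars.items():
        categories[unicodedata.category(ch)] += n
    return {
        "max_length": max(lengths),
        "median_length": median(lengths),
        "mean_length": mean(lengths),
        "min_length": min(lengths),
        "total_characters": sum(lengths),
        "distinct_characters": len(chars),
        "categories": dict(categories),
    }


stats = profile_text(values)
```

On the full column the same relationship holds between the reported totals: the mean length is the total character count divided by the number of observations.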
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | P198_131_9 |
---|---|
2nd row | P45_84_106 |
3rd row | a55475b1 |
4th row | P99_56_166 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 1814894 | 68.8% |
P99_56_166 | 354478 | 13.4% |
P94_109_143 | 285561 | 10.8% |
P198_131_9 | 88231 | 3.3% |
P45_84_106 | 83707 | 3.2% |
P48_22_32 | 4446 | 0.2% |
P30_86_84 | 2041 | 0.1% |
P121_60_164 | 1378 | 0.1% |
P196_88_176 | 1347 | 0.1% |
P52_67_90 | 1240 | < 0.1% |
Other values (8) | 972 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 5884277 | 25.5% |
1 | 3097659 | 13.4% |
4 | 2561781 | 11.1% |
7 | 1817922 | 7.9% |
a | 1814894 | 7.9% |
b | 1814894 | 7.9% |
_ | 1646802 | 7.1% |
9 | 1459810 | 6.3% |
6 | 1157112 | 5.0% |
P | 823401 | 3.6% |
Other values (4) | 955237 | 4.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 16933798 | 73.5% |
Lowercase Letter | 3629788 | 15.8% |
Connector Punctuation | 1646802 | 7.1% |
Uppercase Letter | 823401 | 3.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 5884277 | 34.7% |
1 | 3097659 | 18.3% |
4 | 2561781 | 15.1% |
7 | 1817922 | 10.7% |
9 | 1459810 | 8.6% |
6 | 1157112 | 6.8% |
3 | 380407 | 2.2% |
0 | 374229 | 2.2% |
8 | 183643 | 1.1% |
2 | 16958 | 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 1814894 | 50.0% |
b | 1814894 | 50.0% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1646802 | 100.0% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 823401 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 18580600 | 80.7% |
Latin | 4453189 | 19.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 5884277 | 31.7% |
1 | 3097659 | 16.7% |
4 | 2561781 | 13.8% |
7 | 1817922 | 9.8% |
_ | 1646802 | 8.9% |
9 | 1459810 | 7.9% |
6 | 1157112 | 6.2% |
3 | 380407 | 2.0% |
0 | 374229 | 2.0% |
8 | 183643 | 1.0% |
Latin
Value | Count | Frequency (%) |
a | 1814894 | 40.8% |
b | 1814894 | 40.8% |
P | 823401 | 18.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 23033789 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 5884277 | 25.5% |
1 | 3097659 | 13.4% |
4 | 2561781 | 11.1% |
7 | 1817922 | 7.9% |
a | 1814894 | 7.9% |
b | 1814894 | 7.9% |
_ | 1646802 | 7.1% |
9 | 1459810 | 6.3% |
6 | 1157112 | 5.0% |
P | 823401 | 3.6% |
Other values (4) | 955237 | 4.1% |
Distinct | 14 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 20.1 MiB |
Length
Max length | 11 |
---|---|
Median length | 8 |
Mean length | 8.792563758 |
Min length | 8 |
Characters and Unicode
Total characters | 23197377 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | P94_109_143 |
---|---|
2nd row | P94_109_143 |
3rd row | a55475b1 |
4th row | P94_109_143 |
5th row | a55475b1 |
Value | Count | Frequency (%) |
a55475b1 | 1889911 | 71.6% |
P94_109_143 | 654356 | 24.8% |
P30_86_84 | 48409 | 1.8% |
P52_67_90 | 18027 | 0.7% |
P69_72_116 | 12955 | 0.5% |
P129_162_80 | 8320 | 0.3% |
P84_14_61 | 2885 | 0.1% |
P64_121_167 | 1849 | 0.1% |
P19_25_34 | 761 | < 0.1% |
P5_143_178 | 612 | < 0.1% |
Other values (4) | 210 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
5 | 5689545 | 24.5% |
4 | 3256030 | 14.0% |
1 | 3254895 | 14.0% |
7 | 1923354 | 8.3% |
a | 1889911 | 8.1% |
b | 1889911 | 8.1% |
_ | 1496768 | 6.5% |
9 | 1348782 | 5.8% |
P | 748384 | 3.2% |
0 | 729319 | 3.1% |
Other values (4) | 970478 | 4.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 17172403 | 74.0% |
Lowercase Letter | 3779822 | 16.3% |
Connector Punctuation | 1496768 | 6.5% |
Uppercase Letter | 748384 | 3.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 5689545 | 33.1% |
4 | 3256030 | 19.0% |
1 | 3254895 | 19.0% |
7 | 1923354 | 11.2% |
9 | 1348782 | 7.9% |
0 | 729319 | 4.2% |
3 | 704345 | 4.1% |
8 | 108638 | 0.6% |
6 | 107252 | 0.6% |
2 | 50243 | 0.3% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 1889911 | 50.0% |
b | 1889911 | 50.0% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1496768 | 100.0% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 748384 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 18669171 | 80.5% |
Latin | 4528206 | 19.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 5689545 | 30.5% |
4 | 3256030 | 17.4% |
1 | 3254895 | 17.4% |
7 | 1923354 | 10.3% |
_ | 1496768 | 8.0% |
9 | 1348782 | 7.2% |
0 | 729319 | 3.9% |
3 | 704345 | 3.8% |
8 | 108638 | 0.6% |
6 | 107252 | 0.6% |
Latin
Value | Count | Frequency (%) |
a | 1889911 | 41.7% |
b | 1889911 | 41.7% |
P | 748384 | 16.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 23197377 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 5689545 | 24.5% |
4 | 3256030 | 14.0% |
1 | 3254895 | 14.0% |
7 | 1923354 | 8.3% |
a | 1889911 | 8.1% |
b | 1889911 | 8.1% |
_ | 1496768 | 6.5% |
9 | 1348782 | 5.8% |
P | 748384 | 3.2% |
0 | 729319 | 3.1% |
Other values (4) | 970478 | 4.2% |
revolvingaccount_394A
Real number (ℝ)
MISSING
 
Distinct | 52406 |
---|---|
Distinct (%) | 37.4% |
Missing | 2498196 |
Missing (%) | 94.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 761933430.7 |
Minimum | 540342340 |
---|---|
Maximum | 800608700 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 540342340 |
---|---|
5-th percentile | 685001930 |
Q1 | 760163480 |
median | 780311500 |
Q3 | 780789950 |
95-th percentile | 800253630 |
Maximum | 800608700 |
Range | 260266360 |
Interquartile range (IQR) | 20626470 |
Descriptive statistics
Standard deviation | 47558807.47 |
---|---|
Coefficient of variation (CV) | 0.06241858613 |
Kurtosis | 10.27654451 |
Mean | 761933430.7 |
Median Absolute Deviation (MAD) | 19745840 |
Skewness | -3.060067244 |
Sum | 1.067461117 × 10^14 |
Variance | 2.261840168 × 10^15 |
Monotonicity | Not monotonic |
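Several of the figures above are linked by simple identities, which makes the table easy to sanity-check. The sketch below plugs in the reported numbers for revolvingaccount_394A; the variable names and the `derived` dictionary are illustrative, and the non-missing count is assumed to be the dataset's 2638295 observations minus the 2498196 missing values.

```python
# Figures copied from the revolvingaccount_394A tables above.
n_total = 2638295        # observations in the dataset
n_missing = 2498196      # missing values for this variable
mean = 761933430.7
std = 47558807.47
minimum, maximum = 540342340, 800608700
q1, q3 = 760163480, 780789950

# Assumption: statistics are computed over non-missing values only.
n_valid = n_total - n_missing

derived = {
    "range": maximum - minimum,   # Maximum minus Minimum
    "iqr": q3 - q1,               # Q3 minus Q1
    "cv": std / mean,             # coefficient of variation
    "variance": std ** 2,         # square of the standard deviation
    "sum": mean * n_valid,        # mean times non-missing count
}
```

Each derived value agrees with the reported Range, IQR, CV, Variance and Sum to the precision shown in the report.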
Value | Count | Frequency (%) |
780784600 | 29 | < 0.1% |
780851400 | 27 | < 0.1% |
800146900 | 25 | < 0.1% |
780783040 | 24 | < 0.1% |
780851650 | 24 | < 0.1% |
780661440 | 23 | < 0.1% |
780826560 | 23 | < 0.1% |
780561100 | 23 | < 0.1% |
780826300 | 22 | < 0.1% |
780621760 | 22 | < 0.1% |
Other values (52396) | 139857 | 5.3% |
(Missing) | 2498196 | 94.7% |
Value | Count | Frequency (%) |
540342340 | 1 | < 0.1% |
540342460 | 2 | < 0.1% |
540342500 | 3 | < 0.1% |
540342600 | 2 | < 0.1% |
540342660 | 1 | < 0.1% |
Value | Count | Frequency (%) |
800608700 | 1 | < 0.1% |
800608100 | 1 | < 0.1% |
800607550 | 1 | < 0.1% |
800607500 | 1 | < 0.1% |
800607400 | 1 | < 0.1% |
status_219L
Text
Distinct | 11 |
---|---|
Distinct (%) | < 0.1% |
Missing | 31 |
Missing (%) | < 0.1% |
Memory size | 20.1 MiB |
Value | Count | Frequency (%) |
D | 1104119 | |
K | 1053357 | |
A | 284608 | 10.8% |
T | 177685 | 6.7% |
N | 14762 | 0.6% |
Q | 2711 | 0.1% |
L | 489 | < 0.1% |
S | 302 | < 0.1% |
H | 196 | < 0.1% |
P | 20 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
D | 1104119 | |
K | 1053357 | |
A | 284608 | 10.8% |
T | 177685 | 6.7% |
N | 14762 | 0.6% |
Q | 2711 | 0.1% |
L | 489 | < 0.1% |
S | 302 | < 0.1% |
H | 196 | < 0.1% |
P | 20 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 2638264 | 100.0% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
D | 1104119 | |
K | 1053357 | |
A | 284608 | 10.8% |
T | 177685 | 6.7% |
N | 14762 | 0.6% |
Q | 2711 | 0.1% |
L | 489 | < 0.1% |
S | 302 | < 0.1% |
H | 196 | < 0.1% |
P | 20 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2638264 | 100.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
D | 1104119 | |
K | 1053357 | |
A | 284608 | 10.8% |
T | 177685 | 6.7% |
N | 14762 | 0.6% |
Q | 2711 | 0.1% |
L | 489 | < 0.1% |
S | 302 | < 0.1% |
H | 196 | < 0.1% |
P | 20 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2638264 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
D | 1104119 | |
K | 1053357 | |
A | 284608 | 10.8% |
T | 177685 | 6.7% |
N | 14762 | 0.6% |
Q | 2711 | 0.1% |
L | 489 | < 0.1% |
S | 302 | < 0.1% |
H | 196 | < 0.1% |
P | 20 | < 0.1% |
tenor_203L
Real number (ℝ)
MISSING
 
Distinct | 60 |
---|---|
Distinct (%) | < 0.1% |
Missing | 238987 |
Missing (%) | 9.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.85803407 |
Minimum | 3 |
---|---|
Maximum | 63 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.1 MiB |
Quantile statistics
Minimum | 3 |
---|---|
5-th percentile | 4 |
Q1 | 9 |
median | 12 |
Q3 | 24 |
95-th percentile | 36 |
Maximum | 63 |
Range | 60 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 11.25158885 |
---|---|
Coefficient of variation (CV) | 0.6674318494 |
Kurtosis | 1.295980682 |
Mean | 16.85803407 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 1.211121972 |
Sum | 40447616 |
Variance | 126.5982517 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
12 | 594080 | 22.5% |
24 | 414664 | 15.7% |
6 | 355258 | 13.5% |
18 | 200921 | 7.6% |
36 | 156551 | 5.9% |
3 | 117252 | 4.4% |
48 | 81251 | 3.1% |
16 | 77062 | 2.9% |
9 | 51005 | 1.9% |
4 | 47083 | 1.8% |
Other values (50) | 304181 | 11.5% |
(Missing) | 238987 | 9.1% |
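The Frequency (%) column for tenor_203L is consistent with dividing each count by the total number of observations, missing rows included, rather than by the non-missing count. The sketch below checks this against a few rows of the table; `pct`, `over_total` and `over_valid` are illustrative names, not part of the report.

```python
N_TOTAL = 2638295     # observations in the dataset
N_MISSING = 238987    # missing values for tenor_203L

# A few rows copied from the common-values table above.
counts = {18: 200921, 36: 156551, 3: 117252, 48: 81251}


def pct(count, denominator):
    """Percentage rounded to one decimal place, as displayed in the report."""
    return round(100 * count / denominator, 1)


# Denominator = all observations (matches the reported percentages).
over_total = {value: pct(c, N_TOTAL) for value, c in counts.items()}
# Denominator = non-missing observations only (does not match the report).
over_valid = {value: pct(c, N_TOTAL - N_MISSING) for value, c in counts.items()}
missing_pct = pct(N_MISSING, N_TOTAL)
```

With the total as denominator, value 18 comes out at 7.6% as reported; with the non-missing count it would be 8.4%. Anyone comparing these percentages with a model's training distribution after dropping missing rows would need to rescale accordingly.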
Value | Count | Frequency (%) |
3 | 117252 | 4.4% |
4 | 47083 | 1.8% |
5 | 23564 | 0.9% |
6 | 355258 | 13.5% |
7 | 4135 | 0.2% |
Value | Count | Frequency (%) |
63 | 3 | < 0.1% |
62 | 12 | < 0.1% |
61 | 18 | < 0.1% |
60 | 12352 | 0.5% |
59 | 1 | < 0.1% |