Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 1526659 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Total size in memory | 58.2 MiB |
Average record size in memory | 40.0 B |
Variable types
Numeric | 4 |
---|---|
Text | 1 |
Reproduction
Analysis started | 2024-02-13 19:38:12.006909 |
---|---|
Analysis finished | 2024-02-13 19:38:13.818603 |
Duration | 1.81 second |
Software version | ydata-profiling vv4.6.4 |
Download configuration | config.json |
case_id
Real number (ℝ)
UNIQUE
 
Distinct | 1526659 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1286076.572 |
Minimum | 0 |
---|---|
Maximum | 2703454 |
Zeros | 1 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 11.6 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 121766.9 |
Q1 | 766197.5 |
median | 1357358 |
Q3 | 1739022.5 |
95-th percentile | 2627080.1 |
Maximum | 2703454 |
Range | 2703454 |
Interquartile range (IQR) | 972825 |
Descriptive statistics
Standard deviation | 718946.5923 |
---|---|
Coefficient of variation (CV) | 0.5590231624 |
Kurtosis | -0.5871633686 |
Mean | 1286076.572 |
Median Absolute Deviation (MAD) | 486413 |
Skewness | 0.1354512807 |
Sum | 1.963400373 × 1012 |
Variance | 5.168842026 × 1011 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
0 | 1 | < 0.1% |
1611798 | 1 | < 0.1% |
1611807 | 1 | < 0.1% |
1611806 | 1 | < 0.1% |
1611805 | 1 | < 0.1% |
1611804 | 1 | < 0.1% |
1611803 | 1 | < 0.1% |
1611802 | 1 | < 0.1% |
1611801 | 1 | < 0.1% |
1611800 | 1 | < 0.1% |
Other values (1526649) | 1526649 |
Value | Count | Frequency (%) |
0 | 1 | |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 |
Value | Count | Frequency (%) |
2703454 | 1 | |
2703453 | 1 | |
2703452 | 1 | |
2703451 | 1 | |
2703450 | 1 |
date_decision
Text
Distinct | 644 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.6 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 15266590 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2019-01-03 |
---|---|
2nd row | 2019-01-03 |
3rd row | 2019-01-04 |
4th row | 2019-01-03 |
5th row | 2019-01-04 |
Value | Count | Frequency (%) |
2019-11-29 | 8812 | 0.6% |
2019-11-30 | 8756 | 0.6% |
2019-12-28 | 6900 | 0.5% |
2019-12-29 | 6537 | 0.4% |
2019-11-17 | 6340 | 0.4% |
2019-12-30 | 6327 | 0.4% |
2019-11-16 | 5882 | 0.4% |
2019-12-14 | 5864 | 0.4% |
2019-12-02 | 5719 | 0.4% |
2019-12-15 | 5635 | 0.4% |
Other values (634) | 1459887 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 3843197 | |
- | 3053318 | |
2 | 2888882 | |
1 | 2388207 | |
9 | 1382896 | 9.1% |
3 | 352108 | 2.3% |
8 | 307159 | 2.0% |
6 | 290851 | 1.9% |
7 | 283172 | 1.9% |
5 | 243675 | 1.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 12213272 | |
Dash Punctuation | 3053318 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 3843197 | |
2 | 2888882 | |
1 | 2388207 | |
9 | 1382896 | 11.3% |
3 | 352108 | 2.9% |
8 | 307159 | 2.5% |
6 | 290851 | 2.4% |
7 | 283172 | 2.3% |
5 | 243675 | 2.0% |
4 | 233125 | 1.9% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3053318 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 15266590 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 3843197 | |
- | 3053318 | |
2 | 2888882 | |
1 | 2388207 | |
9 | 1382896 | 9.1% |
3 | 352108 | 2.3% |
8 | 307159 | 2.0% |
6 | 290851 | 1.9% |
7 | 283172 | 1.9% |
5 | 243675 | 1.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 15266590 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 3843197 | |
- | 3053318 | |
2 | 2888882 | |
1 | 2388207 | |
9 | 1382896 | 9.1% |
3 | 352108 | 2.3% |
8 | 307159 | 2.0% |
6 | 290851 | 1.9% |
7 | 283172 | 1.9% |
5 | 243675 | 1.6% |
MONTH
Real number (ℝ)
Distinct | 22 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 201936.288 |
Minimum | 201901 |
---|---|
Maximum | 202010 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 11.6 MiB |
Quantile statistics
Minimum | 201901 |
---|---|
5-th percentile | 201902 |
Q1 | 201906 |
median | 201910 |
Q3 | 202001 |
95-th percentile | 202008 |
Maximum | 202010 |
Range | 109 |
Interquartile range (IQR) | 95 |
Descriptive statistics
Standard deviation | 44.7359745 |
---|---|
Coefficient of variation (CV) | 0.0002215350938 |
Kurtosis | -1.215285612 |
Mean | 201936.288 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 0.8707251911 |
Sum | 3.082878515 × 1011 |
Variance | 2001.307415 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
201912 | 126011 | 8.3% |
201911 | 115845 | 7.6% |
201908 | 98741 | 6.5% |
201909 | 98706 | 6.5% |
201907 | 97566 | 6.4% |
201910 | 95149 | 6.2% |
201906 | 94398 | 6.2% |
202001 | 86750 | 5.7% |
201901 | 75529 | 4.9% |
202002 | 75183 | 4.9% |
Other values (12) | 562781 |
Value | Count | Frequency (%) |
201901 | 75529 | |
201902 | 63064 | |
201903 | 69147 | |
201904 | 72012 | |
201905 | 64594 |
Value | Count | Frequency (%) |
202010 | 8592 | 0.6% |
202009 | 61905 | |
202008 | 50831 | |
202007 | 28912 | |
202006 | 45962 |
WEEK_NUM
Real number (ℝ)
ZEROS
 
Distinct | 92 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40.76903618 |
Minimum | 0 |
---|---|
Maximum | 91 |
Zeros | 16735 |
Zeros (%) | 1.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 11.6 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 4 |
Q1 | 23 |
median | 40 |
Q3 | 55 |
95-th percentile | 86 |
Maximum | 91 |
Range | 91 |
Interquartile range (IQR) | 32 |
Descriptive statistics
Standard deviation | 23.79798129 |
---|---|
Coefficient of variation (CV) | 0.5837268556 |
Kurtosis | -0.6533283719 |
Mean | 40.76903618 |
Median Absolute Deviation (MAD) | 17 |
Skewness | 0.2971192891 |
Sum | 62240416 |
Variance | 566.3439136 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
51 | 35920 | 2.4% |
47 | 31888 | 2.1% |
49 | 30938 | 2.0% |
45 | 28947 | 1.9% |
23 | 26734 | 1.8% |
50 | 26244 | 1.7% |
44 | 25887 | 1.7% |
35 | 24137 | 1.6% |
40 | 23907 | 1.6% |
32 | 23889 | 1.6% |
Other values (82) | 1248168 |
Value | Count | Frequency (%) |
0 | 16735 | |
1 | 18841 | |
2 | 17476 | |
3 | 16108 | |
4 | 14309 |
Value | Count | Frequency (%) |
91 | 12674 | |
90 | 12103 | |
89 | 13600 | |
88 | 14234 | |
87 | 17886 |
target
Real number (ℝ)
ZEROS
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.03143727578 |
Minimum | 0 |
---|---|
Maximum | 1 |
Zeros | 1478665 |
Zeros (%) | 96.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 11.6 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 1 |
Range | 1 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.1744963994 |
---|---|
Coefficient of variation (CV) | 5.550620883 |
Kurtosis | 26.8419215 |
Mean | 0.03143727578 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.370464257 |
Sum | 47994 |
Variance | 0.03044899341 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1478665 | |
1 | 47994 | 3.1% |
Value | Count | Frequency (%) |
0 | 1478665 | |
1 | 47994 | 3.1% |
Value | Count | Frequency (%) |
1 | 47994 | 3.1% |
0 | 1478665 |