Dataset statistics
| Number of variables | 5 |
|---|---|
| Number of observations | 1526659 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Total size in memory | 58.2 MiB |
| Average record size in memory | 40.0 B |
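The memory figures above are consistent with each cell occupying 8 bytes. The sketch below checks that arithmetic. The 8-byte width is an assumption (a 64-bit number, or a 64-bit pointer for the text column), not something the report states.

```python
# Figures copied from the Dataset statistics table.
N_ROWS = 1_526_659
N_COLUMNS = 5
BYTES_PER_VALUE = 8  # assumption: one 64-bit value or pointer per cell
MIB = 1024 ** 2

column_bytes = N_ROWS * BYTES_PER_VALUE
total_bytes = column_bytes * N_COLUMNS

column_mib = column_bytes / MIB       # compare with each variable's "Memory size"
total_mib = total_bytes / MIB         # compare with "Total size in memory"
record_bytes = total_bytes / N_ROWS   # compare with "Average record size in memory"

print(f"{column_mib:.1f} MiB per column")
print(f"{total_mib:.1f} MiB in total")
print(f"{record_bytes:.1f} B per record")
```

If the assumption holds, the text column's figure counts only references to the strings, so the true footprint of date_decision would be larger than reported.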
Variable types
| Numeric | 4 |
|---|---|
| Text | 1 |
Reproduction
| Analysis started | 2024-02-13 19:38:12.006909 |
|---|---|
| Analysis finished | 2024-02-13 19:38:13.818603 |
| Duration | 1.81 seconds |
| Software version | ydata-profiling v4.6.4 |
| Download configuration | config.json |
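The reported duration can be recomputed from the two timestamps. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2024-02-13 19:38:12.006909")
finished = datetime.fromisoformat("2024-02-13 19:38:13.818603")

# Subtracting two datetimes yields a timedelta.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```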
case_id
Real number (ℝ)
UNIQUE 
| Distinct | 1526659 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1286076.572 |
| Minimum | 0 |
| Maximum | 2703454 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 121766.9 |
| Q1 | 766197.5 |
| median | 1357358 |
| Q3 | 1739022.5 |
| 95-th percentile | 2627080.1 |
| Maximum | 2703454 |
| Range | 2703454 |
| Interquartile range (IQR) | 972825 |
Descriptive statistics
| Standard deviation | 718946.5923 |
|---|---|
| Coefficient of variation (CV) | 0.5590231624 |
| Kurtosis | -0.5871633686 |
| Mean | 1286076.572 |
| Median Absolute Deviation (MAD) | 486413 |
| Skewness | 0.1354512807 |
| Sum | 1.963400373 × 10¹² |
| Variance | 5.168842026 × 10¹¹ |
| Monotonicity | Strictly increasing |
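Several case_id statistics follow from others in the same tables. The sketch below recomputes them as a consistency check. It also counts how many integers in the observed range are not used as an ID, which is a derived figure and not one the report prints.

```python
import math

# Values copied from the case_id tables.
n = 1_526_659
mean = 1286076.572
std = 718946.5923
q1, q3 = 766197.5, 1739022.5
minimum, maximum = 0, 2703454

cv = std / mean                  # coefficient of variation
variance = std ** 2
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum
total = mean * n                 # sum of all values

# Every value is distinct, so any integer in [minimum, maximum]
# beyond the n observed ones is a gap in the ID sequence.
unused_ids = (maximum - minimum + 1) - n

print(f"CV={cv:.6f} variance={variance:.6e} IQR={iqr} range={value_range}")
print(f"sum={total:.6e} unused IDs={unused_ids}")
assert math.isclose(cv, 0.5590231624, rel_tol=1e-6)
```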
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1 | < 0.1% |
| 1611798 | 1 | < 0.1% |
| 1611807 | 1 | < 0.1% |
| 1611806 | 1 | < 0.1% |
| 1611805 | 1 | < 0.1% |
| 1611804 | 1 | < 0.1% |
| 1611803 | 1 | < 0.1% |
| 1611802 | 1 | < 0.1% |
| 1611801 | 1 | < 0.1% |
| 1611800 | 1 | < 0.1% |
| Other values (1526649) | 1526649 | > 99.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2703454 | 1 | < 0.1% |
| 2703453 | 1 | < 0.1% |
| 2703452 | 1 | < 0.1% |
| 2703451 | 1 | < 0.1% |
| 2703450 | 1 | < 0.1% |
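The Frequency (%) column shows one decimal place, and a single occurrence among 1526659 rows appears as "< 0.1%" instead of "0.0%". A minimal sketch of that formatting rule follows. `format_frequency` is a hypothetical helper written for illustration, not the profiling tool's own code.

```python
def format_frequency(count: int, total: int) -> str:
    """Format count/total as a percentage with one decimal place.

    Non-zero shares that would round to 0.0% are shown as "< 0.1%".
    Hypothetical helper; the tool's real formatter may differ.
    """
    share = count / total
    if 0 < share < 0.0005:  # would otherwise print as 0.0%
        return "< 0.1%"
    return f"{share * 100:.1f}%"


N_ROWS = 1_526_659
print(format_frequency(1, N_ROWS))       # a single case_id value
print(format_frequency(47_994, N_ROWS))  # rows where target == 1
```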
date_decision
Text
| Distinct | 644 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 15266590 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
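Because every date_decision value has length 10, the character totals can be reproduced from a single value. The sketch below uses the standard-library `unicodedata` module to classify each character by its Unicode general category. It assumes every value follows the same YYYY-MM-DD layout as the first sample row, which the fixed length and the two reported categories suggest but do not prove.

```python
import unicodedata
from collections import Counter

N_ROWS = 1_526_659
sample = "2019-01-03"  # first sample row of date_decision

# General category per character:
# 'Nd' = Decimal Number, 'Pd' = Dash Punctuation.
per_value = Counter(unicodedata.category(ch) for ch in sample)

total_chars = len(sample) * N_ROWS
digit_chars = per_value["Nd"] * N_ROWS
dash_chars = per_value["Pd"] * N_ROWS

print(dict(per_value))
print(total_chars, digit_chars, dash_chars)
```

Under that assumption the results match the reported total of 15266590 characters, split into 12213272 decimal numbers and 3053318 dashes.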
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2019-01-03 |
|---|---|
| 2nd row | 2019-01-03 |
| 3rd row | 2019-01-04 |
| 4th row | 2019-01-03 |
| 5th row | 2019-01-04 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2019-11-29 | 8812 | 0.6% |
| 2019-11-30 | 8756 | 0.6% |
| 2019-12-28 | 6900 | 0.5% |
| 2019-12-29 | 6537 | 0.4% |
| 2019-11-17 | 6340 | 0.4% |
| 2019-12-30 | 6327 | 0.4% |
| 2019-11-16 | 5882 | 0.4% |
| 2019-12-14 | 5864 | 0.4% |
| 2019-12-02 | 5719 | 0.4% |
| 2019-12-15 | 5635 | 0.4% |
| Other values (634) | 1459887 | 95.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3843197 | 25.2% |
| - | 3053318 | 20.0% |
| 2 | 2888882 | 18.9% |
| 1 | 2388207 | 15.6% |
| 9 | 1382896 | 9.1% |
| 3 | 352108 | 2.3% |
| 8 | 307159 | 2.0% |
| 6 | 290851 | 1.9% |
| 7 | 283172 | 1.9% |
| 5 | 243675 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 12213272 | 80.0% |
| Dash Punctuation | 3053318 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3843197 | 31.5% |
| 2 | 2888882 | 23.7% |
| 1 | 2388207 | 19.6% |
| 9 | 1382896 | 11.3% |
| 3 | 352108 | 2.9% |
| 8 | 307159 | 2.5% |
| 6 | 290851 | 2.4% |
| 7 | 283172 | 2.3% |
| 5 | 243675 | 2.0% |
| 4 | 233125 | 1.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| - | 3053318 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 15266590 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3843197 | 25.2% |
| - | 3053318 | 20.0% |
| 2 | 2888882 | 18.9% |
| 1 | 2388207 | 15.6% |
| 9 | 1382896 | 9.1% |
| 3 | 352108 | 2.3% |
| 8 | 307159 | 2.0% |
| 6 | 290851 | 1.9% |
| 7 | 283172 | 1.9% |
| 5 | 243675 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 15266590 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3843197 | 25.2% |
| - | 3053318 | 20.0% |
| 2 | 2888882 | 18.9% |
| 1 | 2388207 | 15.6% |
| 9 | 1382896 | 9.1% |
| 3 | 352108 | 2.3% |
| 8 | 307159 | 2.0% |
| 6 | 290851 | 1.9% |
| 7 | 283172 | 1.9% |
| 5 | 243675 | 1.6% |
MONTH
Real number (ℝ)
| Distinct | 22 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 201936.288 |
| Minimum | 201901 |
| Maximum | 202010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 201901 |
|---|---|
| 5-th percentile | 201902 |
| Q1 | 201906 |
| median | 201910 |
| Q3 | 202001 |
| 95-th percentile | 202008 |
| Maximum | 202010 |
| Range | 109 |
| Interquartile range (IQR) | 95 |
Descriptive statistics
| Standard deviation | 44.7359745 |
|---|---|
| Coefficient of variation (CV) | 0.0002215350938 |
| Kurtosis | -1.215285612 |
| Mean | 201936.288 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.8707251911 |
| Sum | 3.082878515 × 10¹¹ |
| Variance | 2001.307415 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=22)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 201912 | 126011 | 8.3% |
| 201911 | 115845 | 7.6% |
| 201908 | 98741 | 6.5% |
| 201909 | 98706 | 6.5% |
| 201907 | 97566 | 6.4% |
| 201910 | 95149 | 6.2% |
| 201906 | 94398 | 6.2% |
| 202001 | 86750 | 5.7% |
| 201901 | 75529 | 4.9% |
| 202002 | 75183 | 4.9% |
| Other values (12) | 562781 | 36.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 201901 | 75529 | 4.9% |
| 201902 | 63064 | 4.1% |
| 201903 | 69147 | 4.5% |
| 201904 | 72012 | 4.7% |
| 201905 | 64594 | 4.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 202010 | 8592 | 0.6% |
| 202009 | 61905 | 4.1% |
| 202008 | 50831 | 3.3% |
| 202007 | 28912 | 1.9% |
| 202006 | 45962 | 3.0% |
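MONTH values look like calendar months encoded as YYYYMM integers, so numeric statistics such as the range of 109 measure the encoding and not elapsed time. The sketch below converts the minimum and maximum to a month count under that assumption. `month_index` is an illustrative helper, and the YYYYMM reading is inferred from the values, not stated in the report.

```python
def month_index(yyyymm: int) -> int:
    """Map a YYYYMM integer to a running month number (assumed encoding)."""
    year, month = divmod(yyyymm, 100)
    return year * 12 + (month - 1)


first, last = 201901, 202010  # Minimum and Maximum from the table

calendar_months = month_index(last) - month_index(first) + 1
numeric_range = last - first

# December to January is one calendar month but 89 numeric units.
year_end_jump = 202001 - 201912

print(calendar_months, numeric_range, year_end_jump)
```

The 22 calendar months agree with the 22 distinct values reported, which suggests every month in the span is present.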
WEEK_NUM
Real number (ℝ)
ZEROS 
| Distinct | 92 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.76903618 |
| Minimum | 0 |
| Maximum | 91 |
| Zeros | 16735 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 23 |
| median | 40 |
| Q3 | 55 |
| 95-th percentile | 86 |
| Maximum | 91 |
| Range | 91 |
| Interquartile range (IQR) | 32 |
Descriptive statistics
| Standard deviation | 23.79798129 |
|---|---|
| Coefficient of variation (CV) | 0.5837268556 |
| Kurtosis | -0.6533283719 |
| Mean | 40.76903618 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 0.2971192891 |
| Sum | 62240416 |
| Variance | 566.3439136 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 51 | 35920 | 2.4% |
| 47 | 31888 | 2.1% |
| 49 | 30938 | 2.0% |
| 45 | 28947 | 1.9% |
| 23 | 26734 | 1.8% |
| 50 | 26244 | 1.7% |
| 44 | 25887 | 1.7% |
| 35 | 24137 | 1.6% |
| 40 | 23907 | 1.6% |
| 32 | 23889 | 1.6% |
| Other values (82) | 1248168 | 81.8% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 16735 | 1.1% |
| 1 | 18841 | 1.2% |
| 2 | 17476 | 1.1% |
| 3 | 16108 | 1.1% |
| 4 | 14309 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 91 | 12674 | 0.8% |
| 90 | 12103 | 0.8% |
| 89 | 13600 | 0.9% |
| 88 | 14234 | 0.9% |
| 87 | 17886 | 1.2% |
target
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.03143727578 |
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 1478665 |
| Zeros (%) | 96.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 11.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1744963994 |
|---|---|
| Coefficient of variation (CV) | 5.550620883 |
| Kurtosis | 26.8419215 |
| Mean | 0.03143727578 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.370464257 |
| Sum | 47994 |
| Variance | 0.03044899341 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=2)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1478665 | 96.9% |
| 1 | 47994 | 3.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1478665 | 96.9% |
| 1 | 47994 | 3.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 47994 | 3.1% |
| 0 | 1478665 | 96.9% |
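Since target takes only the values 0 and 1, its summary statistics are determined by the two counts. The sketch below derives them with the standard formulas for a binary variable. The skewness and kurtosis use the population formulas, so they agree with the reported sample-adjusted figures only approximately, and the class ratio is a derived number that the report does not print.

```python
import math

# Counts copied from the target tables.
n_zero, n_one = 1_478_665, 47_994
n = n_zero + n_one

p = n_one / n                                  # positive rate, equal to the mean
variance = p * (1 - p) * n / (n - 1)           # sample variance (n - 1 denominator)
std = math.sqrt(variance)
skewness = (1 - 2 * p) / math.sqrt(p * (1 - p))           # population formula
excess_kurtosis = (1 - 6 * p * (1 - p)) / (p * (1 - p))   # population formula
negatives_per_positive = n_zero / n_one

print(f"mean={p:.6f} std={std:.6f} variance={variance:.8f}")
print(f"skewness={skewness:.4f} kurtosis={excess_kurtosis:.4f}")
print(f"about {negatives_per_positive:.1f} zeros per one")
```

A ratio of this size means a model that always predicts 0 would already be about 96.9% accurate, so accuracy alone is a weak metric for this target.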