Overview

Dataset statistics

Number of variables7
Number of observations51109
Missing cells0
Missing cells (%)0.0%
Total size in memory2.7 MiB
Average record size in memory56.0 B

Variable types

Numeric7

Alerts

num_group1 has constant value ""Constant
amtdebitincoming_4809443A is highly skewed (γ1 = 70.19188222)Skewed
amtdebitoutgoing_4809440A is highly skewed (γ1 = 74.60829024)Skewed
amtdepositbalance_4809441A is highly skewed (γ1 = 20.37693222)Skewed
amtdepositincoming_4809444A is highly skewed (γ1 = 44.46249394)Skewed
amtdepositoutgoing_4809442A is highly skewed (γ1 = 40.49189338)Skewed
case_id has unique valuesUnique
amtdebitincoming_4809443A has 27053 (52.9%) zerosZeros
amtdebitoutgoing_4809440A has 27286 (53.4%) zerosZeros
amtdepositbalance_4809441A has 32235 (63.1%) zerosZeros
amtdepositincoming_4809444A has 45985 (90.0%) zerosZeros
amtdepositoutgoing_4809442A has 22433 (43.9%) zerosZeros
num_group1 has 51109 (100.0%) zerosZeros

Reproduction

Analysis started2024-02-13 19:53:33.331450
Analysis finished2024-02-13 19:53:33.501451
Duration0.17 seconds
Software versionydata-profiling vv4.6.4
Download configurationconfig.json

Variables

case_id
Real number (ℝ)

UNIQUE 

Distinct51109
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1419514.279
Minimum43801
Maximum2703453
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size399.4 KiB
2024-02-13T20:53:33.644448image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/

Quantile statistics

Minimum43801
5-th percentile204320.8
Q1242241
median1811468
Q31916206
95-th percentile2691270
Maximum2703453
Range2659652
Interquartile range (IQR)1673965

Descriptive statistics

Standard deviation924509.4909
Coefficient of variation (CV)0.6512857988
Kurtosis-1.418908462
Mean1419514.279
Median Absolute Deviation (MAD)853784
Skewness-0.2472184096
Sum7.254995527 × 1010
Variance8.547177988 × 1011
MonotonicityStrictly increasing
2024-02-13T20:53:33.850291image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
43801 1
 
< 0.1%
1881567 1
 
< 0.1%
1881591 1
 
< 0.1%
1881598 1
 
< 0.1%
1881606 1
 
< 0.1%
1881615 1
 
< 0.1%
1881619 1
 
< 0.1%
1881620 1
 
< 0.1%
1881622 1
 
< 0.1%
1881625 1
 
< 0.1%
Other values (51099) 51099
> 99.9%
ValueCountFrequency (%)
43801 1
< 0.1%
43991 1
< 0.1%
44001 1
< 0.1%
44053 1
< 0.1%
44130 1
< 0.1%
ValueCountFrequency (%)
2703453 1
< 0.1%
2703451 1
< 0.1%
2703450 1
< 0.1%
2703448 1
< 0.1%
2703443 1
< 0.1%

amtdebitincoming_4809443A
Real number (ℝ)

SKEWED  ZEROS 

Distinct9704
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7552.901686
Minimum0
Maximum4957852
Zeros27053
Zeros (%)52.9%
Negative0
Negative (%)0.0%
Memory size399.4 KiB
2024-02-13T20:53:34.025871image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q38000
95-th percentile33333.402
Maximum4957852
Range4957852
Interquartile range (IQR)8000

Descriptive statistics

Standard deviation34625.70583
Coefficient of variation (CV)4.584424275
Kurtosis8613.713316
Mean7552.901686
Median Absolute Deviation (MAD)0
Skewness70.19188222
Sum386021252.3
Variance1198939504
MonotonicityNot monotonic
2024-02-13T20:53:34.191477image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 27053
52.9%
10000 909
 
1.8%
6666.6 796
 
1.6%
13333.4 633
 
1.2%
20000 608
 
1.2%
33333.402 582
 
1.1%
16666.6 411
 
0.8%
3333.4001 403
 
0.8%
4000 389
 
0.8%
5000 364
 
0.7%
Other values (9694) 18961
37.1%
ValueCountFrequency (%)
0 27053
52.9%
0.2 2
 
< 0.1%
0.4 2
 
< 0.1%
0.6 5
 
< 0.1%
1.2 2
 
< 0.1%
ValueCountFrequency (%)
4957852 1
< 0.1%
1509287.4 2
< 0.1%
1457532.9 1
< 0.1%
1405000 2
< 0.1%
1209800 1
< 0.1%

amtdebitoutgoing_4809440A
Real number (ℝ)

SKEWED  ZEROS 

Distinct9700
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7462.384278
Minimum0
Maximum5168004.5
Zeros27286
Zeros (%)53.4%
Negative0
Negative (%)0.0%
Memory size399.4 KiB
2024-02-13T20:53:34.350475image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q37740
95-th percentile33333.402
Maximum5168004.5
Range5168004.5
Interquartile range (IQR)7740

Descriptive statistics

Standard deviation35065.28685
Coefficient of variation (CV)4.698938777
Kurtosis9583.802201
Mean7462.384278
Median Absolute Deviation (MAD)0
Skewness74.60829024
Sum381394998.1
Variance1229574342
MonotonicityNot monotonic
2024-02-13T20:53:34.509436image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 27286
53.4%
10000 779
 
1.5%
6666.6 667
 
1.3%
13333.4 557
 
1.1%
20000 536
 
1.0%
33333.402 506
 
1.0%
16666.6 350
 
0.7%
3333.4001 347
 
0.7%
4000 301
 
0.6%
5000 300
 
0.6%
Other values (9690) 19480
38.1%
ValueCountFrequency (%)
0 27286
53.4%
0.2 8
 
< 0.1%
0.4 6
 
< 0.1%
0.6 6
 
< 0.1%
0.8 6
 
< 0.1%
ValueCountFrequency (%)
5168004.5 1
< 0.1%
1502666 2
< 0.1%
1428545.2 1
< 0.1%
1405000 2
< 0.1%
1183026.6 1
< 0.1%

amtdepositbalance_4809441A
Real number (ℝ)

SKEWED  ZEROS 

Distinct8915
Distinct (%)17.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9967.412999
Minimum-335718
Maximum4256314.5
Zeros32235
Zeros (%)63.1%
Negative1
Negative (%)< 0.1%
Memory size399.4 KiB
2024-02-13T20:53:34.669336image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/

Quantile statistics

Minimum-335718
5-th percentile0
Q10
median0
Q3288
95-th percentile18995.16
Maximum4256314.5
Range4592032.5
Interquartile range (IQR)288

Descriptive statistics

Standard deviation89393.42144
Coefficient of variation (CV)8.968568017
Kurtosis573.7938849
Mean9967.412999
Median Absolute Deviation (MAD)0
Skewness20.37693222
Sum509424511
Variance7991183797
MonotonicityNot monotonic
2024-02-13T20:53:34.828362image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 32235
63.1%
229.40001 67
 
0.1%
242.40001 58
 
0.1%
229.8 57
 
0.1%
230.6 55
 
0.1%
230.40001 55
 
0.1%
242.2 52
 
0.1%
231.2 51
 
0.1%
231 51
 
0.1%
230.2 50
 
0.1%
Other values (8905) 18378
36.0%
ValueCountFrequency (%)
-335718 1
 
< 0.1%
0 32235
63.1%
20 1
 
< 0.1%
22 1
 
< 0.1%
40 1
 
< 0.1%
ValueCountFrequency (%)
4256314.5 1
< 0.1%
4219593.5 1
< 0.1%
3469488.8 1
< 0.1%
3159746 1
< 0.1%
2998592.5 1
< 0.1%

amtdepositincoming_4809444A
Real number (ℝ)

SKEWED  ZEROS 

Distinct2625
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2949.3959
Minimum0
Maximum4180150.5
Zeros45985
Zeros (%)90.0%
Negative0
Negative (%)0.0%
Memory size399.4 KiB
2024-02-13T20:53:34.982074image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile3330.52006
Maximum4180150.5
Range4180150.5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation41467.72607
Coefficient of variation (CV)14.05973544
Kurtosis3221.092942
Mean2949.3959
Median Absolute Deviation (MAD)0
Skewness44.46249394
Sum150740675
Variance1719572306
MonotonicityNot monotonic
2024-02-13T20:53:35.140070image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 45985
90.0%
3333.4001 108
 
0.2%
1333.4 107
 
0.2%
666.60004 91
 
0.2%
6666.6 83
 
0.2%
2000 74
 
0.1%
1000 65
 
0.1%
2666.6 63
 
0.1%
333.4 46
 
0.1%
1666.6 45
 
0.1%
Other values (2615) 4442
 
8.7%
ValueCountFrequency (%)
0 45985
90.0%
0.2 5
 
< 0.1%
0.4 4
 
< 0.1%
0.6 8
 
< 0.1%
0.8 3
 
< 0.1%
ValueCountFrequency (%)
4180150.5 1
< 0.1%
3335734.8 1
< 0.1%
1812727.2 1
< 0.1%
1617768.6 1
< 0.1%
1543610.9 1
< 0.1%

amtdepositoutgoing_4809442A
Real number (ℝ)

SKEWED  ZEROS 

Distinct5561
Distinct (%)10.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3586.875118
Minimum0
Maximum4622917.5
Zeros22433
Zeros (%)43.9%
Negative0
Negative (%)0.0%
Memory size399.4 KiB
2024-02-13T20:53:35.300233image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1.8000001
Q35.4
95-th percentile4166.08
Maximum4622917.5
Range4622917.5
Interquartile range (IQR)5.4

Descriptive statistics

Standard deviation48274.93644
Coefficient of variation (CV)13.45877257
Kurtosis2590.812808
Mean3586.875118
Median Absolute Deviation (MAD)1.8000001
Skewness40.49189338
Sum183321600.4
Variance2330469488
MonotonicityNot monotonic
2024-02-13T20:53:35.457188image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 22433
43.9%
3 2029
 
4.0%
1.8000001 1738
 
3.4%
1.6 1720
 
3.4%
2.6000001 1447
 
2.8%
2.8 1027
 
2.0%
2.4 1006
 
2.0%
2 862
 
1.7%
1.4 771
 
1.5%
2.2 760
 
1.5%
Other values (5551) 17316
33.9%
ValueCountFrequency (%)
0 22433
43.9%
0.2 22
 
< 0.1%
0.4 38
 
0.1%
0.6 19
 
< 0.1%
0.8 29
 
0.1%
ValueCountFrequency (%)
4622917.5 1
< 0.1%
3333096 1
< 0.1%
2103462.8 2
< 0.1%
1799577.4 1
< 0.1%
1721187.2 1
< 0.1%

num_group1
Real number (ℝ)

CONSTANT  ZEROS 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0
Minimum0
Maximum0
Zeros51109
Zeros (%)100.0%
Negative0
Negative (%)0.0%
Memory size399.4 KiB
2024-02-13T20:53:35.577188image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum0
Range0
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0
Coefficient of variation (CV)nan
Kurtosis0
Mean0
Median Absolute Deviation (MAD)0
Skewness0
Sum0
Variance0
MonotonicityIncreasing
2024-02-13T20:53:35.680191image/svg+xmlMatplotlib v3.8.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)
ValueCountFrequency (%)
0 51109
100.0%
ValueCountFrequency (%)
0 51109
100.0%
ValueCountFrequency (%)
0 51109
100.0%