Dataset statistics
| Number of variables | 9 |
|---|---|
| Number of observations | 22784 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 1 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 1.6 MiB |
| Average record size in memory | 72.0 B |
Variable types
| NUM | 9 |
|---|
Reproduction
| Analysis started | 2020-08-24 23:55:04.603621 |
|---|---|
| Analysis finished | 2020-08-24 23:55:18.230372 |
| Duration | 13.63 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
| Distinct count | 5818 |
|---|---|
| Unique (%) | 25.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2935.8657391151687 |
|---|---|
| Minimum | 1.0 |
| Maximum | 2819401.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 178.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 41 |
| Q1 | 163 |
| median | 506 |
| Q3 | 1683 |
| 95-th percentile | 10129.7 |
| Maximum | 2819401 |
| Range | 2819400 |
| Interquartile range (IQR) | 1520 |
Descriptive statistics
| Standard deviation | 24949.88017 |
|---|---|
| Coefficient of variation (CV) | 8.498304209 |
| Kurtosis | 7546.166944 |
| Mean | 2935.865739 |
| Median Absolute Deviation (MAD) | 421 |
| Skewness | 74.16685526 |
| Sum | 66890765 |
| Variance | 622496520.4 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 71 | 56 | 0.2% | |
| 39 | 55 | 0.2% | |
| 88 | 55 | 0.2% | |
| 72 | 55 | 0.2% | |
| 63 | 55 | 0.2% | |
| 66 | 54 | 0.2% | |
| 60 | 53 | 0.2% | |
| 44 | 52 | 0.2% | |
| 67 | 51 | 0.2% | |
| 101 | 49 | 0.2% | |
| 56 | 49 | 0.2% | |
| 43 | 49 | 0.2% | |
| 65 | 48 | 0.2% | |
| 68 | 48 | 0.2% | |
| 76 | 48 | 0.2% | |
| 49 | 48 | 0.2% | |
| 55 | 48 | 0.2% | |
| 48 | 48 | 0.2% | |
| 58 | 47 | 0.2% | |
| 103 | 47 | 0.2% | |
| 42 | 46 | 0.2% | |
| 89 | 46 | 0.2% | |
| 83 | 46 | 0.2% | |
| 90 | 46 | 0.2% | |
| 81 | 46 | 0.2% | |
| Other values (5793) | 21539 | 94.5% |
| Value | Count | Frequency (%) | |
| 1 | 3 | < 0.1% | |
| 2 | 7 | < 0.1% | |
| 3 | 10 | < 0.1% | |
| 4 | 12 | 0.1% | |
| 5 | 10 | < 0.1% | |
| 6 | 7 | < 0.1% | |
| 7 | 16 | 0.1% | |
| 8 | 13 | 0.1% | |
| 9 | 20 | 0.1% | |
| 10 | 17 | 0.1% |
| Value | Count | Frequency (%) | |
| 2819401 | 1 | < 0.1% | |
| 1217405 | 1 | < 0.1% | |
| 1025174 | 1 | < 0.1% | |
| 616877 | 1 | < 0.1% | |
| 603075 | 1 | < 0.1% | |
| 406096 | 1 | < 0.1% | |
| 402060 | 1 | < 0.1% | |
| 374057 | 1 | < 0.1% | |
| 369921 | 1 | < 0.1% | |
| 326761 | 1 | < 0.1% |
| Distinct count | 12051 |
|---|---|
| Unique (%) | 52.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.010329851520747923 |
|---|---|
| Minimum | 0.0 |
| Maximum | 0.8944444060325623 |
| Zeros | 7495 |
| Zeros (%) | 32.9% |
| Memory size | 178.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.002361449995 |
| Q3 | 0.007428525132 |
| 95-th percentile | 0.0380582789 |
| Maximum | 0.894444406 |
| Range | 0.894444406 |
| Interquartile range (IQR) | 0.007428525132 |
Descriptive statistics
| Standard deviation | 0.04210519478 |
|---|---|
| Coefficient of variation (CV) | 4.076069699 |
| Kurtosis | 215.0800043 |
| Mean | 0.01032985152 |
| Median Absolute Deviation (MAD) | 0.002361449995 |
| Skewness | 13.28553655 |
| Sum | 235.355337 |
| Variance | 0.001772847428 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 7495 | 32.9% | |
| 0.002793299966 | 12 | 0.1% | |
| 0.004587200005 | 9 | < 0.1% | |
| 0.006134999916 | 9 | < 0.1% | |
| 0.00884960033 | 9 | < 0.1% | |
| 0.003378400113 | 9 | < 0.1% | |
| 0.003787900088 | 9 | < 0.1% | |
| 0.003322300036 | 9 | < 0.1% | |
| 0.005464499816 | 9 | < 0.1% | |
| 0.005050499924 | 9 | < 0.1% | |
| 0.009174300358 | 9 | < 0.1% | |
| 0.004098400008 | 8 | < 0.1% | |
| 0.00150829996 | 8 | < 0.1% | |
| 0.004739299882 | 8 | < 0.1% | |
| 0.006756800227 | 8 | < 0.1% | |
| 0.004854400177 | 8 | < 0.1% | |
| 0.005319099873 | 8 | < 0.1% | |
| 0.002923999913 | 8 | < 0.1% | |
| 0.002898599952 | 8 | < 0.1% | |
| 0.002832900034 | 8 | < 0.1% | |
| 0.0008741000202 | 8 | < 0.1% | |
| 0.002451000037 | 8 | < 0.1% | |
| 0.00657889992 | 8 | < 0.1% | |
| 0.007142900024 | 8 | < 0.1% | |
| 0.003891099943 | 8 | < 0.1% | |
| Other values (12026) | 15084 | 66.2% |
| Value | Count | Frequency (%) | |
| 0 | 7495 | 32.9% | |
| 0.0001784000051 | 1 | < 0.1% | |
| 0.0001907999977 | 1 | < 0.1% | |
| 0.0002007000003 | 1 | < 0.1% | |
| 0.0002021999971 | 1 | < 0.1% | |
| 0.0002062999993 | 1 | < 0.1% | |
| 0.0002103000006 | 1 | < 0.1% | |
| 0.0002137999982 | 1 | < 0.1% | |
| 0.0002154999966 | 1 | < 0.1% | |
| 0.0002205000055 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0.894444406 | 1 | < 0.1% | |
| 0.8916562796 | 1 | < 0.1% | |
| 0.8623024821 | 1 | < 0.1% | |
| 0.8582863808 | 1 | < 0.1% | |
| 0.8419777155 | 1 | < 0.1% | |
| 0.8415951133 | 1 | < 0.1% | |
| 0.8379194736 | 1 | < 0.1% | |
| 0.8333333135 | 1 | < 0.1% | |
| 0.8176327944 | 1 | < 0.1% | |
| 0.8164874911 | 1 | < 0.1% |
P11p3
Real number (ℝ≥0)
| Distinct count | 18765 |
|---|---|
| Unique (%) | 82.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4840210583842532 |
|---|---|
| Minimum | 0.07987560331821443 |
| Maximum | 1.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 178.1 KiB |
Quantile statistics
| Minimum | 0.07987560332 |
|---|---|
| 5-th percentile | 0.3934582919 |
| Q1 | 0.4487662464 |
| median | 0.4833839387 |
| Q3 | 0.5217391253 |
| 95-th percentile | 0.575201562 |
| Maximum | 1 |
| Range | 0.9201243967 |
| Interquartile range (IQR) | 0.07297287881 |
Descriptive statistics
| Standard deviation | 0.06033422973 |
|---|---|
| Coefficient of variation (CV) | 0.1246520759 |
| Kurtosis | 3.677803767 |
| Mean | 0.4840210584 |
| Median Absolute Deviation (MAD) | 0.03641425073 |
| Skewness | -0.2709088479 |
| Sum | 11027.93579 |
| Variance | 0.003640219277 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.5 | 178 | 0.8% | |
| 0.4444443882 | 35 | 0.2% | |
| 0.428571403 | 33 | 0.1% | |
| 0.4615384936 | 33 | 0.1% | |
| 0.4782609046 | 25 | 0.1% | |
| 0.4545454979 | 22 | 0.1% | |
| 0.400000006 | 22 | 0.1% | |
| 0.4705882072 | 21 | 0.1% | |
| 0.4761905074 | 21 | 0.1% | |
| 0.4736841917 | 20 | 0.1% | |
| 0.4666666985 | 19 | 0.1% | |
| 0.5454545021 | 17 | 0.1% | |
| 0.4642857015 | 17 | 0.1% | |
| 0.4499999881 | 17 | 0.1% | |
| 0.4583333135 | 16 | 0.1% | |
| 0.5263158083 | 16 | 0.1% | |
| 0.6000000238 | 16 | 0.1% | |
| 0.5333333015 | 15 | 0.1% | |
| 0.4375 | 15 | 0.1% | |
| 0.46875 | 15 | 0.1% | |
| 0.4482758939 | 14 | 0.1% | |
| 0.4893617034 | 14 | 0.1% | |
| 0.472222209 | 14 | 0.1% | |
| 0.5135135055 | 13 | 0.1% | |
| 0.4827586114 | 13 | 0.1% | |
| Other values (18740) | 22143 | 97.2% |
| Value | Count | Frequency (%) | |
| 0.07987560332 | 1 | < 0.1% | |
| 0.08804479986 | 1 | < 0.1% | |
| 0.09683430195 | 1 | < 0.1% | |
| 0.1053214967 | 1 | < 0.1% | |
| 0.1067615971 | 1 | < 0.1% | |
| 0.1166414991 | 1 | < 0.1% | |
| 0.1202017963 | 1 | < 0.1% | |
| 0.1205001995 | 1 | < 0.1% | |
| 0.1245386973 | 1 | < 0.1% | |
| 0.1284939945 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.9615384936 | 1 | < 0.1% | |
| 0.8974359035 | 1 | < 0.1% | |
| 0.8888888955 | 1 | < 0.1% | |
| 0.8399999738 | 1 | < 0.1% | |
| 0.8303570747 | 1 | < 0.1% | |
| 0.8166667223 | 1 | < 0.1% | |
| 0.8157895207 | 1 | < 0.1% | |
| 0.8125 | 1 | < 0.1% | |
| 0.8115941882 | 1 | < 0.1% |
P16p2
Real number (ℝ≥0)
| Distinct count | 15570 |
|---|---|
| Unique (%) | 68.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.716130718410936 |
|---|---|
| Minimum | 0.2337023019790649 |
| Maximum | 1.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 178.1 KiB |
Quantile statistics
| Minimum | 0.233702302 |
|---|---|
| 5-th percentile | 0.5800441563 |
| Q1 | 0.6622828692 |
| median | 0.7142856717 |
| Q3 | 0.7710386366 |
| 95-th percentile | 0.8604651093 |
| Maximum | 1 |
| Range | 0.766297698 |
| Interquartile range (IQR) | 0.1087557673 |
Descriptive statistics
| Standard deviation | 0.08726447653 |
|---|---|
| Coefficient of variation (CV) | 0.1218555137 |
| Kurtosis | 1.085519526 |
| Mean | 0.7161307184 |
| Median Absolute Deviation (MAD) | 0.05420303345 |
| Skewness | -0.1347583095 |
| Sum | 16316.32229 |
| Variance | 0.007615088865 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.75 | 159 | 0.7% | |
| 0.6666666865 | 147 | 0.6% | |
| 0.8000000119 | 97 | 0.4% | |
| 0.7142856717 | 81 | 0.4% | |
| 0.6999999881 | 65 | 0.3% | |
| 0.777777791 | 60 | 0.3% | |
| 0.7272726893 | 56 | 0.2% | |
| 0.6000000238 | 47 | 0.2% | |
| 0.8333333135 | 46 | 0.2% | |
| 1 | 40 | 0.2% | |
| 0.7333332896 | 40 | 0.2% | |
| 0.6923077106 | 38 | 0.2% | |
| 0.769230783 | 37 | 0.2% | |
| 0.5 | 36 | 0.2% | |
| 0.6875 | 34 | 0.1% | |
| 0.7619047761 | 34 | 0.1% | |
| 0.625 | 34 | 0.1% | |
| 0.6363636255 | 33 | 0.1% | |
| 0.7857143283 | 33 | 0.1% | |
| 0.647058785 | 32 | 0.1% | |
| 0.7058823705 | 31 | 0.1% | |
| 0.722222209 | 31 | 0.1% | |
| 0.8181818128 | 29 | 0.1% | |
| 0.7083333135 | 27 | 0.1% | |
| 0.6842104793 | 27 | 0.1% | |
| Other values (15545) | 21490 | 94.3% |
| Value | Count | Frequency (%) | |
| 0.233702302 | 1 | < 0.1% | |
| 0.2369077057 | 1 | < 0.1% | |
| 0.2455487996 | 1 | < 0.1% | |
| 0.25 | 1 | < 0.1% | |
| 0.2577987015 | 1 | < 0.1% | |
| 0.259837687 | 1 | < 0.1% | |
| 0.2805084884 | 1 | < 0.1% | |
| 0.2903226018 | 1 | < 0.1% | |
| 0.3000000119 | 1 | < 0.1% | |
| 0.3084416091 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 40 | 0.2% | |
| 0.9959893227 | 1 | < 0.1% | |
| 0.9959099889 | 1 | < 0.1% | |
| 0.9958158731 | 1 | < 0.1% | |
| 0.9949749112 | 1 | < 0.1% | |
| 0.993055582 | 1 | < 0.1% | |
| 0.9927746058 | 1 | < 0.1% | |
| 0.9925168157 | 1 | < 0.1% | |
| 0.9925094247 | 1 | < 0.1% | |
| 0.9923912883 | 1 | < 0.1% |
| Distinct count | 10941 |
|---|---|
| Unique (%) | 48.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.05743685887154143 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 9203 |
| Zeros (%) | 40.4% |
| Memory size | 178.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.002538099885 |
| Q3 | 0.02992774919 |
| 95-th percentile | 0.3594396457 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.02992774919 |
Descriptive statistics
| Standard deviation | 0.1398113667 |
|---|---|
| Coefficient of variation (CV) | 2.434175013 |
| Kurtosis | 14.83941661 |
| Mean | 0.05743685887 |
| Median Absolute Deviation (MAD) | 0.002538099885 |
| Skewness | 3.605640596 |
| Sum | 1308.641393 |
| Variance | 0.01954721825 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 9203 | 40.4% | |
| 0.5 | 12 | 0.1% | |
| 0.01030930039 | 11 | < 0.1% | |
| 0.00523559982 | 11 | < 0.1% | |
| 0.25 | 10 | < 0.1% | |
| 0.0142857004 | 10 | < 0.1% | |
| 1 | 10 | < 0.1% | |
| 0.01999999955 | 10 | < 0.1% | |
| 0.005882400088 | 10 | < 0.1% | |
| 0.00408159988 | 10 | < 0.1% | |
| 0.02857140079 | 10 | < 0.1% | |
| 0.04545449838 | 10 | < 0.1% | |
| 0.02439020015 | 9 | < 0.1% | |
| 0.01515149977 | 9 | < 0.1% | |
| 0.01408450026 | 9 | < 0.1% | |
| 0.002036700025 | 9 | < 0.1% | |
| 0.03225810081 | 9 | < 0.1% | |
| 0.003322300036 | 9 | < 0.1% | |
| 0.0256409999 | 9 | < 0.1% | |
| 0.03030299954 | 9 | < 0.1% | |
| 0.003333299886 | 8 | < 0.1% | |
| 0.008928599767 | 8 | < 0.1% | |
| 0.01219510008 | 8 | < 0.1% | |
| 0.01063830033 | 8 | < 0.1% | |
| 0.003717500018 | 8 | < 0.1% | |
| Other values (10916) | 13355 | 58.6% |
| Value | Count | Frequency (%) | |
| 0 | 9203 | 40.4% | |
| 0.0001090000005 | 1 | < 0.1% | |
| 0.0002055999939 | 1 | < 0.1% | |
| 0.0002080999984 | 1 | < 0.1% | |
| 0.0002093999938 | 1 | < 0.1% | |
| 0.0002271999983 | 1 | < 0.1% | |
| 0.0002290999983 | 1 | < 0.1% | |
| 0.0002306999959 | 1 | < 0.1% | |
| 0.0002326999966 | 1 | < 0.1% | |
| 0.0002385999978 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 10 | < 0.1% | |
| 0.9992861748 | 1 | < 0.1% | |
| 0.9972066879 | 1 | < 0.1% | |
| 0.997118175 | 1 | < 0.1% | |
| 0.9969879985 | 1 | < 0.1% | |
| 0.9966102242 | 1 | < 0.1% | |
| 0.9963369966 | 1 | < 0.1% | |
| 0.9950371981 | 1 | < 0.1% | |
| 0.9942921996 | 1 | < 0.1% | |
| 0.9940298796 | 1 | < 0.1% |
| Distinct count | 6002 |
|---|---|
| Unique (%) | 26.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.17151803101478946 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 2553 |
| Zeros (%) | 11.2% |
| Memory size | 178.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.0793591477 |
| median | 0.1478261054 |
| Q3 | 0.2307692021 |
| 95-th percentile | 0.4375 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.1514100544 |
Descriptive statistics
| Standard deviation | 0.1389870691 |
|---|---|
| Coefficient of variation (CV) | 0.8103350316 |
| Kurtosis | 4.25561241 |
| Mean | 0.171518031 |
| Median Absolute Deviation (MAD) | 0.07439608872 |
| Skewness | 1.550698063 |
| Sum | 3907.866819 |
| Variance | 0.01931740537 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 2553 | 11.2% | |
| 0.200000003 | 457 | 2.0% | |
| 0.25 | 431 | 1.9% | |
| 0.1666667014 | 410 | 1.8% | |
| 0.3333333135 | 350 | 1.5% | |
| 0.1428571045 | 316 | 1.4% | |
| 0.125 | 311 | 1.4% | |
| 0.111111097 | 275 | 1.2% | |
| 0.5 | 249 | 1.1% | |
| 0.1000000015 | 243 | 1.1% | |
| 0.09090910107 | 203 | 0.9% | |
| 0.2857142985 | 189 | 0.8% | |
| 0.2222221941 | 179 | 0.8% | |
| 0.08333329856 | 174 | 0.8% | |
| 0.1818182021 | 164 | 0.7% | |
| 0.07692310214 | 154 | 0.7% | |
| 0.1538462043 | 145 | 0.6% | |
| 0.400000006 | 133 | 0.6% | |
| 0.1333332956 | 126 | 0.6% | |
| 0.1176470965 | 125 | 0.5% | |
| 0.07142859697 | 120 | 0.5% | |
| 0.2727273107 | 115 | 0.5% | |
| 0.06666669995 | 108 | 0.5% | |
| 0.05555560067 | 101 | 0.4% | |
| 0.1764705926 | 96 | 0.4% | |
| Other values (5977) | 15057 | 66.1% |
| Value | Count | Frequency (%) | |
| 0 | 2553 | 11.2% | |
| 0.000843900023 | 1 | < 0.1% | |
| 0.001319300034 | 1 | < 0.1% | |
| 0.001453500008 | 1 | < 0.1% | |
| 0.001548000029 | 1 | < 0.1% | |
| 0.001733100042 | 1 | < 0.1% | |
| 0.001818799996 | 1 | < 0.1% | |
| 0.001951199956 | 1 | < 0.1% | |
| 0.002078999998 | 1 | < 0.1% | |
| 0.002481400035 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 47 | 0.2% | |
| 0.9756097794 | 1 | < 0.1% | |
| 0.9452055097 | 1 | < 0.1% | |
| 0.9230769277 | 1 | < 0.1% | |
| 0.913043499 | 1 | < 0.1% | |
| 0.9100719094 | 1 | < 0.1% | |
| 0.9090908766 | 1 | < 0.1% | |
| 0.8928570747 | 1 | < 0.1% | |
| 0.8888888955 | 1 | < 0.1% | |
| 0.8796991706 | 1 | < 0.1% |
H15p1
Real number (ℝ≥0)
| Distinct count | 18583 |
|---|---|
| Unique (%) | 81.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.978000387167453 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 21 |
| Zeros (%) | 0.1% |
| Memory size | 178.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4.939043546 |
| Q1 | 5.548364162 |
| median | 5.958333492 |
| Q3 | 6.363078475 |
| 95-th percentile | 7.203769851 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 0.8147143126 |
Descriptive statistics
| Standard deviation | 0.7586486479 |
|---|---|
| Coefficient of variation (CV) | 0.1269067579 |
| Kurtosis | 5.913719514 |
| Mean | 5.978000387 |
| Median Absolute Deviation (MAD) | 0.4069666862 |
| Skewness | -0.2684810011 |
| Sum | 136202.7608 |
| Variance | 0.5755477709 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 6 | 106 | 0.5% | |
| 5 | 55 | 0.2% | |
| 5.5 | 52 | 0.2% | |
| 5.666666508 | 35 | 0.2% | |
| 5.571428776 | 28 | 0.1% | |
| 6.333333492 | 27 | 0.1% | |
| 5.714285851 | 24 | 0.1% | |
| 7 | 24 | 0.1% | |
| 5.800000191 | 22 | 0.1% | |
| 5.75 | 21 | 0.1% | |
| 0 | 21 | 0.1% | |
| 6.5 | 21 | 0.1% | |
| 5.25 | 20 | 0.1% | |
| 6.25 | 20 | 0.1% | |
| 5.333333492 | 20 | 0.1% | |
| 5.428571224 | 19 | 0.1% | |
| 5.599999905 | 18 | 0.1% | |
| 5.833333492 | 17 | 0.1% | |
| 6.142857075 | 16 | 0.1% | |
| 5.625 | 16 | 0.1% | |
| 5.727272511 | 15 | 0.1% | |
| 5.875 | 15 | 0.1% | |
| 6.400000095 | 15 | 0.1% | |
| 6.666666508 | 15 | 0.1% | |
| 6.166666508 | 15 | 0.1% | |
| Other values (18558) | 22127 | 97.1% |
| Value | Count | Frequency (%) | |
| 0 | 21 | 0.1% | |
| 1 | 1 | < 0.1% | |
| 1.818181753 | 1 | < 0.1% | |
| 1.821428657 | 1 | < 0.1% | |
| 1.84375 | 1 | < 0.1% | |
| 1.850000024 | 1 | < 0.1% | |
| 1.857142925 | 1 | < 0.1% | |
| 1.931034446 | 1 | < 0.1% | |
| 2.072727203 | 1 | < 0.1% | |
| 2.125 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 10 | 1 | < 0.1% | |
| 9.740740776 | 1 | < 0.1% | |
| 9.70370388 | 1 | < 0.1% | |
| 9.51612854 | 1 | < 0.1% | |
| 9.444444656 | 1 | < 0.1% | |
| 9.428571701 | 1 | < 0.1% | |
| 9.322221756 | 1 | < 0.1% | |
| 9.264196396 | 1 | < 0.1% | |
| 9.254901886 | 1 | < 0.1% | |
| 9.253968239 | 1 | < 0.1% |
| Distinct count | 2421 |
|---|---|
| Unique (%) | 10.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4916256467710881 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 4159 |
| Zeros (%) | 18.3% |
| Memory size | 178.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.2432432026 |
| median | 0.5 |
| Q3 | 0.75 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.5067567974 |
Descriptive statistics
| Standard deviation | 0.3316551073 |
|---|---|
| Coefficient of variation (CV) | 0.6746090435 |
| Kurtosis | -1.095900308 |
| Mean | 0.4916256468 |
| Median Absolute Deviation (MAD) | 0.25 |
| Skewness | -0.02660254313 |
| Sum | 11201.19874 |
| Variance | 0.1099951102 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 4159 | 18.3% | |
| 1 | 3332 | 14.6% | |
| 0.5 | 1553 | 6.8% | |
| 0.6666666865 | 990 | 4.3% | |
| 0.3333333135 | 658 | 2.9% | |
| 0.75 | 571 | 2.5% | |
| 0.6000000238 | 425 | 1.9% | |
| 0.8000000119 | 359 | 1.6% | |
| 0.25 | 350 | 1.5% | |
| 0.400000006 | 268 | 1.2% | |
| 0.8333333135 | 245 | 1.1% | |
| 0.7142856717 | 235 | 1.0% | |
| 0.200000003 | 230 | 1.0% | |
| 0.571428597 | 197 | 0.9% | |
| 0.428571403 | 186 | 0.8% | |
| 0.625 | 175 | 0.8% | |
| 0.8571429253 | 160 | 0.7% | |
| 0.375 | 144 | 0.6% | |
| 0.2857142985 | 143 | 0.6% | |
| 0.4444443882 | 137 | 0.6% | |
| 0.555555582 | 130 | 0.6% | |
| 0.777777791 | 119 | 0.5% | |
| 0.1666667014 | 109 | 0.5% | |
| 0.4545454979 | 92 | 0.4% | |
| 0.5454545021 | 90 | 0.4% | |
| Other values (2396) | 7727 | 33.9% |
| Value | Count | Frequency (%) | |
| 0 | 4159 | 18.3% | |
| 0.01923079975 | 1 | < 0.1% | |
| 0.02325580083 | 2 | < 0.1% | |
| 0.02380950004 | 1 | < 0.1% | |
| 0.02500000037 | 1 | < 0.1% | |
| 0.02702699974 | 1 | < 0.1% | |
| 0.02857140079 | 1 | < 0.1% | |
| 0.03030299954 | 1 | < 0.1% | |
| 0.03149610013 | 1 | < 0.1% | |
| 0.03225810081 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 3332 | 14.6% | |
| 0.9857550263 | 1 | < 0.1% | |
| 0.9811320901 | 1 | < 0.1% | |
| 0.9777777791 | 1 | < 0.1% | |
| 0.9772726893 | 1 | < 0.1% | |
| 0.9750000238 | 1 | < 0.1% | |
| 0.9705882072 | 1 | < 0.1% | |
| 0.9666666985 | 1 | < 0.1% | |
| 0.9655172229 | 1 | < 0.1% | |
| 0.9615384936 | 1 | < 0.1% |
target
Real number (ℝ≥0)
| Distinct count | 2045 |
|---|---|
| Unique (%) | 9.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50074.43978230337 |
|---|---|
| Minimum | 0.0 |
| Maximum | 500001.0 |
| Zeros | 52 |
| Zeros (%) | 0.2% |
| Memory size | 178.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 14999 |
| Q1 | 21000 |
| median | 33200 |
| Q3 | 56100 |
| 95-th percentile | 150000 |
| Maximum | 500001 |
| Range | 500001 |
| Interquartile range (IQR) | 35100 |
Descriptive statistics
| Standard deviation | 52843.47555 |
|---|---|
| Coefficient of variation (CV) | 1.055298387 |
| Kurtosis | 20.326923 |
| Mean | 50074.43978 |
| Median Absolute Deviation (MAD) | 15300 |
| Skewness | 3.755120054 |
| Sum | 1140896036 |
| Variance | 2792432908 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 14999 | 3111 | 13.7% | |
| 21300 | 98 | 0.4% | |
| 17500 | 94 | 0.4% | |
| 31300 | 89 | 0.4% | |
| 16300 | 89 | 0.4% | |
| 26300 | 87 | 0.4% | |
| 23800 | 86 | 0.4% | |
| 18800 | 80 | 0.4% | |
| 22500 | 78 | 0.3% | |
| 20000 | 70 | 0.3% | |
| 36300 | 69 | 0.3% | |
| 20300 | 67 | 0.3% | |
| 32500 | 66 | 0.3% | |
| 15000 | 65 | 0.3% | |
| 27500 | 65 | 0.3% | |
| 24100 | 63 | 0.3% | |
| 27300 | 63 | 0.3% | |
| 33800 | 62 | 0.3% | |
| 25600 | 62 | 0.3% | |
| 28800 | 61 | 0.3% | |
| 23300 | 61 | 0.3% | |
| 28300 | 61 | 0.3% | |
| 38800 | 61 | 0.3% | |
| 20600 | 60 | 0.3% | |
| 30300 | 60 | 0.3% | |
| Other values (2020) | 17956 | 78.8% |
| Value | Count | Frequency (%) | |
| 0 | 52 | 0.2% | |
| 14999 | 3111 | 13.7% | |
| 15000 | 65 | 0.3% | |
| 15100 | 23 | 0.1% | |
| 15200 | 38 | 0.2% | |
| 15300 | 31 | 0.1% | |
| 15400 | 33 | 0.1% | |
| 15500 | 29 | 0.1% | |
| 15600 | 50 | 0.2% | |
| 15700 | 36 | 0.2% |
| Value | Count | Frequency (%) | |
| 500001 | 47 | 0.2% | |
| 494200 | 1 | < 0.1% | |
| 492600 | 1 | < 0.1% | |
| 481100 | 1 | < 0.1% | |
| 478600 | 1 | < 0.1% | |
| 471200 | 1 | < 0.1% | |
| 469300 | 1 | < 0.1% | |
| 468400 | 1 | < 0.1% | |
| 466900 | 1 | < 0.1% | |
| 462500 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| P3 | P6p4 | P11p3 | P16p2 | P19p2 | H5p2 | H15p1 | H40p4 | target | |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 7074.0 | 0.004964 | 0.507478 | 0.579729 | 0.036613 | 0.020244 | 6.618784 | 0.774059 | 130600.0 |
| 1 | 597.0 | 0.003871 | 0.480000 | 0.695142 | 0.003350 | 0.170732 | 7.163934 | 0.142857 | 40500.0 |
| 2 | 1931.0 | 0.002320 | 0.477747 | 0.683584 | 0.000000 | 0.117647 | 6.185848 | 0.687500 | 28700.0 |
| 3 | 164.0 | 0.000000 | 0.492505 | 0.780488 | 0.000000 | 0.100000 | 6.619835 | 1.000000 | 28500.0 |
| 4 | 119.0 | 0.000000 | 0.480645 | 0.756302 | 0.672269 | 0.000000 | 6.161616 | 0.000000 | 24100.0 |
| 5 | 164.0 | 0.000000 | 0.431670 | 0.792683 | 0.054878 | 0.250000 | 5.053097 | 0.500000 | 14999.0 |
| 6 | 261.0 | 0.000000 | 0.481328 | 0.724138 | 0.000000 | 0.000000 | 6.592179 | 0.000000 | 33200.0 |
| 7 | 32112.0 | 0.023782 | 0.538467 | 0.614848 | 0.263982 | 0.069680 | 6.711423 | 0.248781 | 66300.0 |
| 8 | 1209.0 | 0.005426 | 0.523906 | 0.674938 | 0.004136 | 0.047138 | 6.731306 | 0.571429 | 138000.0 |
| 9 | 1263.0 | 0.021381 | 0.514254 | 0.885986 | 0.009501 | 0.114286 | 6.551098 | 0.000000 | 140600.0 |
Last rows
| P3 | P6p4 | P11p3 | P16p2 | P19p2 | H5p2 | H15p1 | H40p4 | target | |
|---|---|---|---|---|---|---|---|---|---|
| 22774 | 795.0 | 0.005739 | 0.494500 | 0.755975 | 0.001258 | 0.250000 | 6.662125 | 0.571429 | 61200.0 |
| 22775 | 1143.0 | 0.002326 | 0.478232 | 0.704287 | 0.000000 | 0.109091 | 6.607143 | 0.333333 | 29000.0 |
| 22776 | 3260.0 | 0.000000 | 0.191471 | 0.706442 | 0.000000 | 0.102123 | 4.312871 | 0.475248 | 54500.0 |
| 22777 | 105.0 | 0.000000 | 0.509434 | 0.714286 | 0.000000 | 0.000000 | 5.920455 | 0.000000 | 20000.0 |
| 22778 | 172.0 | 0.000000 | 0.414847 | 0.738372 | 0.424419 | 0.055556 | 6.333333 | 0.000000 | 32500.0 |
| 22779 | 3664.0 | 0.003967 | 0.461217 | 0.674945 | 0.000546 | 0.121739 | 6.584818 | 0.214286 | 38900.0 |
| 22780 | 27037.0 | 0.006755 | 0.488844 | 0.663165 | 0.102415 | 0.181772 | 5.847176 | 0.598338 | 27900.0 |
| 22781 | 376.0 | 0.014330 | 0.561924 | 0.688830 | 0.109043 | 0.166667 | 6.885715 | 0.333333 | 51100.0 |
| 22782 | 113.0 | 0.009804 | 0.516340 | 0.778761 | 0.000000 | 0.000000 | 5.578432 | 0.000000 | 17200.0 |
| 22783 | 2319.0 | 0.009842 | 0.533237 | 0.704183 | 0.015955 | 0.362903 | 6.366131 | 0.600000 | 117700.0 |
Most frequent
| P3 | P6p4 | P11p3 | P16p2 | P19p2 | H5p2 | H15p1 | H40p4 | target | count | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 14.0 | 0.0 | 0.681818 | 0.5 | 0.0 | 0.003636 | 4.444444 | 1.0 | 65600.0 | 2 |