Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 1000000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 145.0 MiB |
| Average record size in memory | 152.0 B |
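The two memory figures above are consistent with each other, which can be checked with a little arithmetic. The sketch below uses only numbers from this report; the reading that every value occupies 8 bytes (for example a 64-bit number) is an assumption inferred from the figures, not something the report states.

```python
# Figures copied from the "Dataset statistics" table above.
MIB = 1024 ** 2            # bytes per MiB
n_rows = 1_000_000         # number of observations
n_columns = 19             # number of variables
total_mib = 145.0          # total size in memory

# Average record size = total bytes / number of rows.
avg_record_bytes = total_mib * MIB / n_rows
print(f"average record size: {avg_record_bytes:.1f} B")

# Assumption: each of the 19 values in a record takes the same space.
bytes_per_value = avg_record_bytes / n_columns
print(f"bytes per value: {bytes_per_value:.2f}")

# Under the 8-bytes-per-value assumption, one column of 1,000,000 rows needs:
column_mib = 8 * n_rows / MIB
print(f"per-column memory: {column_mib:.1f} MiB")
```

Under that assumption the per-column result matches the 7.6 MiB memory size listed for each variable in this report.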
Variable types
| NUM | 11 |
|---|---|
| BOOL | 6 |
| CAT | 2 |
Reproduction
| Analysis started | 2020-08-24 23:44:44.995967 |
|---|---|
| Analysis finished | 2020-08-24 23:47:26.343829 |
| Duration | 2 minutes and 41.35 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
D
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 616741 | 61.7% |
| 0 | 383259 | 38.3% |
Z1
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 630121 | 63.0% |
| 1 | 369879 | 37.0% |
Z2
Real number (ℝ≥0)
| Distinct count | 945079 |
|---|---|
| Unique (%) | 94.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.71160959226799 |
|---|---|
| Minimum | 22.47959899902344 |
| Maximum | 84.88188934326172 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 22.479599 |
|---|---|
| 5-th percentile | 33.92495327 |
| Q1 | 43.24672508 |
| median | 50.78588867 |
| Q3 | 57.68946648 |
| 95-th percentile | 68.41936836 |
| Maximum | 84.88188934 |
| Range | 62.40229034 |
| Interquartile range (IQR) | 14.44274139 |
Descriptive statistics
| Standard deviation | 10.45003446 |
|---|---|
| Coefficient of variation (CV) | 0.2060678914 |
| Kurtosis | -0.7074187591 |
| Mean | 50.71160959 |
| Median Absolute Deviation (MAD) | 7.223049164 |
| Skewness | 0.08102142839 |
| Sum | 50711609.59 |
| Variance | 109.2032202 |
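Several of the Z2 statistics above are derived from others: the range is maximum minus minimum, the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. As a sanity check, the sketch below recomputes them from the reported values; the dictionary names are illustrative only.

```python
import math

# Primary statistics for Z2, copied from the tables above.
z2 = {
    "min": 22.479599,
    "max": 84.88188934,
    "q1": 43.24672508,
    "q3": 57.68946648,
    "mean": 50.71160959,
    "std": 10.45003446,
}

# Derived statistics as printed in the report.
reported = {
    "range": 62.40229034,
    "iqr": 14.44274139,
    "cv": 0.2060678914,
    "variance": 109.2032202,
}

# The same quantities recomputed from the primary statistics.
derived = {
    "range": z2["max"] - z2["min"],
    "iqr": z2["q3"] - z2["q1"],
    "cv": z2["std"] / z2["mean"],
    "variance": z2["std"] ** 2,
}

for name, value in derived.items():
    # The report rounds to about 10 significant digits, so compare loosely.
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.10g} reported={reported[name]:.10g} match={ok}")
```

The same relationships can be used to check the other numeric variables in this report.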
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 55.73774338 | 5 | < 0.1% | |
| 44.63870621 | 4 | < 0.1% | |
| 55.15983963 | 4 | < 0.1% | |
| 43.8342514 | 4 | < 0.1% | |
| 50.97534943 | 4 | < 0.1% | |
| 56.60767746 | 4 | < 0.1% | |
| 56.1217804 | 4 | < 0.1% | |
| 67.56124115 | 4 | < 0.1% | |
| 56.95583344 | 4 | < 0.1% | |
| 44.47788239 | 4 | < 0.1% | |
| 49.75960159 | 4 | < 0.1% | |
| 51.52851868 | 4 | < 0.1% | |
| 43.48391724 | 4 | < 0.1% | |
| 44.03220749 | 4 | < 0.1% | |
| 44.70715714 | 4 | < 0.1% | |
| 50.82204437 | 4 | < 0.1% | |
| 49.62920761 | 4 | < 0.1% | |
| 50.44236755 | 4 | < 0.1% | |
| 50.40220261 | 4 | < 0.1% | |
| 51.34690475 | 4 | < 0.1% | |
| 51.30337906 | 4 | < 0.1% | |
| 43.8418808 | 4 | < 0.1% | |
| 56.72174072 | 4 | < 0.1% | |
| 50.65749359 | 4 | < 0.1% | |
| 56.70775223 | 4 | < 0.1% | |
| Other values (945054) | 999899 | > 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 22.479599 | 1 | < 0.1% | |
| 22.91332626 | 1 | < 0.1% | |
| 22.98233795 | 1 | < 0.1% | |
| 23.02925301 | 1 | < 0.1% | |
| 23.23048019 | 1 | < 0.1% | |
| 23.27106094 | 1 | < 0.1% | |
| 23.27632523 | 1 | < 0.1% | |
| 23.3631134 | 1 | < 0.1% | |
| 23.40042496 | 1 | < 0.1% | |
| 23.41963959 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 84.88188934 | 1 | < 0.1% | |
| 83.52048492 | 1 | < 0.1% | |
| 83.1255722 | 1 | < 0.1% | |
| 82.93149567 | 1 | < 0.1% | |
| 82.22232056 | 1 | < 0.1% | |
| 82.04473877 | 1 | < 0.1% | |
| 82.041008 | 1 | < 0.1% | |
| 82.03862 | 1 | < 0.1% | |
| 81.82262421 | 1 | < 0.1% | |
| 81.81079865 | 1 | < 0.1% |
Z3
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 908983 | 90.9% |
| 1 | 91017 | 9.1% |
Z4
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 933216 | 93.3% |
| 0 | 66784 | 6.7% |
Z5
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 635315 | 63.5% |
| 1 | 364685 | 36.5% |
Z6
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 778600 | 77.9% |
| 0 | 221400 | 22.1% |
Z7
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 832219 | 83.2% |
| 2 | 112578 | 11.3% |
| 0 | 55203 | 5.5% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1055203 | 35.2% |
| . | 1000000 | 33.3% |
| 1 | 832219 | 27.7% |
| 2 | 112578 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2000000 | 66.7% |
| Other Punctuation | 1000000 | 33.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1055203 | 52.8% |
| 1 | 832219 | 41.6% |
| 2 | 112578 | 5.6% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) |
|---|---|---|
| . | 1000000 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 3000000 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1055203 | 35.2% |
| . | 1000000 | 33.3% |
| 1 | 832219 | 27.7% |
| 2 | 112578 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 3000000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1055203 | 35.2% |
| . | 1000000 | 33.3% |
| 1 | 832219 | 27.7% |
| 2 | 112578 | 3.8% |
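The character tables for Z7 can look puzzling next to a value table that lists only 0, 1 and 2, since every value has length 3 and a "." appears once per row. One reading that reproduces the counts is that each value was profiled as a three-character string such as "1.0". That string form is an assumption, not something the report states; the sketch below only shows that it is consistent with the numbers.

```python
from collections import Counter

# Value counts for Z7, copied from the table above.
value_counts = {"1": 832_219, "2": 112_578, "0": 55_203}

char_counts = Counter()
for value, count in value_counts.items():
    text = f"{value}.0"  # assumed string form, e.g. "1.0" (length 3)
    for ch in text:
        # Every row holding this value contributes each of its characters once.
        char_counts[ch] += count

total_chars = sum(char_counts.values())
for ch, n in char_counts.most_common():
    print(f"{ch!r}: {n} ({100 * n / total_chars:.1f}%)")
```

Under this assumption the character "0" is counted once for every row (the trailing digit) plus once more for each row whose value is 0, which is why its count exceeds the number of observations.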
Z8
Real number (ℝ)
| Distinct count | 862733 |
|---|---|
| Unique (%) | 86.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.274871514263747 |
|---|---|
| Minimum | -14.14468479156494 |
| Maximum | 35.740596771240234 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -14.14468479 |
|---|---|
| 5-th percentile | 0.4801328972 |
| Q1 | 0.7664745152 |
| median | 1.400047958 |
| Q3 | 3.302636266 |
| 95-th percentile | 14.26304402 |
| Maximum | 35.74059677 |
| Range | 49.88528156 |
| Interquartile range (IQR) | 2.536161751 |
Descriptive statistics
| Standard deviation | 4.481907266 |
|---|---|
| Coefficient of variation (CV) | 1.368574995 |
| Kurtosis | 4.611595775 |
| Mean | 3.274871514 |
| Median Absolute Deviation (MAD) | 0.7992949486 |
| Skewness | 2.234646919 |
| Sum | 3274871.514 |
| Variance | 20.08749274 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.5222470164 | 7 | < 0.1% | |
| 0.5686479807 | 7 | < 0.1% | |
| 1.02156496 | 7 | < 0.1% | |
| 0.5748149753 | 7 | < 0.1% | |
| 0.5237249732 | 7 | < 0.1% | |
| 0.6798350215 | 6 | < 0.1% | |
| 0.6021890044 | 6 | < 0.1% | |
| 0.6314949989 | 6 | < 0.1% | |
| 0.9160460234 | 6 | < 0.1% | |
| 0.631321013 | 6 | < 0.1% | |
| 0.8512340188 | 6 | < 0.1% | |
| 0.5551149845 | 6 | < 0.1% | |
| 0.6932719946 | 6 | < 0.1% | |
| 0.5551499724 | 6 | < 0.1% | |
| 0.6537529826 | 6 | < 0.1% | |
| 0.660540998 | 6 | < 0.1% | |
| 0.5932610035 | 6 | < 0.1% | |
| 0.5554680228 | 6 | < 0.1% | |
| 0.5683150291 | 6 | < 0.1% | |
| 0.6217349768 | 6 | < 0.1% | |
| 0.5795670152 | 6 | < 0.1% | |
| 0.6810910106 | 6 | < 0.1% | |
| 0.5828120112 | 6 | < 0.1% | |
| 0.5574280024 | 6 | < 0.1% | |
| 0.5912359953 | 6 | < 0.1% | |
| Other values (862708) | 999845 | > 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -14.14468479 | 1 | < 0.1% | |
| -13.32817364 | 1 | < 0.1% | |
| -13.21354771 | 1 | < 0.1% | |
| -12.49219608 | 1 | < 0.1% | |
| -11.86309719 | 1 | < 0.1% | |
| -11.72336292 | 1 | < 0.1% | |
| -11.51307583 | 1 | < 0.1% | |
| -11.45681095 | 1 | < 0.1% | |
| -11.37018013 | 1 | < 0.1% | |
| -11.35507107 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 35.74059677 | 1 | < 0.1% | |
| 35.42272568 | 1 | < 0.1% | |
| 33.62754059 | 1 | < 0.1% | |
| 33.35547638 | 1 | < 0.1% | |
| 33.24124908 | 1 | < 0.1% | |
| 32.73513031 | 1 | < 0.1% | |
| 32.6942215 | 1 | < 0.1% | |
| 32.43099976 | 1 | < 0.1% | |
| 32.24845123 | 1 | < 0.1% | |
| 32.11071396 | 1 | < 0.1% |
Z9
Real number (ℝ)
| Distinct count | 954746 |
|---|---|
| Unique (%) | 95.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 372.7969093925877 |
|---|---|
| Minimum | -882.482177734375 |
| Maximum | 2065.0244140625 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -882.4821777 |
|---|---|
| 5-th percentile | 186.1779961 |
| Q1 | 261.2439499 |
| median | 342.0298462 |
| Q3 | 392.5618362 |
| 95-th percentile | 845.5699036 |
| Maximum | 2065.024414 |
| Range | 2947.506592 |
| Interquartile range (IQR) | 131.3178864 |
Descriptive statistics
| Standard deviation | 201.6037313 |
|---|---|
| Coefficient of variation (CV) | 0.5407870242 |
| Kurtosis | 8.531449867 |
| Mean | 372.7969094 |
| Median Absolute Deviation (MAD) | 63.73150635 |
| Skewness | 2.580498224 |
| Sum | 372796909.4 |
| Variance | 40644.06446 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 260.8553772 | 5 | < 0.1% | |
| 259.3614502 | 4 | < 0.1% | |
| 366.830719 | 4 | < 0.1% | |
| 378.3994446 | 4 | < 0.1% | |
| 388.8059692 | 4 | < 0.1% | |
| 392.1968994 | 4 | < 0.1% | |
| 258.6195374 | 4 | < 0.1% | |
| 363.1756287 | 4 | < 0.1% | |
| 351.6827087 | 4 | < 0.1% | |
| 377.64505 | 4 | < 0.1% | |
| 379.9394531 | 4 | < 0.1% | |
| 310.6798096 | 4 | < 0.1% | |
| 365.9874268 | 4 | < 0.1% | |
| 358.7298584 | 4 | < 0.1% | |
| 308.1872253 | 4 | < 0.1% | |
| 371.8473816 | 4 | < 0.1% | |
| 266.2933655 | 4 | < 0.1% | |
| 392.4492188 | 4 | < 0.1% | |
| 396.9309387 | 4 | < 0.1% | |
| 305.6862488 | 4 | < 0.1% | |
| 385.2670288 | 4 | < 0.1% | |
| 256.6996155 | 4 | < 0.1% | |
| 376.2719421 | 4 | < 0.1% | |
| 376.1568909 | 4 | < 0.1% | |
| 391.7615662 | 4 | < 0.1% | |
| Other values (954721) | 999899 | > 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -882.4821777 | 1 | < 0.1% | |
| -785.8256226 | 1 | < 0.1% | |
| -773.0212402 | 1 | < 0.1% | |
| -744.7147217 | 1 | < 0.1% | |
| -719.6126099 | 1 | < 0.1% | |
| -699.5125732 | 1 | < 0.1% | |
| -684.4309692 | 1 | < 0.1% | |
| -681.9295654 | 1 | < 0.1% | |
| -673.5078125 | 1 | < 0.1% | |
| -668.0759888 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2065.024414 | 1 | < 0.1% | |
| 2024.850708 | 1 | < 0.1% | |
| 2009.595581 | 1 | < 0.1% | |
| 1992.291992 | 1 | < 0.1% | |
| 1989.187256 | 1 | < 0.1% | |
| 1981.489258 | 1 | < 0.1% | |
| 1972.56189 | 1 | < 0.1% | |
| 1971.114014 | 1 | < 0.1% | |
| 1970.685913 | 1 | < 0.1% | |
| 1962.456543 | 1 | < 0.1% |
Z10
Real number (ℝ≥0)
| Distinct count | 715569 |
|---|---|
| Unique (%) | 71.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.499945929758191 |
|---|---|
| Minimum | 1.7229160070419312 |
| Maximum | 4.784540176391602 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 1.722916007 |
|---|---|
| 5-th percentile | 2.694155836 |
| Q1 | 3.277952671 |
| median | 3.53124094 |
| Q3 | 3.760113955 |
| 95-th percentile | 4.165658045 |
| Maximum | 4.784540176 |
| Range | 3.061624169 |
| Interquartile range (IQR) | 0.4821612835 |
Descriptive statistics
| Standard deviation | 0.4223767999 |
|---|---|
| Coefficient of variation (CV) | 0.12068095 |
| Kurtosis | 0.06695382307 |
| Mean | 3.49994593 |
| Median Absolute Deviation (MAD) | 0.2404866219 |
| Skewness | -0.4180750386 |
| Sum | 3499945.93 |
| Variance | 0.1784021611 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 3.725474119 | 10 | < 0.1% | |
| 3.546597004 | 9 | < 0.1% | |
| 3.501540899 | 9 | < 0.1% | |
| 3.74447608 | 8 | < 0.1% | |
| 3.511399031 | 8 | < 0.1% | |
| 3.368366957 | 8 | < 0.1% | |
| 3.53716898 | 8 | < 0.1% | |
| 3.504977942 | 8 | < 0.1% | |
| 3.319776058 | 8 | < 0.1% | |
| 3.763633966 | 8 | < 0.1% | |
| 3.532329082 | 8 | < 0.1% | |
| 3.555906057 | 8 | < 0.1% | |
| 3.503269911 | 8 | < 0.1% | |
| 3.553623915 | 8 | < 0.1% | |
| 3.279759884 | 8 | < 0.1% | |
| 3.741811991 | 8 | < 0.1% | |
| 3.561414957 | 7 | < 0.1% | |
| 3.538841009 | 7 | < 0.1% | |
| 3.687743902 | 7 | < 0.1% | |
| 3.314248085 | 7 | < 0.1% | |
| 3.503576994 | 7 | < 0.1% | |
| 3.528002024 | 7 | < 0.1% | |
| 3.548182011 | 7 | < 0.1% | |
| 3.553421974 | 7 | < 0.1% | |
| 3.521962881 | 7 | < 0.1% | |
| Other values (715544) | 999805 | > 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.722916007 | 1 | < 0.1% | |
| 1.770863056 | 1 | < 0.1% | |
| 1.772652984 | 1 | < 0.1% | |
| 1.773419023 | 1 | < 0.1% | |
| 1.788416028 | 1 | < 0.1% | |
| 1.79669404 | 1 | < 0.1% | |
| 1.815662026 | 1 | < 0.1% | |
| 1.818061948 | 1 | < 0.1% | |
| 1.830103993 | 1 | < 0.1% | |
| 1.831709981 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 4.784540176 | 1 | < 0.1% | |
| 4.766119003 | 1 | < 0.1% | |
| 4.745099068 | 1 | < 0.1% | |
| 4.734613895 | 1 | < 0.1% | |
| 4.726620197 | 1 | < 0.1% | |
| 4.705323219 | 1 | < 0.1% | |
| 4.698208809 | 1 | < 0.1% | |
| 4.696437836 | 1 | < 0.1% | |
| 4.69584322 | 1 | < 0.1% | |
| 4.695724964 | 1 | < 0.1% |
Z11
Real number (ℝ)
| Distinct count | 980527 |
|---|---|
| Unique (%) | 98.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 86.94101598177527 |
|---|---|
| Minimum | -156.5341033935547 |
| Maximum | 632.386962890625 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -156.5341034 |
|---|---|
| 5-th percentile | 18.50917807 |
| Q1 | 42.41050339 |
| median | 56.86836052 |
| Q3 | 104.7587852 |
| 95-th percentile | 278.1340363 |
| Maximum | 632.3869629 |
| Range | 788.9210663 |
| Interquartile range (IQR) | 62.34828186 |
Descriptive statistics
| Standard deviation | 78.90432666 |
|---|---|
| Coefficient of variation (CV) | 0.9075615895 |
| Kurtosis | 4.594697272 |
| Mean | 86.94101598 |
| Median Absolute Deviation (MAD) | 22.13475609 |
| Skewness | 2.127324533 |
| Sum | 86941015.98 |
| Variance | 6225.892765 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 44.95726776 | 4 | < 0.1% | |
| 45.50425339 | 4 | < 0.1% | |
| 67.9723053 | 4 | < 0.1% | |
| 75.16859436 | 4 | < 0.1% | |
| 49.06213379 | 4 | < 0.1% | |
| 74.21747589 | 4 | < 0.1% | |
| 45.5927887 | 4 | < 0.1% | |
| 44.34004211 | 3 | < 0.1% | |
| 38.30842972 | 3 | < 0.1% | |
| 51.91654587 | 3 | < 0.1% | |
| 72.32200623 | 3 | < 0.1% | |
| 42.37286377 | 3 | < 0.1% | |
| 25.98135757 | 3 | < 0.1% | |
| 47.5258522 | 3 | < 0.1% | |
| 47.98347473 | 3 | < 0.1% | |
| 48.7043457 | 3 | < 0.1% | |
| 74.64677429 | 3 | < 0.1% | |
| 65.5815506 | 3 | < 0.1% | |
| 104.8464737 | 3 | < 0.1% | |
| 71.79456329 | 3 | < 0.1% | |
| 50.08789444 | 3 | < 0.1% | |
| 45.04151917 | 3 | < 0.1% | |
| 133.2732544 | 3 | < 0.1% | |
| 53.26330185 | 3 | < 0.1% | |
| 68.32317352 | 3 | < 0.1% | |
| Other values (980502) | 999918 | > 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -156.5341034 | 1 | < 0.1% | |
| -154.3092651 | 1 | < 0.1% | |
| -145.3622742 | 1 | < 0.1% | |
| -144.1547852 | 1 | < 0.1% | |
| -142.9697723 | 1 | < 0.1% | |
| -141.4416962 | 1 | < 0.1% | |
| -139.4296722 | 1 | < 0.1% | |
| -134.3579865 | 1 | < 0.1% | |
| -131.7428284 | 1 | < 0.1% | |
| -131.5534515 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 632.3869629 | 1 | < 0.1% | |
| 629.1333008 | 1 | < 0.1% | |
| 613.4134521 | 1 | < 0.1% | |
| 609.0568237 | 1 | < 0.1% | |
| 598.4865723 | 1 | < 0.1% | |
| 597.37146 | 1 | < 0.1% | |
| 596.1638794 | 1 | < 0.1% | |
| 595.6636963 | 1 | < 0.1% | |
| 590.8909912 | 1 | < 0.1% | |
| 589.8619385 | 1 | < 0.1% |
Z12
Real number (ℝ)
| Distinct count | 976926 |
|---|---|
| Unique (%) | 97.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1833.7611590784807 |
|---|---|
| Minimum | -7466.93212890625 |
| Maximum | 18792.521484375 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -7466.932129 |
|---|---|
| 5-th percentile | 546.5758514 |
| Q1 | 879.5376892 |
| median | 1095.608337 |
| Q3 | 1794.382599 |
| 95-th percentile | 6874.935352 |
| Maximum | 18792.52148 |
| Range | 26259.45361 |
| Interquartile range (IQR) | 914.8449097 |
Descriptive statistics
| Standard deviation | 2021.985799 |
|---|---|
| Coefficient of variation (CV) | 1.102644032 |
| Kurtosis | 7.454729209 |
| Mean | 1833.761159 |
| Median Absolute Deviation (MAD) | 351.0852661 |
| Skewness | 2.66779536 |
| Sum | 1833761159 |
| Variance | 4088426.572 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1077.641235 | 5 | < 0.1% | |
| 997.1378784 | 4 | < 0.1% | |
| 1028.798096 | 4 | < 0.1% | |
| 884.3886719 | 4 | < 0.1% | |
| 1021.8125 | 4 | < 0.1% | |
| 1028.904541 | 4 | < 0.1% | |
| 1352.629272 | 4 | < 0.1% | |
| 1062.356567 | 4 | < 0.1% | |
| 1227.168579 | 4 | < 0.1% | |
| 1025.69043 | 4 | < 0.1% | |
| 886.7827759 | 4 | < 0.1% | |
| 969.1589966 | 4 | < 0.1% | |
| 954.8666382 | 4 | < 0.1% | |
| 872.7441406 | 4 | < 0.1% | |
| 1126.799683 | 4 | < 0.1% | |
| 1066.2146 | 3 | < 0.1% | |
| 1080.164307 | 3 | < 0.1% | |
| 926.9638062 | 3 | < 0.1% | |
| 890.6593018 | 3 | < 0.1% | |
| 932.3843384 | 3 | < 0.1% | |
| 1000.265015 | 3 | < 0.1% | |
| 804.7680664 | 3 | < 0.1% | |
| 1021.609985 | 3 | < 0.1% | |
| 957.0615845 | 3 | < 0.1% | |
| 848.3918457 | 3 | < 0.1% | |
| Other values (976901) | 999909 | > 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -7466.932129 | 1 | < 0.1% | |
| -7098.428223 | 1 | < 0.1% | |
| -7091.564941 | 1 | < 0.1% | |
| -6943.875977 | 1 | < 0.1% | |
| -6861.82373 | 1 | < 0.1% | |
| -6821.995605 | 1 | < 0.1% | |
| -6764.158691 | 1 | < 0.1% | |
| -6662.955078 | 1 | < 0.1% | |
| -6385.239258 | 1 | < 0.1% | |
| -6335.787598 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 18792.52148 | 1 | < 0.1% | |
| 18349.06836 | 1 | < 0.1% | |
| 17912.83008 | 1 | < 0.1% | |
| 17704.63086 | 1 | < 0.1% | |
| 17577.88477 | 1 | < 0.1% | |
| 17522.11133 | 1 | < 0.1% | |
| 17459.69141 | 1 | < 0.1% | |
| 17099.78906 | 1 | < 0.1% | |
| 17022.59375 | 1 | < 0.1% | |
| 16855.29102 | 1 | < 0.1% |
Z13
Real number (ℝ≥0)
| Distinct count | 969238 |
|---|---|
| Unique (%) | 96.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 118.33384144998897 |
|---|---|
| Minimum | 1.739421010017395 |
| Maximum | 433.5694274902344 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 1.73942101 |
|---|---|
| 5-th percentile | 53.56120949 |
| Q1 | 81.65940666 |
| median | 105.1774139 |
| Q3 | 143.1369324 |
| 95-th percentile | 238.6270798 |
| Maximum | 433.5694275 |
| Range | 431.8300065 |
| Interquartile range (IQR) | 61.47752571 |
Descriptive statistics
| Standard deviation | 54.84641219 |
|---|---|
| Coefficient of variation (CV) | 0.463488817 |
| Kurtosis | 1.549024714 |
| Mean | 118.3338414 |
| Median Absolute Deviation (MAD) | 28.73896408 |
| Skewness | 1.279470339 |
| Sum | 118333841.4 |
| Variance | 3008.12893 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 88.82189941 | 4 | < 0.1% | |
| 64.62203979 | 4 | < 0.1% | |
| 86.58628845 | 4 | < 0.1% | |
| 87.67505646 | 4 | < 0.1% | |
| 86.31182098 | 4 | < 0.1% | |
| 85.30688477 | 4 | < 0.1% | |
| 83.57150269 | 4 | < 0.1% | |
| 86.31078339 | 4 | < 0.1% | |
| 142.3710175 | 4 | < 0.1% | |
| 84.3239975 | 4 | < 0.1% | |
| 94.84861755 | 4 | < 0.1% | |
| 83.59587097 | 4 | < 0.1% | |
| 146.0527649 | 4 | < 0.1% | |
| 84.36457825 | 4 | < 0.1% | |
| 88.14942169 | 4 | < 0.1% | |
| 87.99341583 | 4 | < 0.1% | |
| 141.10112 | 4 | < 0.1% | |
| 146.8600769 | 4 | < 0.1% | |
| 87.76580811 | 4 | < 0.1% | |
| 92.12950897 | 4 | < 0.1% | |
| 146.2153931 | 4 | < 0.1% | |
| 81.89546204 | 3 | < 0.1% | |
| 91.01166534 | 3 | < 0.1% | |
| 96.67487335 | 3 | < 0.1% | |
| 91.91039276 | 3 | < 0.1% | |
| Other values (969213) | 999904 | > 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.73942101 | 1 | < 0.1% | |
| 6.589003086 | 1 | < 0.1% | |
| 7.881835938 | 1 | < 0.1% | |
| 10.35864353 | 1 | < 0.1% | |
| 10.77004433 | 1 | < 0.1% | |
| 11.3925066 | 1 | < 0.1% | |
| 12.3207283 | 1 | < 0.1% | |
| 13.08386421 | 1 | < 0.1% | |
| 13.16877556 | 1 | < 0.1% | |
| 13.27734375 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 433.5694275 | 1 | < 0.1% | |
| 415.7616272 | 1 | < 0.1% | |
| 415.3772888 | 1 | < 0.1% | |
| 414.0134277 | 1 | < 0.1% | |
| 407.688446 | 1 | < 0.1% | |
| 405.5202026 | 1 | < 0.1% | |
| 405.4505005 | 1 | < 0.1% | |
| 405.0487061 | 1 | < 0.1% | |
| 404.3439636 | 1 | < 0.1% | |
| 403.8058472 | 1 | < 0.1% |
Z14
Real number (ℝ)
| Distinct count | 966659 |
|---|---|
| Unique (%) | 96.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 122.50148239126713 |
|---|---|
| Minimum | -91.03424072265624 |
| Maximum | 588.9686889648438 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -91.03424072 |
|---|---|
| 5-th percentile | 56.19245663 |
| Q1 | 86.44034767 |
| median | 108.0734978 |
| Q3 | 139.4275131 |
| 95-th percentile | 265.6085342 |
| Maximum | 588.968689 |
| Range | 680.0029297 |
| Interquartile range (IQR) | 52.98716545 |
Descriptive statistics
| Standard deviation | 61.50756736 |
|---|---|
| Coefficient of variation (CV) | 0.5020965148 |
| Kurtosis | 3.813303719 |
| Mean | 122.5014824 |
| Median Absolute Deviation (MAD) | 25.09290695 |
| Skewness | 1.833155147 |
| Sum | 122501482.4 |
| Variance | 3783.180843 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 70.932724 | 4 | < 0.1% | |
| 116.4107285 | 4 | < 0.1% | |
| 87.70752716 | 4 | < 0.1% | |
| 105.8251648 | 4 | < 0.1% | |
| 87.08421326 | 4 | < 0.1% | |
| 114.4018936 | 4 | < 0.1% | |
| 129.1886597 | 4 | < 0.1% | |
| 111.5177994 | 4 | < 0.1% | |
| 83.80166626 | 4 | < 0.1% | |
| 101.5060425 | 4 | < 0.1% | |
| 107.2963638 | 4 | < 0.1% | |
| 111.0473251 | 4 | < 0.1% | |
| 90.34593964 | 4 | < 0.1% | |
| 102.6699753 | 4 | < 0.1% | |
| 108.1455078 | 4 | < 0.1% | |
| 111.9825287 | 4 | < 0.1% | |
| 109.6106796 | 4 | < 0.1% | |
| 105.1541824 | 4 | < 0.1% | |
| 108.4013596 | 4 | < 0.1% | |
| 115.3120651 | 4 | < 0.1% | |
| 116.1830902 | 3 | < 0.1% | |
| 105.4088516 | 3 | < 0.1% | |
| 148.8215942 | 3 | < 0.1% | |
| 113.1359329 | 3 | < 0.1% | |
| 116.3662415 | 3 | < 0.1% | |
| Other values (966634) | 999905 | > 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -91.03424072 | 1 | < 0.1% | |
| -86.03147888 | 1 | < 0.1% | |
| -81.64179993 | 1 | < 0.1% | |
| -81.0532074 | 1 | < 0.1% | |
| -78.70722198 | 1 | < 0.1% | |
| -74.81435394 | 1 | < 0.1% | |
| -69.44786835 | 1 | < 0.1% | |
| -64.73968506 | 1 | < 0.1% | |
| -62.54127121 | 1 | < 0.1% | |
| -61.6729126 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 588.968689 | 1 | < 0.1% | |
| 559.3779907 | 1 | < 0.1% | |
| 543.5644531 | 1 | < 0.1% | |
| 541.0971069 | 1 | < 0.1% | |
| 532.9862671 | 1 | < 0.1% | |
| 522.7778931 | 1 | < 0.1% | |
| 520.6544189 | 1 | < 0.1% | |
| 518.3247681 | 1 | < 0.1% | |
| 517.5400391 | 1 | < 0.1% | |
| 515.0258789 | 1 | < 0.1% |
Z15
Real number (ℝ≥0)
| Distinct count | 968420 |
|---|---|
| Unique (%) | 96.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 254.4475520729866 |
|---|---|
| Minimum | 12.134506225585938 |
| Maximum | 730.5150756835938 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 12.13450623 |
|---|---|
| 5-th percentile | 110.0112743 |
| Q1 | 186.4013481 |
| median | 247.3895035 |
| Q3 | 309.4531555 |
| 95-th percentile | 445.1947723 |
| Maximum | 730.5150757 |
| Range | 718.3805695 |
| Interquartile range (IQR) | 123.0518074 |
Descriptive statistics
| Standard deviation | 99.05169557 |
|---|---|
| Coefficient of variation (CV) | 0.3892813853 |
| Kurtosis | -0.03711787 |
| Mean | 254.4475521 |
| Median Absolute Deviation (MAD) | 61.592659 |
| Skewness | 0.5323944621 |
| Sum | 254447552.1 |
| Variance | 9811.238394 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 198.736618 | 5 | < 0.1% | |
| 241.8627472 | 4 | < 0.1% | |
| 311.2606201 | 4 | < 0.1% | |
| 289.3786011 | 4 | < 0.1% | |
| 333.3598938 | 4 | < 0.1% | |
| 196.7786255 | 4 | < 0.1% | |
| 293.9570312 | 4 | < 0.1% | |
| 262.6108704 | 4 | < 0.1% | |
| 303.7745361 | 4 | < 0.1% | |
| 299.7395325 | 4 | < 0.1% | |
| 258.2235413 | 4 | < 0.1% | |
| 293.5064697 | 4 | < 0.1% | |
| 245.0899658 | 4 | < 0.1% | |
| 263.0545349 | 4 | < 0.1% | |
| 284.0815125 | 4 | < 0.1% | |
| 318.4844666 | 4 | < 0.1% | |
| 269.1048279 | 4 | < 0.1% | |
| 260.3947754 | 4 | < 0.1% | |
| 273.0010986 | 4 | < 0.1% | |
| 312.1871033 | 4 | < 0.1% | |
| 258.9861755 | 4 | < 0.1% | |
| 301.511261 | 4 | < 0.1% | |
| 292.6485901 | 4 | < 0.1% | |
| 308.3053894 | 4 | < 0.1% | |
| 261.8820496 | 4 | < 0.1% | |
| Other values (968395) | 999899 | > 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 12.13450623 | 1 | < 0.1% | |
| 13.46444321 | 1 | < 0.1% | |
| 15.19852829 | 1 | < 0.1% | |
| 19.82350159 | 1 | < 0.1% | |
| 21.63874817 | 1 | < 0.1% | |
| 24.50969887 | 1 | < 0.1% | |
| 25.49705124 | 1 | < 0.1% | |
| 25.63045502 | 1 | < 0.1% | |
| 27.36805153 | 1 | < 0.1% | |
| 28.03575516 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 730.5150757 | 1 | < 0.1% | |
| 700.5214233 | 1 | < 0.1% | |
| 685.1069946 | 1 | < 0.1% | |
| 670.9595947 | 1 | < 0.1% | |
| 670.5437622 | 1 | < 0.1% | |
| 669.2147217 | 1 | < 0.1% | |
| 667.5880737 | 1 | < 0.1% | |
| 666.6210327 | 1 | < 0.1% | |
| 665.7985229 | 1 | < 0.1% | |
| 662.9226685 | 1 | < 0.1% |
Z16
Real number (ℝ≥0)
| Distinct count | 819131 |
|---|---|
| Unique (%) | 81.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.72546184243536 |
|---|---|
| Minimum | 7.001218795776367 |
| Maximum | 16.992422103881836 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 7.001218796 |
|---|---|
| 5-th percentile | 9.552613115 |
| Q1 | 10.04733443 |
| median | 10.55051041 |
| Q3 | 11.03195119 |
| 95-th percentile | 12.99760518 |
| Maximum | 16.9924221 |
| Range | 9.991203308 |
| Interquartile range (IQR) | 0.9846167564 |
Descriptive statistics
| Standard deviation | 1.017053943 |
|---|---|
| Coefficient of variation (CV) | 0.09482612107 |
| Kurtosis | 2.435096066 |
| Mean | 10.72546184 |
| Median Absolute Deviation (MAD) | 0.4919743538 |
| Skewness | 1.476985115 |
| Sum | 10725461.84 |
| Variance | 1.034398723 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 10.89664364 | 8 | < 0.1% | |
| 11.00351048 | 7 | < 0.1% | |
| 10.06369495 | 7 | < 0.1% | |
| 11.12623119 | 7 | < 0.1% | |
| 10.52701092 | 7 | < 0.1% | |
| 10.59115314 | 7 | < 0.1% | |
| 10.0802393 | 7 | < 0.1% | |
| 10.15571117 | 7 | < 0.1% | |
| 10.49715424 | 7 | < 0.1% | |
| 10.54247665 | 6 | < 0.1% | |
| 10.68694973 | 6 | < 0.1% | |
| 10.57380486 | 6 | < 0.1% | |
| 10.60359001 | 6 | < 0.1% | |
| 10.07097912 | 6 | < 0.1% | |
| 10.53507996 | 6 | < 0.1% | |
| 10.63565826 | 6 | < 0.1% | |
| 10.12573528 | 6 | < 0.1% | |
| 10.05338287 | 6 | < 0.1% | |
| 10.54221821 | 6 | < 0.1% | |
| 10.12690258 | 6 | < 0.1% | |
| 11.00956345 | 6 | < 0.1% | |
| 10.12915516 | 6 | < 0.1% | |
| 10.14281082 | 6 | < 0.1% | |
| 10.15753841 | 6 | < 0.1% | |
| 10.55771828 | 6 | < 0.1% | |
| Other values (819106) | 999840 | > 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 7.001218796 | 1 | < 0.1% | |
| 7.432777882 | 1 | < 0.1% | |
| 7.639432907 | 1 | < 0.1% | |
| 7.710903168 | 1 | < 0.1% | |
| 7.754797935 | 1 | < 0.1% | |
| 7.825153828 | 1 | < 0.1% | |
| 7.830566883 | 1 | < 0.1% | |
| 7.833673954 | 1 | < 0.1% | |
| 7.844981194 | 1 | < 0.1% | |
| 7.846840858 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 16.9924221 | 1 | < 0.1% | |
| 16.96262932 | 1 | < 0.1% | |
| 16.86690331 | 1 | < 0.1% | |
| 16.82704163 | 1 | < 0.1% | |
| 16.74712563 | 1 | < 0.1% | |
| 16.70646095 | 1 | < 0.1% | |
| 16.70506477 | 1 | < 0.1% | |
| 16.67226219 | 1 | < 0.1% | |
| 16.66664314 | 1 | < 0.1% | |
| 16.64833069 | 1 | < 0.1% |
Z17
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 461203 | 46.1% |
| 0 | 253844 | 25.4% |
| 2 | 189283 | 18.9% |
| 3 | 95670 | 9.6% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1253844 | 41.8% |
| . | 1000000 | 33.3% |
| 1 | 461203 | 15.4% |
| 2 | 189283 | 6.3% |
| 3 | 95670 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2000000 | 66.7% |
| Other Punctuation | 1000000 | 33.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1253844 | 62.7% |
| 1 | 461203 | 23.1% |
| 2 | 189283 | 9.5% |
| 3 | 95670 | 4.8% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) |
|---|---|---|
| . | 1000000 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 3000000 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1253844 | 41.8% |
| . | 1000000 | 33.3% |
| 1 | 461203 | 15.4% |
| 2 | 189283 | 6.3% |
| 3 | 95670 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 3000000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1253844 | 41.8% |
| . | 1000000 | 33.3% |
| 1 | 461203 | 15.4% |
| 2 | 189283 | 6.3% |
| 3 | 95670 | 3.2% |
target
Real number (ℝ)
| Distinct count | 978852 |
|---|---|
| Unique (%) | 97.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1917.5776777154279 |
|---|---|
| Minimum | -756.8543701171875 |
| Maximum | 5856.17919921875 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -756.8543701 |
|---|---|
| 5-th percentile | 338.1485519 |
| Q1 | 1133.135864 |
| median | 1745.583008 |
| Q3 | 2592.773132 |
| 95-th percentile | 3969.409265 |
| Maximum | 5856.179199 |
| Range | 6613.033569 |
| Interquartile range (IQR) | 1459.637268 |
Descriptive statistics
| Standard deviation | 1103.621876 |
|---|---|
| Coefficient of variation (CV) | 0.5755291633 |
| Kurtosis | -0.5221327473 |
| Mean | 1917.577678 |
| Median Absolute Deviation (MAD) | 728.5579834 |
| Skewness | 0.4665702559 |
| Sum | 1917577678 |
| Variance | 1217981.246 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2371.098633 | 5 | < 0.1% | |
| 2656.067139 | 4 | < 0.1% | |
| 2358.689697 | 4 | < 0.1% | |
| 3262.929443 | 4 | < 0.1% | |
| 2517.139648 | 4 | < 0.1% | |
| 1181.394531 | 4 | < 0.1% | |
| 2433.829102 | 4 | < 0.1% | |
| 1864.750244 | 3 | < 0.1% | |
| 1699.38269 | 3 | < 0.1% | |
| 2437.641602 | 3 | < 0.1% | |
| 1142.294189 | 3 | < 0.1% | |
| 2349.006836 | 3 | < 0.1% | |
| 1284.59436 | 3 | < 0.1% | |
| 2498.487793 | 3 | < 0.1% | |
| 2236.104004 | 3 | < 0.1% | |
| 1124.357178 | 3 | < 0.1% | |
| 1195.841064 | 3 | < 0.1% | |
| 1812.171265 | 3 | < 0.1% | |
| 2638.874512 | 3 | < 0.1% | |
| 2538.521973 | 3 | < 0.1% | |
| 1492.156006 | 3 | < 0.1% | |
| 2158.355713 | 3 | < 0.1% | |
| 2350.153076 | 3 | < 0.1% | |
| 1147.388306 | 3 | < 0.1% | |
| 1156.76123 | 3 | < 0.1% | |
| Other values (978827) | 999917 | > 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -756.8543701 | 1 | < 0.1% | |
| -713.2109375 | 1 | < 0.1% | |
| -665.6916504 | 1 | < 0.1% | |
| -647.8264771 | 1 | < 0.1% | |
| -630.9935303 | 1 | < 0.1% | |
| -618.1260986 | 1 | < 0.1% | |
| -616.6835938 | 1 | < 0.1% | |
| -615.9442139 | 1 | < 0.1% | |
| -610.2573853 | 1 | < 0.1% | |
| -609.6702271 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 5856.179199 | 1 | < 0.1% | |
| 5818.996582 | 1 | < 0.1% | |
| 5775.712402 | 1 | < 0.1% | |
| 5652.749512 | 1 | < 0.1% | |
| 5640.875977 | 1 | < 0.1% | |
| 5637.857422 | 1 | < 0.1% | |
| 5634.987793 | 1 | < 0.1% | |
| 5634.622559 | 1 | < 0.1% | |
| 5621.477539 | 1 | < 0.1% | |
| 5612.711914 | 1 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for an exactly linear relationship the steepness of the line does not affect r; only the direction of the slope determines its sign. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
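The calculation described above can be sketched in plain Python. This is a minimal illustration rather than the code the profiling tool runs; the function name `pearson_r` and the sample data are made up for the example.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors of the covariance and the standard deviations cancel,
    # so plain sums of (cross-)deviations are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)

x = [1, 2, 3, 4]
y = [2, 4, 6, 8]
r_same = pearson_r(x, y)                          # exactly linear, increasing
r_shifted = pearson_r(x, [3 * v + 7 for v in y])  # rescaled and shifted y
r_flipped = pearson_r(x, [-v for v in y])         # exactly linear, decreasing
```

Rescaling and shifting `y` leaves r unchanged, while negating it flips the sign, which illustrates the invariance property mentioned above.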
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
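As a rough sketch of this definition, the example below ranks both variables (giving tied values their average rank, which is one common convention and an assumption here) and then applies the Pearson formula to the ranks. The helper names and sample data are illustrative only.

```python
from math import sqrt

def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)

def spearman_rho(x, y):
    """Pearson correlation computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]       # monotonic but not linear
rho = spearman_rho(x, y)
r = pearson_r(x, y)
```

For this cubic relationship ρ reaches its maximum because the ordering is preserved exactly, whereas Pearson's r on the raw values stays below 1.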
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
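The pair-counting definition above corresponds to the variant usually called τ-a. The sketch below implements that variant directly; it is an illustration with made-up names and data, and library implementations often use a tie-adjusted variant (τ-b) that can give different values when ties are present.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs; tied pairs count as neither."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables order the pair the same way
        elif sign < 0:
            discordant += 1   # the variables order the pair in opposite ways
    n = len(x)
    total_pairs = n * (n - 1) / 2
    return (concordant - discordant) / total_pairs

x = [1, 2, 3, 4]
y = [1, 3, 2, 4]              # one swapped pair out of six
tau = kendall_tau_a(x, y)
```

Here five of the six pairs are concordant and one is discordant, so τ = (5 - 1) / 6. The quadratic number of pairs makes this direct approach impractical for very large datasets.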
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for Phik.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | D | Z1 | Z2 | Z3 | Z4 | Z5 | Z6 | Z7 | Z8 | Z9 | Z10 | Z11 | Z12 | Z13 | Z14 | Z15 | Z16 | Z17 | target |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1.0 | 1.0 | 31.756531 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.720643 | 204.775208 | 4.128511 | 41.536762 | 1009.964417 | 70.513832 | 145.856323 | 240.984985 | 10.170771 | 2.0 | 2382.476074 |
| 1 | 0.0 | 0.0 | 44.626873 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 12.947697 | 324.135315 | 3.331762 | 73.896744 | 895.713806 | 114.100945 | 142.130829 | 571.043701 | 10.619975 | 0.0 | 135.560638 |
| 2 | 1.0 | 0.0 | 56.920208 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 12.805899 | 314.973969 | 3.708680 | 74.953270 | 1840.040649 | 63.579121 | 109.543335 | 299.952087 | 10.037480 | 0.0 | 2203.619629 |
| 3 | 0.0 | 0.0 | 53.490952 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 3.575025 | 358.394104 | 3.632483 | 43.917221 | 949.666931 | 108.459961 | 88.847923 | 308.771271 | 10.105953 | 1.0 | 1795.955200 |
| 4 | 0.0 | 0.0 | 41.375957 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 16.583513 | 427.360077 | 3.701448 | 52.308067 | 887.632812 | 91.052589 | 105.256760 | 326.045868 | 9.449902 | 1.0 | 1350.666992 |
| 5 | 1.0 | 0.0 | 52.508492 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 1.556410 | 360.000671 | 4.211871 | 90.175568 | 925.069458 | 100.229340 | 114.589958 | 208.419189 | 11.016337 | 0.0 | 1378.727173 |
| 6 | 0.0 | 1.0 | 47.043053 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 3.322889 | 261.094055 | 2.834660 | 24.327459 | 2025.253784 | 48.837574 | 133.741058 | 97.332848 | 13.795160 | 0.0 | 475.944000 |
| 7 | 0.0 | 1.0 | 56.240383 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 13.358179 | 261.446869 | 2.768006 | 91.510269 | 1064.851196 | 142.189285 | 322.775665 | 133.042984 | 12.093728 | 1.0 | 557.591492 |
| 8 | 0.0 | 1.0 | 37.047913 | 0.0 | 1.0 | 1.0 | 0.0 | 2.0 | 7.866230 | 263.554260 | 3.563874 | 78.839104 | 643.433411 | 78.586319 | 106.826172 | 459.255524 | 9.455293 | 0.0 | 1167.141724 |
| 9 | 1.0 | 1.0 | 35.824699 | 0.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.544455 | 227.592346 | 3.224297 | 33.839943 | 568.074158 | 141.423431 | 129.546249 | 251.202240 | 10.638258 | 1.0 | 3952.591553 |
Last rows
| | D | Z1 | Z2 | Z3 | Z4 | Z5 | Z6 | Z7 | Z8 | Z9 | Z10 | Z11 | Z12 | Z13 | Z14 | Z15 | Z16 | Z17 | target |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 999990 | 1.0 | 0.0 | 57.600700 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 1.651521 | 356.063873 | 3.749581 | 66.043999 | 1776.243652 | 90.391586 | 218.408691 | 479.914276 | 10.324666 | 2.0 | 2202.633789 |
| 999991 | 1.0 | 1.0 | 35.536068 | 0.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.095892 | 307.779907 | 2.987472 | 18.821051 | 995.429993 | 154.451889 | 58.629288 | 297.430511 | 10.017821 | 0.0 | 1142.842041 |
| 999992 | 0.0 | 0.0 | 53.056637 | 0.0 | 1.0 | 0.0 | 1.0 | 2.0 | 1.971753 | 318.203491 | 3.311176 | 66.536819 | 5109.550293 | 136.856445 | 111.354141 | 326.175476 | 9.746990 | 0.0 | 605.966980 |
| 999993 | 1.0 | 0.0 | 70.450714 | 0.0 | 1.0 | 1.0 | 1.0 | 1.0 | 0.775659 | 360.474792 | 3.787220 | 30.464809 | 3886.373535 | 61.957294 | 110.059967 | 315.754913 | 10.357409 | 3.0 | 3174.704346 |
| 999994 | 1.0 | 0.0 | 39.755569 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 20.853916 | 245.210144 | 3.759833 | 81.082932 | 944.224243 | 272.452515 | 257.899109 | 235.874023 | 9.908377 | 1.0 | 1689.774658 |
| 999995 | 1.0 | 0.0 | 54.648628 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 0.821357 | 369.859589 | 3.406111 | 49.064838 | 904.527100 | 78.700981 | 114.336380 | 242.742355 | 11.091323 | 0.0 | 1141.450806 |
| 999996 | 0.0 | 0.0 | 41.496292 | 1.0 | 1.0 | 0.0 | 1.0 | 1.0 | 0.999576 | 857.390686 | 3.929982 | 133.071335 | 1798.910034 | 250.418610 | 109.321976 | 267.243958 | 11.711025 | 3.0 | 2149.976562 |
| 999997 | 0.0 | 1.0 | 33.830952 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 13.906055 | 312.313538 | 3.545428 | 73.009895 | 1829.911377 | 155.488190 | 83.146812 | 195.696869 | 10.770503 | 1.0 | 1678.040161 |
| 999998 | 0.0 | 0.0 | 70.523148 | 1.0 | 1.0 | 1.0 | 0.0 | 2.0 | 10.128497 | 212.766754 | 3.663546 | 268.529114 | 1046.635132 | 86.703087 | 97.095116 | 239.903885 | 11.003921 | 0.0 | 543.393616 |
| 999999 | 1.0 | 0.0 | 62.188412 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 9.892191 | 404.375275 | 3.783149 | 121.017029 | 1322.640259 | 260.806000 | 120.693352 | 233.631790 | 10.618650 | 0.0 | 3439.468506 |