Dataset statistics
| Number of variables | 1 |
|---|---|
| Number of observations | 13880 |
| Missing cells | 2368 |
| Missing cells (%) | 17.1% |
| Duplicate rows | 2056 |
| Duplicate rows (%) | 14.8% |
| Total size in memory | 216.9 KiB |
| Average record size in memory | 16.0 B |
Variable types
| TimeSeries | 1 |
|---|
Timeseries statistics
| Number of series | 1 |
|---|---|
| Time series length | 13880 |
| Starting point | 1983-01-01 00:00:00 |
| Ending point | 2020-12-31 00:00:00 |
| Period | 1 day |
| Dataset has 2056 (14.8%) duplicate rows | Duplicates |
Flow has 2368 (17.1%) missing values | Missing |
Flow is non stationary | Non stationary |
Flow is seasonal | Seasonal |
Reproduction
| Analysis started | 2024-05-12 18:18:19.553262 |
|---|---|
| Analysis finished | 2024-05-12 18:18:21.056556 |
| Duration | 1.5 second |
| Missing | Q_Station_NA_28047050_ok_Missing.csv |
| Download configuration | config.json |
Flow
Numeric time series
MISSING  NON STATIONARY  SEASONAL 
| Distinct | 4559 |
|---|---|
| Distinct (%) | 39.6% |
| Missing | 2368 |
| Missing (%) | 17.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.432012 |
|---|---|
| Minimum | 0 |
| Maximum | 115.9 |
| Zeros | 22 |
| Zeros (%) | 0.2% |
| Memory size | 216.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.417 |
| Q1 | 2.24 |
| median | 7.464 |
| Q3 | 18.8325 |
| 95-th percentile | 48.0115 |
| Maximum | 115.9 |
| Range | 115.9 |
| Interquartile range (IQR) | 16.5925 |
Descriptive statistics
| Standard deviation | 15.484489 |
|---|---|
| Coefficient of variation (CV) | 1.1528048 |
| Kurtosis | 3.5431967 |
| Mean | 13.432012 |
| Median Absolute Deviation (MAD) | 6.014 |
| Skewness | 1.7911589 |
| Sum | 154629.33 |
| Variance | 239.76938 |
| Monotonicity | Not monotonic |
| Augmented Dickey-Fuller test p-value | 1.46416046 × 10-15 |
Histogram with fixed size bins (bins=50)
Gap statistics
| number of gaps | 48 |
|---|---|
| min | 3 days |
| max | 2 years and 4 days |
| mean | 7 weeks, 1 day and 1 hour |
| std | 21 weeks, 6 days and 18 hours |
| Value | Count | Frequency (%) |
| 1.67 | 43 | 0.3% |
| 0.3 | 34 | 0.2% |
| 0.83 | 32 | 0.2% |
| 1.27 | 30 | 0.2% |
| 1.7 | 29 | 0.2% |
| 1.95 | 29 | 0.2% |
| 1.32 | 28 | 0.2% |
| 0.38 | 27 | 0.2% |
| 2.12 | 27 | 0.2% |
| 1.78 | 25 | 0.2% |
| Other values (4549) | 11208 | |
| (Missing) | 2368 | 17.1% |
| Value | Count | Frequency (%) |
| 0 | 22 | |
| 0.01 | 1 | < 0.1% |
| 0.019 | 2 | < 0.1% |
| 0.02 | 2 | < 0.1% |
| 0.03 | 4 | < 0.1% |
| 0.038 | 9 | |
| 0.04 | 1 | < 0.1% |
| 0.05 | 2 | < 0.1% |
| 0.057 | 1 | < 0.1% |
| 0.06 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 115.9 | 1 | |
| 114.7 | 1 | |
| 114.2 | 1 | |
| 110.3 | 1 | |
| 109.8 | 1 | |
| 106.1 | 1 | |
| 104.4 | 1 | |
| 102 | 1 | |
| 101.9 | 1 | |
| 99.4 | 1 |
ACF and PACF
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
| Flow | |
|---|---|
| Date | |
| 1983-01-01 | 4.87 |
| 1983-01-02 | 5.00 |
| 1983-01-03 | 5.00 |
| 1983-01-04 | 4.87 |
| 1983-01-05 | 4.58 |
| 1983-01-06 | 4.02 |
| 1983-01-07 | 4.20 |
| 1983-01-08 | 3.86 |
| 1983-01-09 | 3.61 |
| 1983-01-10 | 3.52 |
| Flow | |
|---|---|
| Date | |
| 2020-12-22 | 5.9266 |
| 2020-12-23 | 5.5249 |
| 2020-12-24 | 4.9864 |
| 2020-12-25 | 4.5485 |
| 2020-12-26 | 4.1122 |
| 2020-12-27 | 3.5635 |
| 2020-12-28 | 3.2134 |
| 2020-12-29 | 2.7960 |
| 2020-12-30 | 2.4226 |
| 2020-12-31 | 2.2350 |
Most frequently occurring
| Flow | # duplicates | |
|---|---|---|
| 2055 | NaN | 2368 |
| 226 | 1.67 | 43 |
| 40 | 0.30 | 34 |
| 133 | 0.83 | 32 |
| 172 | 1.27 | 30 |
| 229 | 1.70 | 29 |
| 257 | 1.95 | 29 |
| 177 | 1.32 | 28 |
| 58 | 0.38 | 27 |
| 279 | 2.12 | 27 |