Dataset statistics
| Number of variables | 1 |
|---|---|
| Number of observations | 13880 |
| Missing cells | 2545 |
| Missing cells (%) | 18.3% |
| Duplicate rows | 1527 |
| Duplicate rows (%) | 11.0% |
| Total size in memory | 216.9 KiB |
| Average record size in memory | 16.0 B |
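The derived figures in this table can be cross-checked from the raw counts. The sketch below is only a sanity check on the numbers printed in this report, not the profiler's own code; the helper name `agrees` is hypothetical, and the date range is taken from the time series section of the report.

```python
from datetime import date

# Values copied from the report tables.
n_obs = 13880
missing_cells = 2545
duplicate_rows = 1527
record_bytes = 16.0


def agrees(computed, reported, decimals):
    """True if `computed` matches `reported` at the displayed precision."""
    return abs(computed - reported) <= 0.5 * 10 ** -decimals + 1e-12


# A daily series from 1983-01-01 to 2020-12-31, both endpoints included.
expected_days = (date(2020, 12, 31) - date(1983, 1, 1)).days + 1

checks = {
    "observations match the date range": expected_days == n_obs,
    "missing cells (%)": agrees(100 * missing_cells / n_obs, 18.3, 1),
    "duplicate rows (%)": agrees(100 * duplicate_rows / n_obs, 11.0, 1),
    "total size (KiB)": agrees(n_obs * record_bytes / 1024, 216.9, 1),
}

for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'MISMATCH'}")
```

The comparison uses a tolerance of half a unit in the last displayed digit instead of `round()`, because the report's rounding rule is not stated.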
Variable types
| TimeSeries | 1 |
|---|---|
Timeseries statistics
| Number of series | 1 |
|---|---|
| Time series length | 13880 |
| Starting point | 1983-01-01 00:00:00 |
| Ending point | 2020-12-31 00:00:00 |
| Period | 1 day |
Alerts
| Dataset has 1527 (11.0%) duplicate rows | Duplicates |
|---|---|
| Flow has 2545 (18.3%) missing values | Missing |
Reproduction
| Analysis started | 2024-05-12 19:35:41.655665 |
|---|---|
| Analysis finished | 2024-05-12 19:35:43.381604 |
| Duration | 1.73 seconds |
| Missing | Q_Station_NA_28047050_ok_Missing.csv |
| Download configuration | config.json |
Flow
Numeric time series
MISSING 
| Distinct | 8348 |
|---|---|
| Distinct (%) | 73.6% |
| Missing | 2545 |
| Missing (%) | 18.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.00054143979 |
| Minimum | -69 |
| Maximum | 63.8 |
| Zeros | 79 |
| Zeros (%) | 0.6% |
| Memory size | 216.9 KiB |
Quantile statistics
| Minimum | -69 |
|---|---|
| 5-th percentile | -7.573 |
| Q1 | -0.93 |
| median | 2.7755576 × 10⁻¹⁷ |
| Q3 | 0.979 |
| 95-th percentile | 7.513 |
| Maximum | 63.8 |
| Range | 132.8 |
| Interquartile range (IQR) | 1.909 |
Descriptive statistics
| Standard deviation | 5.8203853 |
|---|---|
| Coefficient of variation (CV) | -10749.829 |
| Kurtosis | 25.947656 |
| Mean | -0.00054143979 |
| Median Absolute Deviation (MAD) | 0.941 |
| Skewness | -0.16190039 |
| Sum | -6.13722 |
| Variance | 33.876885 |
| Monotonicity | Not monotonic |
| Augmented Dickey-Fuller test p-value | 0 |
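Several of the quantile and descriptive statistics are functions of one another, so they can be checked for internal consistency. The sketch below recomputes them from the reported mean, standard deviation, and extremes. It assumes that missing values are excluded from the mean and the sum, which the report does not state explicitly. It also shows why the coefficient of variation is so large: CV is the standard deviation divided by the mean, and the mean here is almost zero.

```python
import math

# Values copied from the report tables.
n_obs, n_missing = 13880, 2545
mean, std = -0.00054143979, 5.8203853
reported = {
    "variance": 33.876885,
    "cv": -10749.829,
    "sum": -6.13722,
    "range": 132.8,
    "iqr": 1.909,
}

# Assumption: only non-missing observations enter the mean and the sum.
n_valid = n_obs - n_missing

derived = {
    "variance": std ** 2,        # variance = std^2
    "cv": std / mean,            # coefficient of variation = std / mean
    "sum": mean * n_valid,       # sum = mean * number of valid values
    "range": 63.8 - (-69),       # maximum - minimum
    "iqr": 0.979 - (-0.93),      # Q3 - Q1
}

for name, value in derived.items():
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name:9s} derived={value:.6f} reported={reported[name]} match={ok}")
```

Because the mean is so close to zero, the CV carries little information for this series; the standard deviation or the MAD is a more useful measure of spread here.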
Histogram with fixed size bins (bins=50)
Gap statistics
| number of gaps | 50 |
|---|---|
| min | 5 days |
| max | 2 years and 1 week |
| mean | 7 weeks, 2 days and 20 hours |
| std | 21 weeks, 4 days and 7 hours |
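The report does not say how a gap is measured. One plausible reading, sketched below, is that a gap is the time between two consecutive non-missing observations that are more than one period apart, so its length is the run of missing days plus one. The function `find_gaps` and the toy dates are hypothetical illustrations, not the profiler's implementation.

```python
from datetime import date, timedelta


def find_gaps(observed, period=timedelta(days=1)):
    """Return (previous, current, length) for each pair of consecutive
    observation dates that are more than `period` apart.

    `observed` must be a sorted list of dates that have a value.
    """
    gaps = []
    for prev, cur in zip(observed, observed[1:]):
        if cur - prev > period:
            gaps.append((prev, cur, cur - prev))
    return gaps


# Hypothetical toy series: 1983-01-08 through 1983-01-11 have no value.
observed = [date(1983, 1, d) for d in (4, 5, 6, 7, 12, 13)]
gaps = find_gaps(observed)
print(len(gaps), gaps[0][2].days)  # one gap spanning 5 days (4 missing days)

# Consistency check against the report under the same assumption.
# The three leading NaN rows (1983-01-01 to 1983-01-03) precede the first
# observation, so they are not bracketed by two observations.
n_gaps, missing, leading_missing = 50, 2545, 3
mean_gap_days = (missing - leading_missing + n_gaps) / n_gaps
print(mean_gap_days)  # 51.84 days, i.e. about 7 weeks, 2 days and 20 hours
```

Under this reading the reported mean gap agrees with the missing-value count, which suggests, without proving it, that the interior missing values all fall inside the 50 gaps.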
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 79 | 0.6% |
| -0.08 | 23 | 0.2% |
| 0.08 | 21 | 0.2% |
| -0.09 | 20 | 0.1% |
| -0.02 | 16 | 0.1% |
| -0.12 | 14 | 0.1% |
| -0.01 | 13 | 0.1% |
| -0.04 | 13 | 0.1% |
| -4.440892099 × 10⁻¹⁶ | 13 | 0.1% |
| 0.02 | 12 | 0.1% |
| Other values (8338) | 11111 | |
| (Missing) | 2545 | 18.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -69 | 1 | |
| -63 | 1 | |
| -62.86 | 1 | |
| -58.47 | 1 | |
| -54.34 | 1 | |
| -54.2 | 1 | |
| -53.8 | 1 | |
| -52.07 | 1 | |
| -52 | 1 | |
| -50.38 | 1 |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 63.8 | 1 | |
| 61.55 | 1 | |
| 59.41 | 1 | |
| 58.02 | 1 | |
| 56.63 | 1 | |
| 52.1 | 1 | |
| 51.97 | 1 | |
| 51.47 | 1 | |
| 50.8 | 1 | |
| 47.67 | 1 |
ACF and PACF
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
First rows
| Date | Flow |
|---|---|
| 1983-01-01 | NaN |
| 1983-01-02 | NaN |
| 1983-01-03 | NaN |
| 1983-01-04 | 0.00 |
| 1983-01-05 | -0.03 |
| 1983-01-06 | -0.11 |
| 1983-01-07 | 1.01 |
| 1983-01-08 | -1.26 |
| 1983-01-09 | 0.61 |
| 1983-01-10 | 0.07 |
Last rows
| Date | Flow |
|---|---|
| 2020-12-22 | -0.2046 |
| 2020-12-23 | 0.3213 |
| 2020-12-24 | -0.3444 |
| 2020-12-25 | 0.2374 |
| 2020-12-26 | -0.0990 |
| 2020-12-27 | -0.1140 |
| 2020-12-28 | 0.3110 |
| 2020-12-29 | -0.2659 |
| 2020-12-30 | 0.1113 |
| 2020-12-31 | 0.1418 |
Most frequently occurring
| | Flow | # duplicates |
|---|---|---|
| 1526 | NaN | 2545 |
| 761 | 0.000000e+00 | 79 |
| 669 | -8.000000e-02 | 23 |
| 854 | 8.000000e-02 | 21 |
| 663 | -9.000000e-02 | 20 |
| 734 | -2.000000e-02 | 16 |
| 631 | -1.200000e-01 | 14 |
| 708 | -4.000000e-02 | 13 |
| 749 | -1.000000e-02 | 13 |
| 758 | -4.440892e-16 | 13 |