Here is the documentation for RollingPoly2, with plots to illustrate the behavior for each derivative_order (0, 1, and 2), showing the endpoint, slope, and curvature of the fitted quadratic function within each window.
RollingPoly2
Description
The RollingPoly2 class fits a quadratic polynomial (second-degree equation) to the data within a specified moving window. This rolling quadratic fit provides insights into the data’s overall trend, rate of change, and curvature. Depending on the derivative_order, the function can return:
0: The y-value at the endpoint of the fitted quadratic curve, offering a smooth continuation of the data.
1: The slope (first derivative) at the endpoint, representing the rate of change.
2: The curvature (second derivative) at the endpoint, which captures the concavity of the data.
This method, which extends the idea of a causal Savitzky-Golay filter, enables more nuanced data analysis by capturing both linear and quadratic trends within each window.
Initial values: The constructor requires a positive integer window_size to define the rolling window and an integer derivative_order (0, 1, or 2) to specify the type of output.
NaN handling: NaN values are not handled natively and should be preprocessed if necessary.
Usage Example and Plot
import numpy as np
import plotly.graph_objects as go
from plotly.subplots import make_subplots
from screamer import RollingPoly2
# Generate example data
data = np.cumsum(np.random.normal(size=300))
# Create subplots with specified row heights and shared x-axis
fig = make_subplots(
rows=3, cols=1,
shared_xaxes=True,
row_heights=[1/3, 1/3, 1/3],
vertical_spacing=0.1
)
# RollingPoly2 for endpoint (derivative_order=0)
endpoint_data = RollingPoly2(window_size=30, derivative_order=0)(data)
# RollingPoly2 for slope (derivative_order=1)
slope_data = RollingPoly2(window_size=30, derivative_order=1)(data)
# RollingPoly2 for curvature (derivative_order=2)
curvature_data = RollingPoly2(window_size=30, derivative_order=2)(data)
# Add traces for input data and results with different derivative orders
fig.add_trace(go.Scatter(y=data, mode='lines', name='Input Data'), row=1, col=1)
fig.add_trace(go.Scatter(y=endpoint_data, mode='lines', name='Rolling Endpoint (Order 0)', line=dict(color='green')), row=1, col=1)
fig.add_trace(go.Scatter(y=slope_data, mode='lines', name='Rolling Slope (Order 1)', line=dict(color='blue')), row=2, col=1)
fig.add_trace(go.Scatter(y=curvature_data, mode='lines', name='Rolling Curvature (Order 2)', line=dict(color='purple')), row=3, col=1)
# Update layout with titles and axis labels
fig.update_layout(
title="RollingPoly2 with Window Size 30",
xaxis_title="Index",
yaxis=dict(title="Input Data and endpoint"),
yaxis2=dict(title="RollingPoly2 Slope"),
yaxis3=dict(title="RollingPoly2 Curvature"),
margin=dict(l=20, r=20, t=120, b=20),
legend=dict(orientation="h", yanchor="bottom", y=1.02, xanchor="right", x=1)
)
fig.show()
Implementation Details
Algorithm
RollingPoly2 uses least-squares regression to fit a quadratic function within each window. The class allows returning either the endpoint value, the slope, or the curvature at the endpoint, based on derivative_order. This fitting method effectively captures both linear and non-linear trends, making it suitable for applications needing insights into the rate of change and concavity within the data.
Complexity
Time Complexity: O(1) per element.
Space Complexity: O(window_size), since only elements within the current window are stored.
Performance
RollingPoly2is hundreds times faster thanPandas rolling apply, and the speed is constant, indepenent of the windows size.