Facebook Prophet: Quick Start

Python API

Prophet follows the sklearn model API. We create an instance of the Prophet class and then call its fit and predict methods.

The input to Prophet is always a dataframe with two columns: ds and y. The ds (datestamp) column should be of a format expected by Pandas, ideally YYYY-MM-DD for a date or YYYY-MM-DD HH:MM:SS for a timestamp. The y column must be numeric, and represents the measurement we wish to forecast.

As an example, let's look at a time series of the log daily page views for the Wikipedia page for Peyton Manning. We scraped this data using the Wikipediatrend package in R. Peyton Manning provides a nice example because it illustrates some of Prophet's features, like multiple seasonality, changing growth rates, and the ability to model special days (such as Manning's playoff and superbowl appearances). The CSV is available here.

First we'll import the data:

example_wp_log_peyton_manning.csv
import pandas as pd
from fbprophet import Prophet
df = pd.read_csv(
example_wp_log_peyton_manning.csv
) df.head()
dsy
02007-12-109.59076113897809
12007-12-118.51959031601596
22007-12-128.18367658262066
32007-12-138.072467369354769
42007-12-147.893572073504901
5 items

We fit the model by instantiating a new Prophet object. Any settings to the forecasting procedure are passed into the constructor. Then you call its fit method and pass in the historical dataframe. Fitting should take 1-5 seconds.

m = Prophet()
m.fit(df)
<fbprophet.fo...x7f1059f6ba58>

Predictions are then made on a dataframe with a column ds containing the dates for which a prediction is to be made. You can get a suitable dataframe that extends into the future a specified number of days using the helper method Prophet.make_future_dataframe. By default it will also include the dates from the history, so we will see the model fit as well.

future = m.make_future_dataframe(periods=365)
future.tail()
ds
32652017-01-15
32662017-01-16
32672017-01-17
32682017-01-18
32692017-01-19
5 items

The predict method will assign each row in future a predicted value which it names yhat. If you pass in historical dates, it will provide an in-sample fit. The forecast object here is a new dataframe that includes a column yhat with the forecast, as well as columns for components and uncertainty intervals.

forecast = m.predict(future)
forecast[['ds', 'yhat', 'yhat_lower', 'yhat_upper']].tail()
dsyhatyhat_loweryhat_upper
32652017-01-158.1992742937171057.4601775176570748.919221425036369
32662017-01-168.5242441827340187.7740245654901569.28191492403284
32672017-01-178.3116152104278067.5290063792424448.984707889038791
32682017-01-188.1442320209075557.4195452716420018.846707353890087
32692017-01-198.1560910169393487.4025786040082578.889815113112816
5 items

You can plot the forecast by calling the Prophet.plot method and passing in your forecast dataframe.

fig1 = m.plot(forecast)
fig1

If you want to see the forecast components, you can use the Prophet.plot_components method. By default you'll see the trend, yearly seasonality, and weekly seasonality of the time series. If you include holidays, you'll see those here, too.

fig2 = m.plot_components(forecast)
fig2

An interactive figure of the forecast can be created with plotly. You will need to install plotly separately, as it will not by default be installed with fbprophet.

from fbprophet.plot import plot_plotly
import plotly.offline as py
py.init_notebook_mode()

fig = plot_plotly(m, forecast)  # This returns a plotly Figure
py.iplot(fig)

More details about the options available for each method are available in the docstrings, for example, via help(Prophet) or help(Prophet.fit). The R reference manual on CRAN provides a concise list of all of the available functions, each of which has a Python equivalent.

R API

In R, we use the normal model fitting API. We provide a prophet function that performs fitting and returns a model object. You can then call predict and plot on this model object.

library(prophet)
library(ggplot2) # Required for Nextjournal plotting

First we read in the data and create the outcome variable. As in the Python API, this is a dataframe with columns ds and y, containing the date and numeric value respectively. The ds column should be YYYY-MM-DD for a date, or YYYY-MM-DD HH:MM:SS for a timestamp. As above, we use here the log number of views to Peyton Manning's Wikipedia page, available here.

df <- read.csv(
example_wp_log_peyton_manning.csv
)

We call the prophet function to fit the model. The first argument is the historical dataframe. Additional arguments control how Prophet fits the data and are described in later pages of this documentation.

m <- prophet(df)

Predictions are made on a dataframe with a column ds containing the dates for which predictions are to be made. The make_future_dataframe function takes the model object and a number of periods to forecast and produces a suitable dataframe. By default it will also include the historical dates so we can evaluate in-sample fit.

future <- make_future_dataframe(m, periods = 365)
tail(future)
0 items

As with most modeling procedures in R, we use the generic predict function to get our forecast. The forecast object is a dataframe with a column yhat containing the forecast. It has additional columns for uncertainty intervals and seasonal components.

forecast <- predict(m, future)
tail(forecast[c('ds', 'yhat', 'yhat_lower', 'yhat_upper')])
0 items

You can use the generic plot function to plot the forecast, by passing in the model and the forecast dataframe.

plot(m, forecast)

You can use the prophet_plot_components function to see the forecast broken down into trend, weekly seasonality, and yearly seasonality.

prophet_plot_components(m, forecast)

An interactive plot of the forecast using Dygraphs can be made with the command dyplot.prophet(m, forecast).

More details about the options available for each method are available in the docstrings, for example, via ?prophet or ?fit.prophet. This documentation is also available in the reference manual on CRAN.