Skip to content

RazvanO2/Household_Electricity_Forecasting

Repository files navigation

The data in forecasting_test.csv consists of data on household electricity consumption.

The variables are as follows:
- hh-id: unique household id
- year: 2010 and 2011
- month: 4-8
- zipcode: anonymized zipcode in which home is located
- mozip: location variable derived from the interaction of zipcode with year and month. proxies for local humidity/temperature
- lusage: log(kwh) log of monthly electricity consumption
- lusage1-6: log(kwh) for April - September of 2009 (ie pre-sample period)
- children: household has children
- hhsize2-5plus: household size
- income2-9: income categories <$20k, $20-30k, $30-40k, $40-50k, $50-75k, $75-100k, $100-125k, >$125k
- owner: resident owns home
- size: size of the residence (in sqft)

Note: 
household electricity usage is based on billing records
household demographics were purchased from a third party data aggregator


Objective:

The aim of this mini-project is to develop a forecasting model to predict future electricity usage (lusage) for each household based on the available data.

1. Prepare Data: Create time-series features, considering past usage, trends, and seasonal effects.
2. Modeling: Build and evaluate forecasting models.
3. Forecasting: Generate predictions with confidence intervals to assess uncertainty.

Note on Missing Data & Imputation:
* Handle any missing data, as it may impact model accuracy. 
* Consider the impact of different imputation methods on the forecasting accuracy.


About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published