While hydrological models generally rely on continuous streamflow data for calibration, previous studies have shown that a few measurements can be sufficient to constrain model parameters. Other studies have shown that continuous water level or water level class (WL‐class) data can be informative for model calibration. In this study, we combined these approaches and explored the potential value of a limited number of WL‐class observations for calibration of a bucket‐type runoff model (HBV) for four catchments in Switzerland. We generated synthetic data to represent citizen science data and examined the effects of the temporal resolution of the observations, the numbers of WL‐classes, and the magnitude of the errors in the WL‐class data on the model validation performance. Our results indicate that on average one observation per week for a one‐year period can significantly improve model performance compared to the situation without any streamflow data. Furthermore, the validation performance for model parameters calibrated with WL‐class observations was similar to the performance of the calibration with precise water level measurements. The number of WL‐classes did not influence the validation performance noticeably when at least four WL‐classes were used. The impact of typical errors for citizen‐science‐based estimates of WL‐classes on the model performance was small. These results are encouraging for citizen science projects where citizens observe water levels for otherwise ungauged streams using virtual or physical staff gauges.