Why cannabis datasets will give you a bad (data) trip