Deep dive into how to make a reliable dataset, the problems that might occur along the way, and solutions to them