The elephant in the machine: Proposing a new metric of data reliability and its application to a medical case to assess classification reliability