Towards Evidence-Based Testability Measurements