Assessing the Reliability of a Multi-Class Classifier