"Exact and Consistent Interpretation for Piecewise Linear Neural Networ" by Lingyang Chu, Xia Hu et al.

School of Engineering and Technology Publications

Title

Exact and Consistent Interpretation for Piecewise Linear Neural Networks: A Closed Form Solution

Authors

Lingyang Chu
Xia Hu
Juhua Hu, University of Washington TacomaFollow
Lanjun Wang
Jian Pei

Publication Date

2018

Document Type

Conference Proceeding

Abstract

Strong intelligent machines powered by deep neural networks are increasingly deployed as black boxes to make decisions in risk-sensitive domains, such as finance and medical. To reduce potential risk and build trust with users, it is critical to interpret how such machines make their decisions. Existing works interpret a pre-trained neural network by analyzing hidden neurons, mimicking pre-trained models or approximating local predictions. However, these methods do not provide a guarantee on the exactness and consistency of their interpretations. In this paper, we propose an elegant closed form solution named $OpenBox$ to compute exact and consistent interpretations for the family of Piecewise Linear Neural Networks (PLNN). The major idea is to first transform a PLNN into a mathematically equivalent set of linear classifiers, then interpret each linear classifier by the features that dominate its prediction. We further apply $OpenBox$ to demonstrate the effectiveness of non-negative and sparse constraints on improving the interpretability of PLNNs. The extensive experiments on both synthetic and real world data sets clearly demonstrate the exactness and consistency of our interpretation.

Publication Title

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

First Page

1244

Last Page

1253

DOI

10.1145/3219819.3220063

Recommended Citation

Chu, Lingyang; Hu, Xia; Hu, Juhua; Wang, Lanjun; and Pei, Jian, "Exact and Consistent Interpretation for Piecewise Linear Neural Networks: A Closed Form Solution" (2018). School of Engineering and Technology Publications. 200.
https://digitalcommons.tacoma.uw.edu/tech_pub/200

This document is currently not available here.

Find in your library

COinS

UW Tacoma Digital Commons

School of Engineering and Technology Publications

Title

Authors

Publication Date

Document Type

Abstract

Publication Title

First Page

Last Page

DOI

Recommended Citation

Browse

Author Corner

Links

SelectedWorks Sites

UW Tacoma Digital Commons

School of Engineering and Technology Publications

Title

Authors

Publication Date

Document Type

Abstract

Publication Title

First Page

Last Page

DOI

Recommended Citation

Share

Browse

Author Corner

Links

SelectedWorks Sites