Support applying Softmax over multidimensional tensors
Created by: vloncar
The current softmax implementations produce incorrect results when applied to multidimensional tensors (as used in, e.g., the ENet model). The implementations in this PR handle this case correctly for axis=-1 (the default).
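
For reference, the intended axis=-1 behavior can be sketched in NumPy. This is only an illustration, not the code in this PR: the function names `softmax_last_axis` and `softmax_flattened` are hypothetical, and the "flattened" variant is an assumed example of how an implementation can go wrong on multidimensional input, not a description of the previous code.

```python
import numpy as np

def softmax_last_axis(x):
    """Softmax applied independently to each slice along the last axis."""
    # Subtract the per-slice maximum for numerical stability.
    shifted = x - np.max(x, axis=-1, keepdims=True)
    e = np.exp(shifted)
    return e / np.sum(e, axis=-1, keepdims=True)

def softmax_flattened(x):
    """Assumed failure mode: normalizes over the whole tensor at once."""
    e = np.exp(x - np.max(x))
    return e / np.sum(e)

# A small 3-D tensor, e.g. (height, width, channels).
x = np.arange(24, dtype=np.float64).reshape(2, 3, 4)

y = softmax_last_axis(x)       # each length-4 slice sums to 1
y_bad = softmax_flattened(x)   # only the whole tensor sums to 1
```

With axis=-1, every slice along the last dimension is normalized on its own, so `y.sum(axis=-1)` is all ones, whereas the flattened variant normalizes only the tensor as a whole.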