-
Notifications
You must be signed in to change notification settings - Fork 108
Open
Description
Thanks for your impressive work! After reading your code and paper, I have some questions about the fusion design. Referring to the SENET, they implement the self-attention by global pooling, and two Convs to set the channel-wise descriptor to C1. While, in your paper, you created two matrices to change the dimension back to C1. Why? How about using two single Convs to change the dimension to C*1? Especially, it looks like more efficient.
Metadata
Metadata
Assignees
Labels
No labels