Issue Date | Title | Author(s) |
2018-01-01 | Bi-directional block self-attention for fast and memory-efficient sequence modeling | Shen, T; Zhou, T; Long, G; Jiang, J; Zhang, C |
2018-01-01 | Reinforced Self-Attention Network: a Hybrid of Hard and Soft Attention for Sequence Modeling | Shen, T; Zhou, T; Long, G; Jiang, J; Wang, S; Zhang, C |
2019-06-07 | Tensorized Self-Attention: Efficiently Modeling Pairwise and Global Dependencies Together | Shen, T; Zhou, T; Long, G; Jiang, J; Zhang, C |
2019-10-11 | Multi-Task Learning for Conversational Question Answering over a Large-Scale Knowledge Base | Shen, T; Geng, X; Qin, T; Guo, D; Tang, D; Duan, N; Long, G; Jiang, D |
2019-11-07 | Multi-Task Learning for Conversational Question Answering over a Large-Scale Knowledge Base | Shen, T; Long, G; Geng, X; Qin, T; Guo, D; Tang, D; Jiang, D |
2020-11-17 | BiteNet: Bidirectional Temporal Encoder Network to Predict Medical Outcomes | Peng, X; Long, G; Shen, T; Wang, S; Jiang, J; Zhang, C |
2022-01-01 | Hierarchical Relation-Guided Type-Sentence Alignment for Long-Tail Relation Extraction with Distant Supervision | Li, Y; Long, G; Shen, T; Jiang, J |
2022-04-25 | EventBERT: A Pre-Trained Model for Event Correlation Reasoning | Zhou, Y; Geng, X; Shen, T; Long, G; Jiang, D |