A Hybrid Deep Model for Learning to Rank Data Tables

Mohamed Trabelsi, Zhiyu Chen, Brian D. Davison and Jeff Heflin.

Short Paper (8 pages)
Official IEEE published version: https://doi.ieeecomputersociety.org/10.1109/BigData50022.2020.9378185
Author's version: PDF (519KB)

Abstract
We address the problem of ad hoc table retrieval via a new neural architecture that incorporates both semantic and relevance matching. Understanding the connection between the structured form of a table and query tokens is an important yet neglected problem in information retrieval. We use a learning-to-rank approach to train a system to capture semantic and relevance signals within interactions between the structured form of candidate tables and query tokens. Convolutional filters that extract contextual features from query/table interactions are combined with a feature vector based on the distributions of term similarity between queries and tables. We propose using row and column summaries to incorporate table content into our new neural model. We evaluate our approach using two datasets, and we demonstrate substantial improvements in terms of retrieval metrics over state-of-the-art methods in table retrieval and document retrieval, as well as over neural architectures from sentence, document, and table type classification adapted to the table retrieval task. Our ablation study supports the importance of both semantic and relevance matching in table retrieval.
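
As a rough illustration of what "a feature vector based on the distributions of term similarity between queries and tables" could look like, the sketch below bins the cosine similarities between every query-term vector and every table-term vector into a normalized histogram. This is not the authors' implementation: the function names (`cosine`, `similarity_histogram`), the equal-width binning over [-1, 1], and the toy two-dimensional vectors are all assumptions made for the example, and the paper may compute its features differently.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def similarity_histogram(query_vecs, table_vecs, bins=5):
    """Hypothetical helper: normalized histogram of query/table term similarities.

    Similarities lie in [-1, 1] and are placed into `bins` equal-width buckets.
    The returned list sums to 1 when at least one term pair exists.
    """
    counts = [0] * bins
    for q in query_vecs:
        for t in table_vecs:
            s = cosine(q, t)
            # Map [-1, 1] onto bucket indices 0..bins-1, clamping the edges.
            idx = int((s + 1.0) / 2.0 * bins)
            idx = max(0, min(idx, bins - 1))
            counts[idx] += 1
    total = sum(counts)
    if total == 0:
        return [0.0] * bins
    return [c / total for c in counts]


# Toy embeddings (assumed for illustration only).
query_vecs = [[1.0, 0.0]]
table_vecs = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
hist = similarity_histogram(query_vecs, table_vecs, bins=4)
print(hist)
```

In a hybrid model of the kind the abstract describes, a fixed-length vector like `hist` would be concatenated with features learned by convolutional filters before the final ranking layer; how the two are actually combined is specified in the paper, not here.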

In Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), pages 979-986, December 2020. DOI: 10.1109/BigData50022.2020.9378185.

Last modified: 22 November 2021
Brian D. Davison