Hello. Sign in to personalize your visit. New user? Register now.


 
 

Data Exploration Using Example-Based Methods

Synthesis Lectures on Data Management

Matteo Lissandrini​‌
Aalborg University
Davide Mottin​‌
Aarhus University
Themis Palpanas​‌
Paris Descartes University
Yannis Velegrakis​‌
University of Trento

Abstract

Data usually comes in a plethora of formats and dimensions, rendering the information extraction and exploration processes challenging. Thus, being able to perform exploratory analyses of the data with the intent of having an immediate glimpse of some of the data properties is becoming crucial. Exploratory analyses should be simple enough to avoid complicated declarative languages (such as SQL) and mechanisms, while at the same time retaining the flexibility and expressiveness of such languages. Recently, we have witnessed a rediscovery of the so-called example-based methods, in which the user, or analyst, circumvents query languages by using examples as input. An example is a representative of the intended results or, in other words, an item from the result set. Example-based methods exploit inherent characteristics of the data to infer the results that the user has in mind but may not be able to (easily) express. They can be useful in cases where a user is looking for information in an unfamiliar dataset, when they are performing a particularly challenging task like finding duplicate items, or when they are simply exploring the data. In this book, we present an excursus over the main methods for exploratory analysis, with a particular focus on example-based methods. We show how different data types require different techniques and present algorithms that are specifically designed for relational, textual, and graph data. The book also presents the challenges and new frontiers of machine learning in online settings that have recently attracted the attention of the database community. The book concludes with a vision for further research and applications in this area.

Table of Contents: Preface / Acknowledgments / Introduction / Relational Data / Graph Data / Textual Data / Unifying Example-Based Approaches / Online Learning / The Road Ahead / Conclusions / Bibliography / Authors' Biographies

PDF (7939 KB) PDF Plus (8106 KB)

Cited by

Yi Yang, Meng Li, Jian Wang, Weixing Huang, Yun Wang​‌. (2022) Entity Recommendation With Negative Feedback Memory Networks for Topic-Oriented Knowledge Graph Exploration. IEEE Transactions on Reliability 71:2, 788-802.
Online publication date: 1-Jun-2022.
Crossref
Xiangyu Wang, Lyuzhou Chen, Taiyu Ban, Muhammad Usman, Yifeng Guan, Shikang Liu, Tianhao Wu, Huanhuan Chen​‌. (2021) Knowledge Graph Quality Control: A Survey. Fundamental Research volumeĀ 28.
Online publication date: 1-Oct-2021.
Crossref
Xiangyu Wang, Lyuzhou Chen, Taiyu Ban, Muhammad Usman, Yifeng Guan, Shikang Liu, Tianhao Wu, Huanhuan Chen​‌. (2021) Knowledge graph quality control: A survey. Fundamental Research 1:5, 607-626.
Online publication date: 1-Sep-2021.
Crossref
Maria Jihan Sangil​‌. (2020) Exploratory Data Analysis of Government Procurement Data to Influence Bidding Decision and Strategy in Albay, Philippines. 2020 IEEE International Symposium on Technology and Society (ISTAS), 235-245.
Crossref
 



Prev. lecture | Next lecture
View/Print PDF (7939 KB)
View PDF Plus (8106 KB)
Add to favorites
Email to a friend
TOC Alert | Citation Alert What is RSS?

Quick Search
for
Authors:
Matteo Lissandrini
Davide Mottin
Themis Palpanas
Yannis Velegrakis
Keywords:
search by example
data exploration
information retrieval
data management
Home | Synthesis | Search | Profile | Access | Author | Help | About
Technology Partner - Atypon Systems, Inc.