Data Integration Landscapes The Case for Non-optimal Solutions in Network Diffusion Models
| Authors | |
|---|---|
| Publication date | 2023 |
| Host editors |
|
| Book title | Computational Science – ICCS 2023 |
| Book subtitle | 23rd International Conference, Prague, Czech Republic, July 3–5, 2023 : proceedings |
| ISBN |
|
| ISBN (electronic) |
|
| Series | Lecture Notes in Computer Science |
| Event | 23rd International Conference on Computational Science, ICCS 2023 |
| Volume | Issue number | I |
| Pages (from-to) | 494-508 |
| Number of pages | 15 |
| Publisher | Cham: Springer |
| Organisations |
|
| Abstract |
The successful application of computational models presupposes access to accurate, relevant, and representative datasets. The growth of public data, and the increasing practice of data sharing and reuse, emphasises the importance of data provenance and increases the need for modellers to understand how data processing decisions might impact model output. One key step in the data processing pipeline is that of data integration and entity resolution, where entities are matched across disparate datasets. In this paper, we present a new formulation of data integration in complex networks that incorporates integration uncertainty. We define an approach for understanding how different data integration setups can impact the results of network diffusion models under this uncertainty, allowing one to systematically characterise potential model outputs in order to create an output distribution that provides a more comprehensive picture. |
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.1007/978-3-031-35995-8_35 |
| Other links | https://www.scopus.com/pages/publications/85164940830 |
| Downloads |
978-3-031-35995-8_35
(Final published version)
|
| Permalink to this page | |
