The Four Generations of Entity Resolution

作者： Papadakis, George,Ioannou, Ekaterini,Thanos, Emanouil
原文出版社：Springer
出版日期：2021/03/16
語言：英文

定價：3599元

分期價：(除不盡餘數於第一期收取) 分期說明

可接受VISA, Master, JCB, 聯合信用卡

3期0利率	每期1199	6期0利率	每期599

運送方式：
臺灣與離島
海外

可配送點：台灣、蘭嶼、綠島、澎湖、金門、馬祖
可取貨點：台灣、蘭嶼、綠島、澎湖、金門、馬祖

載入中...

我要評鑑

分享

內容簡介

Entity Resolution (ER) lies at the core of data integration and cleaning and, thus, a bulk of the research examines ways for improving its effectiveness and time efficiency. The initial ER methods primarily target Veracity in the context of structured (relational) data that are described by a schema of well-known quality and meaning. To achieve high effectiveness, they leverage schema, expert, and/or external knowledge. Part of these methods are extended to address Volume, processing large datasets through multi-core or massive parallelization approaches, such as the MapReduce paradigm. However, these early schema-based approaches are inapplicable to Web Data, which abound in voluminous, noisy, semi-structured, and highly heterogeneous information. To address the additional challenge of Variety, recent works on ER adopt a novel, loosely schema-aware functionality that emphasizes scalability and robustness to noise. Another line of present research focuses on the additional challenge ofVelocity, aiming to process data collections of a continuously increasing volume. The latest works, though, take advantage of the significant breakthroughs in Deep Learning and Crowdsourcing, incorporating external knowledge to enhance the existing words to a significant extent. This synthesis lecture organizes ER methods into four generations based on the challenges posed by these four Vs. For each generation, we outline the corresponding ER workflow, discuss the state-of-the-art methods per workflow step, and present current research directions. The discussion of these methods takes into account a historical perspective, explaining the evolution of the methods over time along with their similarities and differences. The lecture also discusses the available ER tools and benchmark datasets that allow expert as well as novice users to make use of the available solutions.

作者簡介

George Papadakis is a research fellow at the National and Kapodistrian University of Athens, Greece. He also worked at the NCSR Demokritos, National Technical University of Athens (NTUA), L3S Research Center, and ＂Athena＂ Research Center. He holds a Ph.D. in Computer Science from the University of Hanover and a Diploma in Electrical Computer Engineering from NTUA. His research interest focuses on web data mining.Ekaterini Ioannou is an Assistant Professor at Tilburg University, the Netherlands. Prior, she worked as an Assistant Professor at Eindhoven University of Technology, as a Lecturer at the Open University of Cyprus, an adjunct faculty at EPFL in Switzerland, a research collaborator at the Technical University of Crete, and as an Independent Expert for the European Commission. Her research focuses on information integration with an emphasis on the challenges of man aging data with uncertainties, heterogeneity or correlations, and, more recently, on achieving a deeper integration of information extraction tasks within databases, and on efficiently retrieving analytics over graphs/hypergraphs with evolving data.Emanouil Thanos is a Ph.D. candidate at CODeS research group of KU Leuven, under the supervision of Prof. Greet Vanden Berghe. He holds a Diploma in Electrical and Computer Engineering from the National Technical University of Athens and a joint Master in Com putational Logic from TU Dresden, FU Bolzano, and UN Lisbon. He has also worked as a research associate at National ICT Australia and the University of Athens. His research inter ests focus on combinatorial optimization and operations research.Themis Palpanas is Senior Member of the French University Institute (IUF), and Professor of Computer Science at the University of Paris (France) where he is director of the Data Intel ligence Institute of Paris (diiP), and of the Data Intensive and Knowledge Oriented Systems (diNo) group. He is the author of two French patents and nine U.S. patents, three of which have been implemented in world-leading commercial data management products. He is the recipient of three Best Paper awards and the IBM Shared University Research (SUR) Award. He is currently serving in the Board of Trustees for the Very Large Data Bases (VLDB) Endowment, as Editor in Chief for BDR Journal, Editorial Advisory Board member for IS Journal, and in the Senior Program Committee of SIGMOD 2021.

詳細資料

ISBN：9783031007507
規格：平裝 / 172頁 / 23.5 x 19.05 x 0.94 cm / 普通級 / 初版
出版地：美國

本書分類：自然科普> 電腦> 計算機概論
本書分類：自然科普> 電腦> 訊息資料處理與管理

主題活動

閱讀媒合所｜寫書評送25元E-Coupon

購物說明

外文館商品版本：商品之書封，為出版社提供之樣本。實際出貨商品，以出版社所提供之現有版本為主。關於外文書裝訂、版本上的差異，請參考【外文書的小知識】。

調貨時間：無庫存之商品，在您完成訂單程序之後，將以空運的方式為您下單調貨。原則上約14~20個工作天可以取書(若有將延遲另行告知)。為了縮短等待的時間，建議您將外文書與其它商品分開下單，以獲得最快的取貨速度，但若是海外專案進口的外文商品，調貨時間約1~2個月。

若您具有法人身份為常態性且大量購書者，或有特殊作業需求，建議您可洽詢「企業採購」。

退換貨說明

會員所購買的商品均享有到貨十天的猶豫期（含例假日）。退回之商品必須於猶豫期內寄回。

辦理退換貨時，商品必須是全新狀態與完整包裝(請注意保持商品本體、配件、贈品、保證書、原廠包裝及所有附隨文件或資料的完整性，切勿缺漏任何配件或損毀原廠外盒)。退回商品無法回復原狀者，恐將影響退貨權益或需負擔部分費用。

訂購本商品前請務必詳閱商品退換貨原則。

選擇語言

:::網站搜尋

The Four Generations of Entity Resolution

內容簡介

作者簡介

詳細資料

主題活動

最近瀏覽商品

相關活動

購物說明

同類商品新上架

本類新品熱銷

本類暢銷榜

得獎認證