
Large Scale Fuzzy Name Matching (Zhe Sun & Daniel van der Ende)

9,395 views

Databricks


Zhe Sun is currently a senior data scientist on the ING Wholesale Banking Advanced Analytics team, where he has applied machine learning techniques to problems ranging from entity matching to large-scale payment transaction network analysis. Daniel van der Ende is currently a data engineer on the same team.
ING Bank is a Dutch multinational, multi-product bank that offers banking services to 33 million retail and commercial customers in over 40 countries. At this scale, ING faces many data consolidation tasks across its disparate sources. A common one is fuzzy name matching: given a single name (streaming) or a list of names (batch), find the most similar name(s) in a different list.
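To make the problem statement concrete, here is a minimal sketch of the single-name case in Python. It is not the method presented in the talk: it assumes character trigram counts compared by cosine similarity, and the function names (`ngrams`, `cosine`, `best_matches`) and the sample names are hypothetical, chosen only for illustration.

```python
from collections import Counter
from math import sqrt


def ngrams(name: str, n: int = 3) -> Counter:
    """Count character n-grams of a lowercased, whitespace-normalized name."""
    text = " ".join(name.lower().split())
    padded = f" {text} "  # pad so that word boundaries produce n-grams too
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse n-gram count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm = sqrt(sum(c * c for c in a.values())) * sqrt(sum(c * c for c in b.values()))
    return dot / norm if norm else 0.0


def best_matches(query: str, candidates: list[str], top_k: int = 1) -> list[tuple[str, float]]:
    """Return the top_k candidates most similar to query, best first."""
    q = ngrams(query)
    scored = [(name, cosine(q, ngrams(name))) for name in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top_k]


# Hypothetical reference list and query, for illustration only.
reference = ["ING Bank N.V.", "Databricks Inc.", "Apache Software Foundation"]
print(best_matches("ing bank nv", reference))
```

The batch case would call `best_matches` once per query name. That compares every query with every candidate, so the work grows with the product of the two list sizes; the sketch illustrates the task rather than a way to do it at large scale.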

Comments: 3
@mohitpansari6603 2 years ago
Brilliant!!! The first question, about LSH (Locality Sensitive Hashing), is a good one; if implemented, it will boost performance.
@terrylao6344 1 year ago
Any information about how small the cluster size was?
@Svishnupriya-rf7xj 3 years ago
Please send me the full fuzzy data matching code.