The "big data" era is characterized by an explosion of information in
the form of digital data collections, ranging from scientific
knowledge, to social media, news, and everyone's daily life. Examples
of such collections include scientific publications, enterprise logs,
news articles, social media, and general web pages. Valuable knowledge
about multi-typed entities is often hidden in the unstructured or
loosely structured, interconnected data. Mining latent structures
around entities uncovers hidden knowledge such as implicit topics,
phrases, entity roles and relationships. In this monograph, we
investigate the principles and methodologies of mining latent entity
structures from massive unstructured and interconnected data. We
propose a text-rich information network model for modeling data in
many different domains. This leads to a series of new principles and
powerful methodologies for mining latent structures, including (1)
latent topical hierarchy, (2) quality topical phrases, (3)entity roles
in hierarchical topical communities, and (4) entity relations. This
book also introduces applications enabled by the mined structures and
points out some promising research directions.
Les mer
Produktdetaljer
ISBN
9783031019074
Publisert
2022
Utgiver
Springer Nature
Språk
Product language
Engelsk
Format
Product format
Digital bok
Forfatter