The real-world data, though massive, is largely unstructured, in the form of natural-language text. It is challenging but highly desirable to mine structures from massive text data, without extensive human annotation and labeling. In this book, we investigate the principles and methodologies of mining structures of factual knowledge (e.g., entities and their relationships) from massive, unstructured text corpora. Departing from many existing structure extraction methods that have heavy reliance on human annotated data for model training, our effort-light approach leverages human-curated facts stored in external knowledge bases as distant supervision and exploits rich data redundancy in large text corpora for context understanding. This effort-light mining approach leads to a series of new principles and powerful methodologies for structuring text corpora, including (1) entity recognition, typing and synonym discovery, (2) entity relation extraction, and (3) open-domain attribute-valuemining and information extraction. This book introduces this new research frontier and points out some promising research directions.
Read more
Departing from many existing structure extraction methods that have heavy reliance on human annotated data for model training, our effort-light approach leverages human-curated facts stored in external knowledge bases as distant supervision and exploits rich data redundancy in large text corpora for context understanding.
Read more
Acknowledgments.- Introduction.- Background.- Literature Review.- Entity Recognition and Typing with Knowledge Bases.- Fine-Grained Entity Typing with Knowledge Bases.- Synonym Discovery from Large Corpus.- Joint Extraction of Typed Entities and Relationships.- Pattern-Enhanced Embedding Learning for Relation Extraction.- Heterogeneous Supervision for Relation Extraction.- Indirect Supervision: Leveraging Knowledge from Auxiliary Tasks.- Mining Entity Attribute Values with Meta Patterns.- Open Information Extraction with Global Structure Cohesiveness.- Open Information Extraction with Global Structure Cohesiveness.- Applications.- Conclusions.- Vision and Future Work.- Bibliography.- Authors' Biographies.
Read more
GPSR Compliance
The European Union's (EU) General Product Safety Regulation (GPSR) is a set of rules that requires consumer products to be safe and our obligations to ensure this.
If you have any concerns about our products you can contact us on ProductSafety@springernature.com.
In case Publisher is established outside the EU, the EU authorized representative is:
Springer Nature Customer Service Center GmbH
Europaplatz 3
69115 Heidelberg, Germany
ProductSafety@springernature.com
Read more
Product details
ISBN
9783031007842
Published
2018-06-26
Publisher
Springer International Publishing AG
Height
235 mm
Width
191 mm
Age
Professional/practitioner, P, 06
Language
Product language
Engelsk
Format
Product format
Heftet
Number of pages
183
Original title
Mining Structures of Factual Knowledge from Text