Description
Machine learning is often presented as a set of algorithms, while knowledge extraction is often presented as a set of information retrieval or data mining techniques. In practice, the two fields meet in operational systems that must transform records, documents, images, events, and conversations into knowledge that people can understand and use. This book treats knowledge extraction as a full process rather than a single model. The process begins with the definition of a question and ends with a maintained knowledge product: a classification, a summary, a knowledge graph, a decision rule, a searchable index, a report, or an interface used in daily work. The chapters move from conceptual foundations to project execution. They discuss data sources, representation, supervised and unsupervised learning, natural language processing, knowledge graphs, evaluation, interpretability, deployment, and governance. The approach is practical and interdisciplinary. It recognizes that technical performance is important, but that extracted knowledge must also be useful, auditable, fair, and explainable. A project that cannot be trusted or maintained cannot be considered successful, even if its model scores appear strong.