Zum Hauptinhalt springen
Dekorationsartikel gehören nicht zum Leistungsumfang.
Text Mining with Machine Learning
Principles and Techniques
Taschenbuch von Jan ¿I¿Ka (u. a.)
Sprache: Englisch

106,95 €*

inkl. MwSt.

Versandkostenfrei per Post / DHL

Lieferzeit 1-2 Wochen

Kategorien:
Beschreibung
This book provides a perspective on the application of machine learning-based methods in knowledge discovery from natural languages texts. By analysing various data sets, conclusions which are not normally evident, emerge and can be used for various purposes and applications. The book provides explanations of principles of time-proven machine learning algorithms applied in text mining together with step-by-step demonstrations of how to reveal the semantic contents in real-world datasets using the popular R-language with its implemented machine learning algorithms. The book is not only aimed at IT specialists, but is meant for a wider audience that needs to process big sets of text documents and has basic knowledge of the subject, e.g. e-mail service providers, online shoppers, librarians, etc.

The book starts with an introduction to text-based natural language data processing and its goals and problems. It focuses on machine learning, presenting various algorithms with their use and possibilities, and reviews the positives and negatives. Beginning with the initial data pre-processing, a reader can follow the steps provided in the R-language including the subsuming of various available plug-ins into the resulting software tool. A big advantage is that R also contains many libraries implementing machine learning algorithms, so a reader can concentrate on the principal target without the need to implement the details of the algorithms her- or himself. To make sense of the results, the book also provides explanations of the algorithms, which supports the final evaluation and interpretation of the results. The examples are demonstrated using realworld data from commonly accessible Internet sources.
This book provides a perspective on the application of machine learning-based methods in knowledge discovery from natural languages texts. By analysing various data sets, conclusions which are not normally evident, emerge and can be used for various purposes and applications. The book provides explanations of principles of time-proven machine learning algorithms applied in text mining together with step-by-step demonstrations of how to reveal the semantic contents in real-world datasets using the popular R-language with its implemented machine learning algorithms. The book is not only aimed at IT specialists, but is meant for a wider audience that needs to process big sets of text documents and has basic knowledge of the subject, e.g. e-mail service providers, online shoppers, librarians, etc.

The book starts with an introduction to text-based natural language data processing and its goals and problems. It focuses on machine learning, presenting various algorithms with their use and possibilities, and reviews the positives and negatives. Beginning with the initial data pre-processing, a reader can follow the steps provided in the R-language including the subsuming of various available plug-ins into the resulting software tool. A big advantage is that R also contains many libraries implementing machine learning algorithms, so a reader can concentrate on the principal target without the need to implement the details of the algorithms her- or himself. To make sense of the results, the book also provides explanations of the algorithms, which supports the final evaluation and interpretation of the results. The examples are demonstrated using realworld data from commonly accessible Internet sources.
Über den Autor

Jan Žižka is a consultant in machine learning and data mining. He has worked as a system programmer, developer of advanced software systems, and researcher. For the last 25 years, he has devoted himself to AI and machine learning, especially text mining. He has been a faculty at a number of universities and research institutes. He has authored approximately 100 international publications.

František Däena is an associate professor and the head of the Text Mining and NLP group at the Department of Informatics, Mendel University, Brno. He has published numerous articles in international scientific journals, conference proceedings, and monographs, and is a member of editorial boards of several international journals. His research includes text/data mining, intelligent data processing, and machine learning.

Arnošt Svoboda is an expert programer. His speciality includes programming languages and systems such as R, Assembler, Matlab, PL/1, Cobol, Fortran, Pascal, and others. He started as a system programmer. The last 20 years, Arnošt has worked also as a teacher and researcher at Masaryk University in Brno. His current interest are machine learning and data mining.

Inhaltsverzeichnis

Introduction to the Text Mining. Problematics. Textual Data in Natural Languages and Their Computer Representation. Typical Tasks and Problems. Basic Processing Tools. Machine Learning and Its Application. Applying Sequences of Machine Learning Algorithms. R-language and Its Use for Machine Learning-Based Text Mining. Real-World-Data Examples and Their Basic Preprocessing Using R. Advanced Text Mining Using Machine Learning and R. Selecting Appropriate Machine Learning Algorithms. Examples of Typical Task Solutions. Interpretation of Results.

Details
Erscheinungsjahr: 2021
Fachbereich: Betriebssysteme & Benutzeroberflächen
Genre: Importe, Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
ISBN-13: 9781032086217
ISBN-10: 1032086211
Sprache: Englisch
Ausstattung / Beilage: Paperback
Einband: Kartoniert / Broschiert
Autor: ¿I¿Ka, Jan
Da¿ena, Franti¿ek
Svoboda, Arno¿t
Hersteller: CRC Press
Verantwortliche Person für die EU: Books on Demand GmbH, In de Tarpen 42, D-22848 Norderstedt, info@bod.de
Maße: 234 x 156 x 20 mm
Von/Mit: Jan ¿I¿Ka (u. a.)
Erscheinungsdatum: 30.06.2021
Gewicht: 0,559 kg
Artikel-ID: 128439037
Über den Autor

Jan Žižka is a consultant in machine learning and data mining. He has worked as a system programmer, developer of advanced software systems, and researcher. For the last 25 years, he has devoted himself to AI and machine learning, especially text mining. He has been a faculty at a number of universities and research institutes. He has authored approximately 100 international publications.

František Däena is an associate professor and the head of the Text Mining and NLP group at the Department of Informatics, Mendel University, Brno. He has published numerous articles in international scientific journals, conference proceedings, and monographs, and is a member of editorial boards of several international journals. His research includes text/data mining, intelligent data processing, and machine learning.

Arnošt Svoboda is an expert programer. His speciality includes programming languages and systems such as R, Assembler, Matlab, PL/1, Cobol, Fortran, Pascal, and others. He started as a system programmer. The last 20 years, Arnošt has worked also as a teacher and researcher at Masaryk University in Brno. His current interest are machine learning and data mining.

Inhaltsverzeichnis

Introduction to the Text Mining. Problematics. Textual Data in Natural Languages and Their Computer Representation. Typical Tasks and Problems. Basic Processing Tools. Machine Learning and Its Application. Applying Sequences of Machine Learning Algorithms. R-language and Its Use for Machine Learning-Based Text Mining. Real-World-Data Examples and Their Basic Preprocessing Using R. Advanced Text Mining Using Machine Learning and R. Selecting Appropriate Machine Learning Algorithms. Examples of Typical Task Solutions. Interpretation of Results.

Details
Erscheinungsjahr: 2021
Fachbereich: Betriebssysteme & Benutzeroberflächen
Genre: Importe, Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
ISBN-13: 9781032086217
ISBN-10: 1032086211
Sprache: Englisch
Ausstattung / Beilage: Paperback
Einband: Kartoniert / Broschiert
Autor: ¿I¿Ka, Jan
Da¿ena, Franti¿ek
Svoboda, Arno¿t
Hersteller: CRC Press
Verantwortliche Person für die EU: Books on Demand GmbH, In de Tarpen 42, D-22848 Norderstedt, info@bod.de
Maße: 234 x 156 x 20 mm
Von/Mit: Jan ¿I¿Ka (u. a.)
Erscheinungsdatum: 30.06.2021
Gewicht: 0,559 kg
Artikel-ID: 128439037
Sicherheitshinweis