Text Mining
Challenges, Algorithms, Tools and Applications

Download Url(s)
https://mdpi.com/books/pdfview/book/12172Contributor(s)
Liu, Fei (editor)
Language
EnglishAbstract
Encompassing fields such as information retrieval, extraction, classification, summarization and understanding, text mining has become indispensable across diverse application domains. Methodological advances span rule-based systems, statistical approaches, support vector machines, clustering, neural networks and most recently, deep learning, with distance and similarity estimation persisting as central challenges. This Special Issue comprises two survey papers and sixteen research articles, all addressing practical aspects of text mining. The surveys review methods for financial market prediction through social media analysis and sentiment detection, and provide a synthesis of techniques, evaluation strategies, and applications of recommender systems across domains including e-commerce, social media, and online learning. The research articles extend to text classification, sentiment analysis, summarization, and natural language understanding, as well as corpus construction and e-governance applications. Contributions include a large-scale Chinese toponym corpus, CHTopo, deep learning models for tourist behaviour prediction, semantic optimization of artifact presentation, BERT-based knowledge extraction for agricultural ontology construction, and transformer-based approaches for detecting moral features in television narratives.
Keywords
Text mining; Sentiment analysis; Financial market prediction; Big data analytics; News; Social media; Aspect-based sentiment analysis; Dependent syntactic analysis; Dependency weighting; Graph attention network; Pretrained model; Extractive text summarization; Sentence scoring scheme; Graph analytics; Graph-based clustering; Opinion mining; Privacy policy; Longitudinal analysis; Text analysis; NLP; Readability; Vagueness; AI in literary analysis; Machine learning; Modern french poetry classification; Feature extraction techniques; SVM in poetry analysis; Moral features; Moral foundations theory; TV series; Storytelling; Language analysis; Genre analysis; Deep learning; Transformers; SBERT; Knowledge graph; Medicine dictionary; Structured triples; Information extraction; Question answering; Artificial intelligence; Text classification; Genre classification; Russian literature; Stylometry; Genres dataset; Wheat sharp eyespot; Knowledge extraction; Domain ontology; Automatic extraction; Tourism; Visit intention; Sentence transformers; Recommender system; Recommendation system; Applications; Data sources; Features; Challenges; Social systems; Semantic separation; Content presentation order; Web design; App design; Pagination; Infinite scroll; User engagement; Sentence classification; Hybrid architecture; Attention mechanism; Synonym augmentation; GloVe embeddings; Temporal sensitivity; Ensemble learning; Multiple-View Summarization; COVID-19 vaccine tweet summarization; Microblog summarization; Social feature-focused summarization; Entity-based summarization; Distance-centered summarization; Chinese text; Toponym; Annotated corpus; Toponym recognition; Multilingual complaint classification; Large language models; Embedding models; Instruction-based classification; Zero-shot learning; Few-shot inference; Public transportation; Resource-efficient NLP; Supervised fine-tuning; Public sector NLP; Global digital contract; Internet governance standards; Internet governance rules; SAR data processing; AI techniquesWebshop link
https://mdpi.com/books/ISBN
9783725861392, 9783725861408Publisher website
www.mdpi.com/booksPublication date and place
CH, 2026Classification
Information technology industries

