Topic Modelling: Going Beyond Token Outputs (2024)

[Submitted on 16 Jan 2024]

View PDF HTML (experimental)

Abstract:Topic modelling is a text mining technique for identifying salient themes from a number of documents. The output is commonly a set of topics consisting of isolated tokens that often co-occur in such documents. Manual effort is often associated with interpreting a topic's description from such tokens. However, from a human's perspective, such outputs may not adequately provide enough information to infer the meaning of the topics; thus, their interpretability is often inaccurately understood. Although several studies have attempted to automatically extend topic descriptions as a means of enhancing the interpretation of topic models, they rely on external language sources that may become unavailable, must be kept up-to-date to generate relevant results, and present privacy issues when training on or processing data. This paper presents a novel approach towards extending the output of traditional topic modelling methods beyond a list of isolated tokens. This approach removes the dependence on external sources by using the textual data itself by extracting high-scoring keywords and mapping them to the topic model's token outputs. To measure the interpretability of the proposed outputs against those of the traditional topic modelling approach, independent annotators manually scored each output based on their quality and usefulness, as well as the efficiency of the annotation task. The proposed approach demonstrated higher quality and usefulness, as well as higher efficiency in the annotation task, in comparison to the outputs of a traditional topic modelling method, demonstrating an increase in their interpretability.
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as: arXiv:2401.12990 [cs.CL]
(or arXiv:2401.12990v1 [cs.CL] for this version)
https://doi.org/10.48550/arXiv.2401.12990

arXiv-issued DOI via DataCite

Related DOI: https://doi.org/10.3390/bdcc8050044

DOI(s) linking to related resources

Submission history

From: Lowri Williams [view email]
[v1] Tue, 16 Jan 2024 16:05:54 UTC (7,465 KB)

Topic Modelling: Going Beyond Token Outputs (2024)
Top Articles
About Us | Joybird
What is my credit card account number?
Fighter Torso Ornament Kit
Nybe Business Id
J & D E-Gitarre 905 HSS Bat Mark Goth Black bei uns günstig einkaufen
Goodbye Horses: The Many Lives of Q Lazzarus
THE 10 BEST River Retreats for 2024/2025
House Share: What we learned living with strangers
Rainfall Map Oklahoma
Alaska: Lockruf der Wildnis
Tracking Your Shipments with Maher Terminal
Who called you from 6466062860 (+16466062860) ?
Nene25 Sports
Mile Split Fl
Classic | Cyclone RakeAmerica's #1 Lawn and Leaf Vacuum
Effingham Bookings Florence Sc
1989 Chevy Caprice For Sale Craigslist
Wsop Hunters Club
The Largest Banks - ​​How to Transfer Money With Only Card Number and CVV (2024)
Tips on How to Make Dutch Friends & Cultural Norms
Universal Stone Llc - Slab Warehouse & Fabrication
Evil Dead Rise Showtimes Near Regal Sawgrass & Imax
Cain Toyota Vehicles
Www.craigslist.com Austin Tx
2021 MTV Video Music Awards: See the Complete List of Nominees - E! Online
Synergy Grand Rapids Public Schools
No Limit Telegram Channel
Robotization Deviantart
5 Star Rated Nail Salons Near Me
The Posturepedic Difference | Sealy New Zealand
Have you seen this child? Caroline Victoria Teague
MethStreams Live | BoxingStreams
Movies123.Pick
Carespot Ocoee Photos
Best Restaurants In Blacksburg
Hannibal Mo Craigslist Pets
Myql Loan Login
Banana Republic Rewards Login
Td Ameritrade Learning Center
301 Priest Dr, KILLEEN, TX 76541 - HAR.com
9 oplossingen voor het laptoptouchpad dat niet werkt in Windows - TWCB (NL)
How to Get a Better Signal on Your iPhone or Android Smartphone
Lake Andes Buy Sell Trade
Doe Infohub
Cocorahs South Dakota
Pike County Buy Sale And Trade
UT Announces Physician Assistant Medicine Program
Lyons Hr Prism Login
La Qua Brothers Funeral Home
25 Hotels TRULY CLOSEST to Woollett Aquatics Center, Irvine, CA
Identogo Manahawkin
Latest Posts
Article information

Author: Greg O'Connell

Last Updated:

Views: 6324

Rating: 4.1 / 5 (42 voted)

Reviews: 81% of readers found this page helpful

Author information

Name: Greg O'Connell

Birthday: 1992-01-10

Address: Suite 517 2436 Jefferey Pass, Shanitaside, UT 27519

Phone: +2614651609714

Job: Education Developer

Hobby: Cooking, Gambling, Pottery, Shooting, Baseball, Singing, Snowboarding

Introduction: My name is Greg O'Connell, I am a delightful, colorful, talented, kind, lively, modern, tender person who loves writing and wants to share my knowledge and understanding with you.