Are Neural Topic Models Broken? (2024)

Alexander Miserlis Hoyle,Rupak Sarkar,Pranav Goel,Philip Resnik

Abstract

Recently, the relationship between automated and human evaluation of topic models has been called into question. Method developers have staked the efficacy of new topic model variants on automated measures, and their failure to approximate human preferences places these models on uncertain ground. Moreover, existing evaluation paradigms are often divorced from real-world use.Motivated by content analysis as a dominant real-world use case for topic modeling, we analyze two related aspects of topic models that affect their effectiveness and trustworthiness in practice for that purpose: the stability of their estimates and the extent to which the model’s discovered categories align with human-determined categories in the data. We find that neural topic models fare worse in both respects compared to an established classical method. We take a step toward addressing both issues in tandem by demonstrating that a straightforward ensembling method can reliably outperform the members of the ensemble.

Anthology ID:
2022.findings-emnlp.390
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2022
Month:
December
Year:
2022
Address:
Abu Dhabi, United Arab Emirates
Editors:
Yoav Goldberg,Zornitsa Kozareva,Yue Zhang
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
5321–5344
Language:
URL:
https://aclanthology.org/2022.findings-emnlp.390
DOI:
10.18653/v1/2022.findings-emnlp.390
Bibkey:
Cite (ACL):
Alexander Miserlis Hoyle, Rupak Sarkar, Pranav Goel, and Philip Resnik. 2022. Are Neural Topic Models Broken?. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 5321–5344, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
Cite (Informal):
Are Neural Topic Models Broken? (Hoyle et al., Findings 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.findings-emnlp.390.pdf
Video:
https://aclanthology.org/2022.findings-emnlp.390.mp4

PDFCiteSearchVideo

Export citation
  • BibTeX
  • MODS XML
  • Endnote
  • Preformatted
@inproceedings{hoyle-etal-2022-neural, title = "Are Neural Topic Models Broken?", author = "Hoyle, Alexander Miserlis and Sarkar, Rupak and Goel, Pranav and Resnik, Philip", editor = "Goldberg, Yoav and Kozareva, Zornitsa and Zhang, Yue", booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2022", month = dec, year = "2022", address = "Abu Dhabi, United Arab Emirates", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2022.findings-emnlp.390", doi = "10.18653/v1/2022.findings-emnlp.390", pages = "5321--5344", abstract = "Recently, the relationship between automated and human evaluation of topic models has been called into question. Method developers have staked the efficacy of new topic model variants on automated measures, and their failure to approximate human preferences places these models on uncertain ground. Moreover, existing evaluation paradigms are often divorced from real-world use.Motivated by content analysis as a dominant real-world use case for topic modeling, we analyze two related aspects of topic models that affect their effectiveness and trustworthiness in practice for that purpose: the stability of their estimates and the extent to which the model{'}s discovered categories align with human-determined categories in the data. We find that neural topic models fare worse in both respects compared to an established classical method. We take a step toward addressing both issues in tandem by demonstrating that a straightforward ensembling method can reliably outperform the members of the ensemble.",}

Download as File

<?xml version="1.0" encoding="UTF-8"?><modsCollection xmlns="http://www.loc.gov/mods/v3"><mods ID="hoyle-etal-2022-neural"> <titleInfo> <title>Are Neural Topic Models Broken?</title> </titleInfo> <name type="personal"> <namePart type="given">Alexander</namePart> <namePart type="given">Miserlis</namePart> <namePart type="family">Hoyle</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Rupak</namePart> <namePart type="family">Sarkar</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Pranav</namePart> <namePart type="family">Goel</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Philip</namePart> <namePart type="family">Resnik</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <originInfo> <dateIssued>2022-12</dateIssued> </originInfo> <typeOfResource>text</typeOfResource> <relatedItem type="host"> <titleInfo> <title>Findings of the Association for Computational Linguistics: EMNLP 2022</title> </titleInfo> <name type="personal"> <namePart type="given">Yoav</namePart> <namePart type="family">Goldberg</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Zornitsa</namePart> <namePart type="family">Kozareva</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Yue</namePart> <namePart type="family">Zhang</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <originInfo> <publisher>Association for Computational Linguistics</publisher> <place> <placeTerm type="text">Abu Dhabi, United Arab Emirates</placeTerm> </place> </originInfo> <genre authority="marcgt">conference publication</genre> </relatedItem> <abstract>Recently, the relationship between automated and human evaluation of topic models has been called into question. Method developers have staked the efficacy of new topic model variants on automated measures, and their failure to approximate human preferences places these models on uncertain ground. Moreover, existing evaluation paradigms are often divorced from real-world use.Motivated by content analysis as a dominant real-world use case for topic modeling, we analyze two related aspects of topic models that affect their effectiveness and trustworthiness in practice for that purpose: the stability of their estimates and the extent to which the model’s discovered categories align with human-determined categories in the data. We find that neural topic models fare worse in both respects compared to an established classical method. We take a step toward addressing both issues in tandem by demonstrating that a straightforward ensembling method can reliably outperform the members of the ensemble.</abstract> <identifier type="citekey">hoyle-etal-2022-neural</identifier> <identifier type="doi">10.18653/v1/2022.findings-emnlp.390</identifier> <location> <url>https://aclanthology.org/2022.findings-emnlp.390</url> </location> <part> <date>2022-12</date> <extent unit="page"> <start>5321</start> <end>5344</end> </extent> </part></mods></modsCollection>

Download as File

%0 Conference Proceedings%T Are Neural Topic Models Broken?%A Hoyle, Alexander Miserlis%A Sarkar, Rupak%A Goel, Pranav%A Resnik, Philip%Y Goldberg, Yoav%Y Kozareva, Zornitsa%Y Zhang, Yue%S Findings of the Association for Computational Linguistics: EMNLP 2022%D 2022%8 December%I Association for Computational Linguistics%C Abu Dhabi, United Arab Emirates%F hoyle-etal-2022-neural%X Recently, the relationship between automated and human evaluation of topic models has been called into question. Method developers have staked the efficacy of new topic model variants on automated measures, and their failure to approximate human preferences places these models on uncertain ground. Moreover, existing evaluation paradigms are often divorced from real-world use.Motivated by content analysis as a dominant real-world use case for topic modeling, we analyze two related aspects of topic models that affect their effectiveness and trustworthiness in practice for that purpose: the stability of their estimates and the extent to which the model’s discovered categories align with human-determined categories in the data. We find that neural topic models fare worse in both respects compared to an established classical method. We take a step toward addressing both issues in tandem by demonstrating that a straightforward ensembling method can reliably outperform the members of the ensemble.%R 10.18653/v1/2022.findings-emnlp.390%U https://aclanthology.org/2022.findings-emnlp.390%U https://doi.org/10.18653/v1/2022.findings-emnlp.390%P 5321-5344

Download as File

Markdown (Informal)

[Are Neural Topic Models Broken?](https://aclanthology.org/2022.findings-emnlp.390) (Hoyle et al., Findings 2022)

  • Are Neural Topic Models Broken? (Hoyle et al., Findings 2022)
ACL
  • Alexander Miserlis Hoyle, Rupak Sarkar, Pranav Goel, and Philip Resnik. 2022. Are Neural Topic Models Broken?. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 5321–5344, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
See Also
Coherence
Are Neural Topic Models Broken? (2024)
Top Articles
Topic Modelling: A Comparison Between LDA, NMF, BERTopic and Top2Vec — Part I
Ankr on LinkedIn: 📢 Join Ankr CEO &amp; Co-founder Chandler Song at LEAP 2024, the world’s most…
Evil Dead Movies In Order & Timeline
Skyward Sinton
Cappacuolo Pronunciation
Edina Omni Portal
Cintas Pay Bill
Ofw Pinoy Channel Su
Grange Display Calculator
Horoscopes and Astrology by Yasmin Boland - Yahoo Lifestyle
Amateur Lesbian Spanking
Geometry Escape Challenge A Answer Key
Fire Rescue 1 Login
Clairememory Scam
Https://Gw.mybeacon.its.state.nc.us/App
Socket Exception Dunkin
Rosemary Beach, Panama City Beach, FL Real Estate & Homes for Sale | realtor.com®
Kinkos Whittier
Animal Eye Clinic Huntersville Nc
Dc Gas Login
Blackwolf Run Pro Shop
Mals Crazy Crab
Walgreens San Pedro And Hildebrand
Pretend Newlyweds Nikubou Maranoshin
Hobby Stores Near Me Now
PCM.daily - Discussion Forum: Classique du Grand Duché
Egusd Lunch Menu
Hrconnect Kp Login
The Goonies Showtimes Near Marcus Rosemount Cinema
Mississippi Craigslist
Sacramento Craigslist Cars And Trucks - By Owner
Poe T4 Aisling
Devargasfuneral
140000 Kilometers To Miles
Naya Padkar Newspaper Today
Nobodyhome.tv Reddit
Sept Month Weather
Craigslist Pets Plattsburgh Ny
US-amerikanisches Fernsehen 2023 in Deutschland schauen
Academic Calendar / Academics / Home
Hanco*ck County Ms Busted Newspaper
Yale College Confidential 2027
Gas Buddy Il
Best Haircut Shop Near Me
Zeeks Pizza Calories
Benjamin Franklin - Printer, Junto, Experiments on Electricity
Online TikTok Voice Generator | Accurate & Realistic
Craigslist Monterrey Ca
Land of Samurai: One Piece’s Wano Kuni Arc Explained
Ff14 Palebloom Kudzu Cloth
32 Easy Recipes That Start with Frozen Berries
Varsity Competition Results 2022
Latest Posts
Article information

Author: Neely Ledner

Last Updated:

Views: 6336

Rating: 4.1 / 5 (42 voted)

Reviews: 81% of readers found this page helpful

Author information

Name: Neely Ledner

Birthday: 1998-06-09

Address: 443 Barrows Terrace, New Jodyberg, CO 57462-5329

Phone: +2433516856029

Job: Central Legal Facilitator

Hobby: Backpacking, Jogging, Magic, Driving, Macrame, Embroidery, Foraging

Introduction: My name is Neely Ledner, I am a bright, determined, beautiful, adventurous, adventurous, spotless, calm person who loves writing and wants to share my knowledge and understanding with you.