Hungarian case study on automated detection of body-shaming comments using machine learning

Gőz, Franciska Noémi, B. Varga, Erika (2025) Hungarian case study on automated detection of body-shaming comments using machine learning In: Proceedings of the International Conference on Formal Methods and Foundations of Artificial Intelligence. Eger, Eszterházy Károly Catholic University. pp. 67-77.

pdf
fmfai2025_pp065-077.pdf
Download (809kB) [error in script]

Hivatalos webcím (URL): https://doi.org/10.17048/fmfai.2025.65

Absztrakt (kivonat)

Social media facilitates online interactions but also enables bodyshaming comments which are often ambiguous. This paper presents a machine learning-based approach for detecting Hungarian body-shaming comments, an underrepresented area in NLP. A dataset of Facebook comments was collected and expanded with synthetic data. Using HuSpaCy and HuBERT, logistic regression and MLP classification models were trained with TF-IDF and SBERT embeddings. The best-performing model achieved 88% accuracy, demonstrating the potential of NLP techniques for moderating harmful online content in low-resource languages. The results highlight key challenges, including category overlap and class imbalance, emphasizing the need for context-aware classification methods in automated content moderation.

Mű típusa:	Könyvrészlet - Book section
Szerző:	Szerző neve Email MTMT azonosító ORCID azonosító Közreműködés Gőz, Franciska Noémi gozfanni@gmail.com NEM RÉSZLETEZETT NEM RÉSZLETEZETT Szerző B. Varga, Erika erika.b.varga@uni-miskolc.hu NEM RÉSZLETEZETT NEM RÉSZLETEZETT Szerző
Kapcsolódó URL-ek:	Befoglaló kötet
Kulcsszavak:	Hungarian text analysis, toxic comment filtering, social media moderation, body-shaming detection, machine learning classification
Nyelv:	angol
DOI azonosító:	10.17048/fmfai.2025.65
Felhasználó:	Tibor Gál
Dátum:	28 Okt 2025 10:05
Utolsó módosítás:	28 Okt 2025 10:05
URI:	http://publikacio.uni-eszterhazy.hu/id/eprint/8803

Műveletek (bejelentkezés szükséges)

Tétel nézet

Intézményi Publikációk is powered by EPrints 3 which is developed by the School of Electronics and Computer Science at the University of Southampton. More information and software credits.

TÁMOP-4.2.1.D-15/1/KONV-2015-0013 Kutatás, Innováció, Együttműködések – Társadalmi innováció és kutatási hálózatok együttműködésének erősítése projekt keretében készült.