Text simplification papers at LREC-COLING 2024
The community of researchers working on text simplification and readability is rapidly growing and the research in these areas is becoming increasingly multilingual. I identified 47 papers on these topics published at LREC-COLING 2024 (including workshop papers) and ordered them by language.
![](/assets/img/ts-lrec2024-languages.png)
Arabic:
- Bashar Alhafni, Reem Hazim, Juan David Pineros Liberato, Muhamed Al Khalil, and Nizar Habash: The SAMER Arabic Text Simplification Corpus
- Mo El-Haj, Sultan Almujaiwel, Damith Premasiri, Tharindu Ranasinghe, and Ruslan Mitkov: DARES: Dataset for Arabic Readability Estimation of School Materials
Bengali:
- Nabila Ayman, Md. Akram Hossain, Abdul Aziz, Rokan Uddin Faruqui, and Abu Nowshed Chy: BengaliLCP: A Dataset for Lexical Complexity Prediction in the Bengali Texts
Chinese:
- Fengkai Liu and John S. Y. Lee: CSSWiki: A Chinese Sentence Simplification Dataset with Linguistic and Content Operations
- Ruining Chong, Luming Lu, Liner Yang, Jinran Nie, Zhenghao Liu, Shuo Wang, Shuhan Zhou, Yaoxin Li, and Erhong Yang: MCTS: A Multi-Reference Chinese Text Simplification Dataset
Dutch:
- Daniel Vlantis, Iva Gornishka, and Shuai Wang: Benchmarking the Simplification of Dutch Municipal Text
- Nadine Beks van Raaij, Daan Kolkman, and Ksenia Podoynitsyna: Clearer Governmental Communication: Text Simplification with ChatGPT Evaluated by Quantitative and Qualitative Research
English:
- Noof Abdullah Alfear, Dimitar Kazakov, and Hend Al-Khalifa: Meta-Evaluation of Sentence Simplification Metrics
- Yichen Huang and Ekaterina Kochmar: REFeREE: A REference-FREE Model-Based Metric for Text Simplification
- Yuki Hironaka, Tomoyuki Kajiwara, and Takashi Ninomiya: Transfer Fine-tuning for Quality Estimation of Text Simplification
- Gabriel Gonzalez-Delgado and Borja Navarro-Colorado: The Simplification of the Language of Public Administration: The Case of Ombudsman Institutions
- Liam Cripwell, Joël Legrand, and Claire Gardent: Evaluating Document Simplification: On the Importance of Separately Assessing Simplicity and Meaning Preservation
- Hikaru Yamanaka and Takenobu Tokunaga: SIERA: An Evaluation Metric for Text Simplification using the Ranking Model and Data Augmentation by Edit Operations
- Andreea Maria Deleanu, Constantin Orasan, Sabine Braun: Accessible Communication: a systematic review and comparative analysis of official English Easy-to-Understand (E2U) language guidelines
- Keren Tan, Kangyang Luo, Yunshi Lan, Zheng Yuan, and Jinlong Shu: An LLM-Enhanced Adversarial Editing System for Lexical Simplification
- Christine Pinney, Casey Kennington, Maria Soledad Pera, Katherine Landau Wright, and Jerry Alan Fails: Incorporating Word-level Phonemic Decoding into Readability Assessment
- Asma Farajidizaji, Vatsal Raina, and Mark Gales: Is It Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language Models
- Liana Ermakova and Jaap Kamps: Complexity-Aware Scientific Literature Search: Searching for Relevant and Accessible Scientific Text
- Jan Bakker and Jaap Kamps: Beyond Sentence-level Text Simplification: Reproducibility Study of Context-Aware Document Simplification
- Jenny Alexandra Ortiz-Zambrano, César Humberto Espín-Riofrío, and Arturo Montejo-Ráez: Enhancing Lexical Complexity Prediction through Few-shot Learning with GPT-3
- Nico Colic, Jin-Dong Kim, and Fabio Rinaldi: Pre-Gamus: Reducing Complexity of Scientific Literature as a Support against Misinformation
Finnish:
- Anna Dmitrieva and Jörg Tiedemann: Towards Automatic Finnish Text Simplification
French:
- Lucía Ormaechea, Nikos Tsourakis, Didier Schwab, Pierrette Bouillon, and Benjamin Lecouteux: Simplification Strategies in French Spontaneous Speech
- Rodrigo Wilkens, Patrick Watrin, and Thomas François: Paying attention to the words: explaining readability prediction for French as a foreign language
German:
- Regina Stodden and Phillip Nguyen: Can Text Simplification Help to Increase the Acceptance of E-participation?
- Regina Stodden: Reproduction of German Text Simplification Systems
- Leon Fruth, Robin Jegan, and Andreas Henrich: An Approach towards Unsupervised Text Simplification on Paragraph-Level for German Texts
- Luisa Carrer, Andreas Säuberli, Martin Kappus, and Sarah Ebling: Towards Holistic Human Evaluation of Automatic Text Simplification
Japanese:
- Yoshinari Nagai, Teruaki Oka, and Mamoru Komachi: A Document-Level Text Simplification Dataset for Japanese
- Toru Urakawa, Yuya Taguchi, Takuro Niitsuma, and Hideaki Tamori: A Japanese News Simplification Corpus with Faithfulness
Latin:
- Thomas Laurs: Towards a Readability Formula for Latin
Norwegian:
- Sondre Wold, Petter Mæhlum, and Oddbjørn Hove: Estimating Lexical Complexity from Document-Level Distributions
Romanian:
- Madalina Chitez, Mihai Dascalu, Aura Cristina Udrea, Cosmin Strilețchi, Karla Csürös, Roxana Rogobete, and Alexandru Oravițan: Towards Building the LEMI Readability Platform for Children’s Literature in the Romanian Language
Russian:
- Mark Athugodage, Olga Mitrofanove, and Vadim Gudkov: Transfer Learning for Russian Legal Text Simplification
Serbian:
- Anđelka Zečević, Milica Ćulafić, and Stefan Stojković: On Simplification of Discharge Summaries in Serbian: Facing the Challenges
Sesotho:
- Johannes Sibeko and Menno van Zaanen: Adapting Nine Traditional Text Readability Measures into Sesotho
Setswana:
- Johannes Sibeko: Compiling a List of Frequently Used Setswana Words for Developing Readability Measures
Slovene:
- Aleš Žagar, Matej Klemen, Marko Robnik-Šikonja, and Iztok Kosem: SENTA: Sentence Simplification System for Slovene
Spanish:
- Margot Madina, Itziar Gonzalez-Dios, and Melanie Siegel: A Preliminary Study of ChatGPT for Spanish E2R Text Adaptation
- Jasper Degraeuwe and Patrick Goethals: LexComSpaL2: A Lexical Complexity Corpus for Spanish as a Foreign Language
- Leonardo Campillos-Llanos, Ana Rosa Terroba, Rocío Bartolomé, Ana Valverde-Mateos, Cristina González, Adrián Capllonch-Carrión, and Jonathan Heras: Replace, Paraphrase or Fine-tune? Evaluating Automatic Simplification for Medical Texts in Spanish
- Margot Madina, Itziar Gonzalez-Dios, Melanie Siegel: LanguageTool as a CAT tool for Easy-to-Read in Spanish
Swedish:
- Julius Monsen and Arne Jonsson: Controllable Sentence Simplification in Swedish Using Control Prefixes and Mined Paraphrases
Multilingual:
- Antoine Jamelot, Solen Quiniou, and Sophie Hamon: Improving Text Readability through Segmentation into Rheses
- Matthew Shardlow, Kai North, and Marcos Zampieri: A Multilingual Survey of Recent Lexical Complexity Prediction Resources through the Recommendations of the Complex 2.0 Framework
- Matthew Shardlow, Fernando Alva-Manchego, Riza Batista-Navarro, Stefan Bott, Saul Calderon Ramirez, Rémi Cardon, Thomas François, Akio Hayakawa, Andrea Horbach, Anna Huelsing, Yusuke Ide, Joseph Marvin Imperial, Adam Nohejl, Kai North, Laura Occhipinti, Nelson Peréz Rojas, Nishat Raihan, Tharindu Ranasinghe, Martin Solis Salazar, et al.: An Extensible Massively Multilingual Lexical Simplification Pipeline Dataset using the MultiLS Framework
- Miriam Anschütz, Edoardo Mosca, and Georg Groh: Simpler Becomes Harder: Do LLMs Exhibit a Coherent Behavior on Simplified Corpora?