Ethics in machine learning publications: Peer-review analysis using NLP methods

Agisheva, Aigul

aalto1 untyped-item.component.html

Ethics in machine learning publications: Peer-review analysis using NLP methods

Perustieteiden korkeakoulu | Master's thesis

Electronic archive copy is available via Aalto Thesis Database.

Instructions

Authors

Agisheva, Aigul

Date

2023-05-15

Major/Subject

Human-Computer Interaction

Mcode

SCI3097

Degree programme

Master's Programme in Computer, Communication and Information Sciences

Language

en

Pages

44+1

Abstract

Peer review is a critical component of the scientific publishing process, since its results directly influence the decision to publish a research work. Therefore, it is crucial to maintain ethical standards in the peer review process, and this work focuses on one important aspect: the appropriate use of citation recommendations in reviews. This study developed a classification model that identifies reviews with unjustified citation recommendations using NLP methods. To train the model, reviews from ICLR 2021, a top-tier machine learning conference, were manually annotated. It was found that the Multinomial Naive Bayes classifier performed the best among all the classifiers tested, and achieved 82% F1-score, 70% precision and 100% recall for the target class. Moreover, data augmentation techniques and optimal regularization strategies were explored to overcome the dataset's limited size. This classifier could serve as an assistive tool for conference organizers and reviewers. The results of this study provide a starting point for developing a comprehensive solution to ensure adherence to quality and ethical guidelines in peer review.

Supervisor

Jung, Alex

Thesis advisor

Tian, Yu

Keywords

NLP, peer review, text classification, ethics, natural language processing

Permanent link to this item

https://urn.fi/URN:NBN:fi:aalto-202305213337

Collections

[dipl] Perustieteiden korkeakoulu / SCI

Show all metadata

Ethics in machine learning publications: Peer-review analysis using NLP methods

URL

Journal Title

Journal ISSN

Volume Title

Authors

Date

Department

Major/Subject

Mcode

Degree programme

Language

Pages

Series

Abstract

Description

Supervisor

Thesis advisor

Keywords

Other note

Citation

Permanent link to this item

Collections

Endorsement

Review

Supplemented By

Referenced By