Robust Methods for Developer Screening in Rapidly Evolving AI Contexts

要旨

The rise of AI-powered tools like ChatGPT enables non-programmers to bypass programming screening questions, undermining internal validity in usable security and privacy, and software engineering studies. Past ChatGPT-resistant tasks proposed static visual questions, which ChatGPT can now circumvent. Therefore, we tested alternative approaches such as video- and audio-based screeners that reveal key information step by step under strict time constraints to distinguish programmers from non-programmers. To this end, we conducted a study with 74 participants across three groups: programmers, non-programmers without AI assistance, and non-programmers using ChatGPT. Our results showed that audio-based screeners were robust against ChatGPT-based cheating, as non-programmers struggled to find correct answers within time limits, whereas programmers demonstrated high accuracy with minimal time pressure. Based on our findings, we recommend six audio-based ChatGPT-resistant screening questions that maximize screening effectiveness and efficiency and suggest a 215-second instrument that includes 95.87% of programmers while excluding 99.69% of non-programmers.

著者
Raphael Serafini
University of Cologne, Cologne, Germany
Nino Weber
Ruhr University Bochum, Bochum, Germany
Asli Yardim
Ruhr University Bochum, Bochum, Germany
Stefan Albert. Horstmann
Ruhr University Bochum, Bochum, Germany
Alena Naiakshina
Univeristy of Cologne, Cologne, Germany

会議: CHI 2026

ACM CHI Conference on Human Factors in Computing Systems

セッション: Privacy and Security in Software Development

Area 1 + 2 + 3: theatre
7 件の発表
2026-04-16 18:00:00
2026-04-16 19:30:00