Institutional Research Information Service
Publication Detail
Python Coding Style Compliance on Stack Overflow
  • Authors:
    Bafatakis N, Boecker N, Boon W, Cabello Salazar M, Krinke J, Oznacar G, White R
  • Pagination:
    210-214
  • Published proceedings:
    Proceedings of the 2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR)
  • Name of conference:
    2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR)
  • Conference place:
    Montreal (QC), Canada
© 2019 IEEE. Software developers all over the world use Stack Overflow (SO) to interact and exchange code snippets. Researchers also harvest code snippets from SO for use in recommendation systems. However, previous work has shown that code on SO may have quality issues, such as security or licensing problems. We analyse Python code on SO to determine its coding style compliance. From 1,962,535 code snippets tagged with 'python', we extracted 407,097 snippets containing at least 6 statements of Python code. Surprisingly, 93.87% of the extracted snippets contain style violations, with an average of 0.7 violations per statement and a large number of snippets with a considerably higher ratio. Researchers and developers should therefore be aware that code snippets on SO may not be representative of good coding style. Furthermore, while user reputation seems to be unrelated to coding style compliance, for posts with vote scores between -10 and 20 we found a strong negative correlation (r = -0.87, p < 10^-7) between the vote score a post received and the average number of violations per statement for the snippets in such posts.
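The two quantities the abstract reports, violations per statement and the Pearson correlation r, can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract does not name the style checker or define how statements were counted, so `count_violations` is a toy stand-in (overlong lines and trailing whitespace only), `count_statements` assumes that every `ast.stmt` node counts as a statement, and all function names are hypothetical.

```python
import ast
import math
from statistics import mean


def count_statements(source: str) -> int:
    """Count statement nodes (including nested ones); 0 if the snippet does not parse."""
    try:
        tree = ast.parse(source)
    except SyntaxError:
        return 0
    return sum(isinstance(node, ast.stmt) for node in ast.walk(tree))


def count_violations(source: str, max_len: int = 79) -> int:
    """Toy stand-in for a real style checker (assumption, not the paper's tool)."""
    violations = 0
    for line in source.splitlines():
        if len(line) > max_len:       # line too long
            violations += 1
        if line != line.rstrip():     # trailing whitespace
            violations += 1
    return violations


def violations_per_statement(source: str, min_statements: int = 6):
    """Return violations divided by statements, or None for snippets below the size cut-off."""
    n = count_statements(source)
    if n < min_statements:
        return None
    return count_violations(source) / n


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Six assignments, one of them with trailing whitespace.
snippet = "a = 1\nb = 2 \nc = 3\nd = 4\ne = 5\nf = a + b\n"
ratio = violations_per_statement(snippet)
```

To mirror the reported analysis, one would group snippets by the vote score of their post, average the per-snippet ratio within each group, and pass the scores and averages to `pearson_r`. A negative result would mean that higher-voted posts tend to have fewer violations per statement.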
UCL Researchers
Dept of Computer Science