Detect plagiarised and similar code across trillions of code sources on the web See what's new

Code Intelligence Hub

Expert insights on AI code detection and academic integrity

AI-Generated Code Detection: The New Frontier in Academic Integrity

Featured

AI-Generated Code Detection: The New Frontier in Academic Integrity

As AI coding assistants become ubiquitous, learn how institutions are adapting to detect AI-generated code and maintain educational standards.

Codequiry Editorial Team · Jan 5, 2026
Read More →

Latest Articles

Stay ahead with expert analysis and practical guides

Automating Code Plagiarism Detection in Your Grading Workflow General 8 min
Emily Watson · 4 days ago

Automating Code Plagiarism Detection in Your Grading Workflow

A practical walkthrough for CS instructors who want to wire code similarity checks directly into their grading workflow. Covers tooling choices, LMS integration, and how to layer in web-source and AI-generated code detection for a complete academic integrity pipeline.

What 4,200 Python Submissions Tell Us About Code Reuse General 7 min
Alex Petrov · 6 days ago

What 4,200 Python Submissions Tell Us About Code Reuse

By aggregating similarity scores across 4,200 student Python submissions over three semesters, we uncovered distinct copy-paste behaviors tied to assignment type, submission deadline, and language features. This practical guide walks through the exact process of running a large-scale code reuse audit using Codequiry’s output and Python data analysis, then shows how to turn those numbers into actionable course design decisions.

K-gram Fingerprinting for Source Code Similarity Analysis General 9 min
Emily Watson · 1 week ago

K-gram Fingerprinting for Source Code Similarity Analysis

K-gram fingerprinting is the backbone of modern code plagiarism detection. This step-by-step guide walks through tokenization, k-gram generation, hashing, winnowing, and comparison — the exact pipeline used by MOSS and Codequiry. Includes Python code examples, algorithmic tradeoffs, and real-world scaling numbers.

Automated Code Similarity Checks in a CI Lab Pipeline General 7 min
Alex Petrov · 1 week ago

Automated Code Similarity Checks in a CI Lab Pipeline

Setting up automated code plagiarism and similarity checks inside a CI pipeline cuts manual grading time and catches copying that individual reviewers miss. This practical guide walks through the architecture, tooling choices, and honest tradeoffs of running MOSS, JPlag, or Codequiry’s API on every lab push.

How One Bootcamp Built a Code Originality Pipeline General 9 min
Emily Watson · 1 week ago

How One Bootcamp Built a Code Originality Pipeline

When CareerDevs Academy scaled from 30 to 200 students per cohort, their manual code review process couldn't keep up with plagiarism and improper code reuse. Here's how they built a tiered originality pipeline combining static analysis, similarity detection, and educational intervention — and what other programs can learn from their approach.

How Automatic Grading Evolved From Scripts to Integrity Pipelines General 9 min
Alex Petrov · 1 month ago

How Automatic Grading Evolved From Scripts to Integrity Pipelines

A retrospective on automatic grading in computer science education—from shell scripts comparing output strings to modern platforms combining unit tests, static analysis, and code similarity detection. What we gained, what we lost, and why integrity pipelines matter more than ever.

Building a Source Code Provenance Pipeline for Contractor Deliverables General 10 min
Alex Petrov · 1 month ago

Building a Source Code Provenance Pipeline for Contractor Deliverables

When contractors deliver source code, verifying originality and license compliance is critical. This guide walks through building an automated provenance pipeline that checks for code similarity, license violations, and proper attribution before accepting deliverables into your codebase.

What Code Similarity Metrics Actually Measure in Student Work General 9 min
David Kim · 1 month ago

What Code Similarity Metrics Actually Measure in Student Work

Not all code similarity is plagiarism, and not all plagiarism is caught by string matching. This article breaks down the three major detection techniques—AST comparison, token-based analysis, and algorithmic fingerprinting—and explains what each one actually reveals about student submissions.

How to Build a Source Code Similarity Pipeline for Detection General 12 min
Alex Petrov · 1 month ago

How to Build a Source Code Similarity Pipeline for Detection

A step-by-step guide to building a source code similarity detection pipeline from scratch. Covers tokenization, AST comparison, Winnowing fingerprinting, and heuristic scoring. Includes working Python code and configuration strategies used by universities and enterprises.

What Pair Programming Looks Like in a Plagiarism Detector General 8 min
Marcus Rodriguez · 1 month ago

What Pair Programming Looks Like in a Plagiarism Detector

Pair programming and plagiarism can look identical to automated detectors. This article explains the technical signals that distinguish collaborative work from unauthorized code sharing, and how educators can design assignments and detection workflows that respect both academic integrity and modern development practices.

Teaching Students to Write Attribution Comments in Group Work General 10 min
David Kim · 1 month ago

Teaching Students to Write Attribution Comments in Group Work

Attribution comments are a simple but powerful tool for teaching code integrity in collaborative programming projects. This article explains how to implement them effectively, what to include, and how they transform group work from a plagiarism minefield into a learning opportunity.

From Paper Traces to Abstract Syntax Trees: Code Similarity Then and Now General 9 min
Rachel Foster · 1 month ago

From Paper Traces to Abstract Syntax Trees: Code Similarity Then and Now

The history of code similarity detection is a story of escalating arms races. What started with professors reading printouts has evolved through Unix diffs, token-based fingerprinting, and into modern abstract syntax tree analysis. This retrospective traces the key technical shifts that shaped how we detect code plagiarism in programming courses today.