GitHub - Imbad0202/academic-research-skills: Academic Research Skills for Claude Code: research → write → review → revise → finalize

G github.com ↗

▲ 82 points • 25 comments • by arnon • 2w ago • HN discussion ↗

Pangram verdict · v3.3

We believe that this document is a mix of AI-generated, and human-written content

50 %

AI likelihood · overall

Mixed

64% human-written 36% AI-generated

SEGMENTS · HUMAN 0 of 6

SEGMENTS · AI 3 of 6

WORD COUNT 1,657

PEAK AI % 100% · §3

Analyzed

May 10

backend: pangram/v3.3

Segments scanned

6 windows

avg 276 words each

Distribution

64 / 36%

human / AI fraction

Verdict

Mixed

Pangram v3.3

Article text · 1,657 words · 6 segments analyzed

Human AI-generated

§1 Mixed · 42%

繁體中文版 A comprehensive suite of Claude Code skills for academic research, covering the full pipeline from research to publication. Install in 30 seconds (Claude Code CLI / VS Code / JetBrains, v3.7.0+): /plugin marketplace add Imbad0202/academic-research-skills /plugin install academic-research-skills Then try /ars-plan to walk through your paper structure via Socratic dialogue, or jump to Quick install for prerequisites and the traditional symlink flow. AI is your copilot, not the pilot. This tool won't write your paper for you. It handles the grunt work — hunting down references, formatting citations, verifying data, checking logical consistency — so you can focus on the parts that actually require your brain: defining the question, choosing the method, interpreting what the data means, and writing the sentence after "I argue that." Unlike a humanizer, this tool doesn't help you hide the fact that you used AI. It helps you write better. Style Calibration learns your voice from past work. Writing Quality Check catches the patterns that make prose feel machine-generated. The goal is quality, not cheating. Why human-in-the-loop, not full automation? Lu et al. (2026, Nature 651:914-919) built The AI Scientist — the first fully autonomous AI research system to publish a paper through blind peer review at a top-tier ML venue (ICLR 2025 workshop, score 6.33/10 vs workshop average 4.87). Their Limitations section enumerates the failure modes that any fully-autonomous AI research pipeline inherits: implementation bugs, hallucinated results, shortcut reliance, bug-as-insight reframing, methodology fabrication, frame-lock, citation hallucinations. ARS is built on the premise that a human researcher augmented by AI avoids these failure modes better than either alone. Stage 2.5 and Stage 4.5 integrity gates run a 7-mode blocking checklist (see academic-pipeline/references/ai_research_failure_modes.md); the reviewer offers an opt-in calibration mode that measures its own FNR/FPR against a user-supplied gold set.

§2 Mixed · 39%

v3.3 was inspired by PaperOrchestra (Song, Song, Pfister & Yoon, 2026, Google): Semantic Scholar API verification, anti-leakage protocol, VLM figure verification, and score trajectory tracking. Architecture & pipeline 👉 docs/ARCHITECTURE.md — the full pipeline view: flow diagram, stage-by-stage matrix, data-access flow, skill dependency graph, quality gates, and mode list. The architecture doc supersedes the sprawling pipeline description that used to live here. Everything about what runs in which stage now lives in one place. Quick install Prerequisites Claude Code (latest; plugin packaging requires recent versions) ANTHROPIC_API_KEY exported, or set on first claude run Optional: Pandoc for DOCX, tectonic + Source Han Serif TC for APA 7.0 PDF (Markdown output works without either) Plugin install (v3.7.0+, recommended): /plugin marketplace add Imbad0202/academic-research-skills /plugin install academic-research-skills Verify it works: run /ars-plan and describe a paper you're working on — ARS will start a Socratic dialogue to map out chapter structure. For a single-shot test instead, try /ars-lit-review "your topic". 👉 docs/SETUP.md — full guide: install Claude Code, set up API keys, optional Pandoc/tectonic for DOCX/PDF, cross-model verification (ARS_CROSS_MODEL), and five installation methods (Plugin, project skills, global skills, claude.ai Project, repo-cloned). Performance & cost 👉 docs/PERFORMANCE.md — per-mode token budgets, full-pipeline estimate (~$4–6 for a 15k-word paper), and recommended Claude Code settings (Skip Permissions; Agent Team optional). Guides & articles Academic Writing Shouldn't Be a Solo Act — full pipeline walkthrough (English) 學術寫作不該是一個人的事：一套開源 AI 協作工具如何改變研究者的工作流 — 完整使用指南（繁體中文） Features at a glance Deep Research — 13-agent research team with Socratic guided mode, PRISMA systematic review, intent detection, dialogue health monitoring, optional cross-model DA, Semantic Scholar API verification.

§3 AI · 100%

Academic Paper — 12-agent paper writing with Style Calibration, Writing Quality Check, LaTeX hardening, visualization, revision coaching, citation conversion, anti-leakage protocol, and VLM figure verification. Academic Paper Reviewer — 7-agent multi-perspective peer review with 0–100 quality rubrics (EIC + 3 dynamic reviewers + Devil's Advocate), concession threshold protocol, attack intensity preservation, optional cross-model DA critique / calibration, R&R traceability matrix, read-only constraint. Academic Pipeline — 10-stage pipeline orchestrator with adaptive checkpoints, claim verification, Material Passport, optional repro_lock, optional cross-model integrity verification, mid-conversation reinforcement, and score trajectory tracking. Data Access Level Metadata (v3.3.2+) — every skill declares data_access_level (raw / redacted / verified_only); enforced by scripts/check_data_access_level.py. Pattern adapted from Anthropic's automated-w2s-researcher (2026). See shared/ground_truth_isolation_pattern.md. Task Type Annotation (v3.3.2+) — every skill declares task_type (open-ended or outcome-gradable). All current ARS skills are open-ended. Benchmark Report Schema (v3.3.5+) — JSON Schema + lint for honest benchmark comparisons. See shared/benchmark_report_pattern.md. Artifact Reproducibility Lockfile (v3.3.5+) — optional repro_lock sub-block on Material Passport. Configuration documentation, not replay guarantee — LLM outputs are not byte-reproducible. See shared/artifact_reproducibility_pattern.md. Showcase: real pipeline output See the complete artifacts from a real 10-stage pipeline run — peer review reports, integrity verification reports, and the final paper: Browse all pipeline artifacts → Artifact Description Final Paper (EN) APA 7.0 formatted, LaTeX-compiled Final Paper (ZH) Chinese version, APA 7.0 Integrity Report — Pre-Review Stage 2.5: caught 15 fabricated refs + 3 statistical errors Integrity Report — Final Stage 4.5: zero regressions confirmed Peer Review Round 1 EIC

§4 AI · 100%

+ 3 Reviewers + Devil's Advocate Re-Review Verification after revisions Peer Review Round 2 Follow-up review Response to Reviewers Point-by-point author response Post-Publication Audit Report Independent full-reference audit: found 21/68 issues missed by 3 rounds of integrity checks Companion: Experiment Agent If your research involves running experiments (code or human studies) before writing, the Experiment Agent skill fills the gap between ARS Stage 1 (RESEARCH) and Stage 2 (WRITE). ARS Stage 1 RESEARCH → RQ Brief + Methodology Blueprint ↓ experiment-agent → run/manage experiments → validate results ↓ ARS Stage 2 WRITE → write paper with verified experiment results What it does: executes code experiments (Python, R, etc.) with real-time monitoring, manages human study protocols with IRB ethics checklist, interprets statistics with 11-type fallacy detection, and verifies reproducibility. How to use together: pause the ARS pipeline after Stage 1, run experiments in a separate experiment-agent session, then bring the results (with Material Passport) back to ARS Stage 2. ARS requires zero modification. See the experiment-agent README for setup instructions. Usage Quick Start # Start a full research pipeline You: "I want to write a research paper on AI's impact on higher education QA" # Start with Socratic guidance You: "Guide my research on AI in educational evaluation" # Write a paper with guided planning You: "Guide me through writing a paper on demographic decline" # Review an existing paper You: "Review this paper" (then provide the paper) # Check pipeline status You: "status" Individual Skills Deep Research (7 modes) "Research the impact of AI on higher education" → full mode "Give me

§5 Mixed · 44%

a quick brief on X" → quick mode "Do a systematic review on X with PRISMA" → systematic-review mode "Guide my research on X" → socratic mode (guided) "Fact-check these claims" → fact-check mode "Do a literature review on X" → lit-review mode "Review this paper's research quality" → review mode Academic Paper (10 modes) "Write a paper on X" → full mode "Guide me through writing a paper" → plan mode (guided) "Build a paper outline" → outline-only mode "I have a draft, here are reviewer comments" → revision mode "Parse these reviewer comments into a roadmap" → revision-coach mode "Write an abstract for this paper" → abstract-only mode "Turn this into a literature review paper" → lit-review mode "Convert to LaTeX" / "Convert citations to IEEE" → format-convert mode "Check citations" → citation-check mode "Generate an AI disclosure statement for NeurIPS" → disclosure mode Academic Paper Reviewer (6 modes) "Review this paper" → full mode (EIC + R1/R2/R3 + Devil's Advocate) "Quick assessment of this paper" → quick mode "Guide me to improve this paper" → guided mode "Check the methodology" → methodology-focus mode "Verify the revisions" → re-review mode "Calibrate this reviewer against my gold set" → calibration mode Academic Pipeline (Orchestrator) "I want to write a complete research paper" → full pipeline from Stage 1 "I already have a paper, review it" → mid-entry at Stage 2.5 (integrity first) "I received reviewer comments" → mid-entry at Stage 4 Pipeline ends with Stage 6: Process Summary — auto-generates a paper creation process record with 6-dimension Collaboration Quality Evaluation (1–100 scoring). Supported Languages Traditional Chinese (繁體中文) — default when user writes in Chinese English — default when user writes in English Bilingual abstracts (Chinese + English) for academic papers Using a different language? Socratic mode (deep-research) and Plan mode (academic-paper) use intent-based activation — they detect the meaning of your request, not specific keywords. This means they work in any language without modification.

§6 AI · 100%

However, the general Trigger Keywords section (which determines whether the skill is activated at all) still lists English and Traditional Chinese keywords. If you find the skill isn't activating reliably in your language, you can add your language's keywords to the ### Trigger Keywords section in each SKILL.md file to improve matching confidence. Supported Citation Formats APA 7.0 (default, including Chinese citation rules) Chicago (Notes & Author-Date) MLA IEEE Vancouver Supported Paper Structures IMRaD (empirical research) Thematic Literature Review Theoretical Analysis Case Study Policy Brief Conference Paper Skill Details Per-agent responsibilities and per-stage artifacts now live in docs/ARCHITECTURE.md. Version numbers are anchored here so release metadata stays in one place. Deep Research (v2.8) 13-agent research team. Modes: full, quick, review, lit-review, fact-check, socratic, systematic-review. Full agent roster and artifacts: see ARCHITECTURE.md §3. Academic Paper (v3.0) 12-agent paper writing pipeline. Modes: full, plan, outline-only, revision, revision-coach, abstract-only, lit-review, format-convert, citation-check, disclosure. Output: MD + DOCX (via Pandoc when available) + LaTeX (APA 7.0 apa7 class / IEEE / Chicago) → PDF via tectonic. Full agent roster and per-phase responsibilities: see ARCHITECTURE.md §3. Academic Paper Reviewer (v1.8) 7-agent multi-perspective review with 0-100 quality rubrics. Modes: full, re-review, quick, methodology-focus, guided, calibration. Decision mapping: ≥80 Accept, 65-79 Minor Revision, 50-64 Major Revision, <50 Reject. First-round review team vs. narrow re-review team boundary: see ARCHITECTURE.md §3 Stage 3 / Stage 3'. Academic Pipeline (v3.7) 10-stage orchestrator with integrity verification, two-stage review, Socratic coaching, and collaboration evaluation. Pipeline guarantees: every stage requires user confirmation checkpoint; integrity verification (Stage 2.5 + 4.5) cannot be skipped; R&R Traceability Matrix (Schema 11) independently verifies author revision claims.