Data Clinical Decompose Tested

Statistical Research Pipeline

Multi-skill Claude plugin for classical statistical research workflows: outcome-type detection, assumption checking, primary tests, effect sizes, extended modeling, SEM, manuscript drafting, and LaTeX assembly. 30 sub-skills covering 14 outcome / model families.

Source VeraSuperHub/stat-research-pipeline
GitHub stars ★ 0
License Open source
Bundled skills 30
Last updated May 4, 2026

This plugin is the classical-statistics half of the human–AI research lab. It decomposes the statistical-research execution layer — outcome-type detection, assumption checking, test selection, effect-size estimation, extended modeling, SEM, manuscript drafting — into thirty sub-skills covering fourteen outcome / model families. It is, to my knowledge, the most comprehensive open-source statistical-research pipeline currently published as installable Claude skills.

Why thirty sub-skills, not three

Real statistical research is not one workflow. The right test for a binary outcome is not the right test for survival data, which is not the right test for repeated measures, which is not the right test for a designed experiment, which is not the right test for a meta-analysis. Each outcome family has its own assumption checks, its own primary tests, its own effect-size conventions, its own extended-modeling battery.

Most “AI for statistics” tools collapse this distinction. They run regression on whatever you point at them. The result is a sentence in your manuscript that survives peer review until a thoughtful reviewer asks, “why did you use a Poisson model on your zero-inflated counts?” This plugin is structured against that failure mode: outcome type is detected first, the testing skill matched to that type is run, and the analyzing skill for that type extends the analysis with the model battery the field expects.

What’s covered

  • 11 outcome families for classical hypothesis testing: continuous, binary, ordinal, nominal, count, survival, repeated measures, time series, multivariate, designed experiments, meta-analysis
  • 3 SEM families for latent-variable modeling: confirmatory factor analysis (CFA), full structural equation models, growth and change models (LGCM, latent change scores)
  • 2 orchestration pipelines: application papers (data → manuscript) and methodology papers (idea → benchmark → manuscript)

Each outcome family has paired testing and analyzing sub-skills. Testing handles assumption checks + primary tests + effect sizes (steps 01–03 of a typical analysis). Analyzing continues from where testing stopped, running the extended model battery + interpretability + manuscript-section drafting (steps 04–08).

Who it’s for

  • Biostatisticians in clinical research, pharmacovigilance, or real-world evidence work — every outcome family in this plugin maps to standard regulatory analysis types
  • Clinical scientists and trial designers who run their own statistical analyses or need to validate a CRO’s
  • Quantitative psychologists, epidemiologists, public-health scientists working with longitudinal cohorts, repeated measures, latent-variable models
  • Computational biologists and behavior-research PhDs writing analysis-heavy methods sections
  • Career-switchers entering biostatistics from adjacent fields, who need an installable reference for what good practice looks like across the full inventory of methods

What this plugin will not do

The same boundary applies as the AI plugin: it structures execution, not judgment. It cannot decide which assumption violations matter for your specific scientific context, whether a marginal effect is substantively meaningful, whether your sample size justifies the model you’re fitting, or whether the analysis you’ve completed is the analysis the journal will accept. Those are judgment calls. They stay yours.

The plugin will, however, surface the diagnostic that should make you pause — a Box’s M test that fails, a Hosmer-Lemeshow with a low p-value, a failed proportional-hazards check. It puts the question in front of you. You decide what to do.

How to install

Clone the repo from VeraSuperHub/stat-research-pipeline and copy the vera-stat-research-skillset/ directory contents into your ~/.claude/skills/ folder. Or use the bundled vera-stat-research.plugin file if you’re managing skills through a plugin manager.

Pair with the AI Research Pipeline

This plugin and the AI Research Pipeline are designed to be installed together. Same interface conventions, same manuscript-output formats, complementary methodological territory. The classical-statistics plugin is the right tool when you have a hypothesis test or estimation problem; the AI plugin is the right tool when you have a prediction or modeling problem. Most papers in the life sciences end up using both — and having both available as installable plugins means the workflow doesn’t context-switch in the middle of a project.

Bundled sub-skills (30)

Testing — diagnostics + primary tests

vera-stat-continuous-testing

Continuous outcomes — Welch's t, ANOVA, Mann-Whitney U, Kruskal-Wallis. Effect sizes: Cohen's d, eta-squared.

vera-stat-binary-testing

Binary outcomes — Chi-square, Fisher's exact. Effect sizes: Cramer's V, odds ratio.

vera-stat-ordinal-testing

Ordinal outcomes — Mann-Whitney U, Kruskal-Wallis, Jonckheere-Terpstra. Effect sizes: rank-biserial r, Cliff's delta, epsilon-squared.

vera-stat-nominal-testing

Nominal outcomes — Chi-square, Fisher's exact, one-way ANOVA. Effect sizes: Cramer's V, eta-squared.

vera-stat-count-testing

Count outcomes — Poisson / NB rate test, overdispersion check, zero-inflation detection. Effect size: IRR with 95% CI.

vera-stat-survival-testing

Survival outcomes — Kaplan-Meier, log-rank, univariate Cox. Effect size: HR with 95% CI.

vera-stat-repeated-testing

Repeated measures — paired t, RM-ANOVA, mixed ANOVA, Friedman. Effect sizes: Cohen's d (paired), partial eta-squared, ICC.

vera-stat-timeseries-testing

Time-series outcomes — ADF, KPSS, Ljung-Box, ACF/PACF, auto-ARIMA. Performance: RMSE, MAE, MAPE.

vera-stat-multivariate-testing

Multivariate outcomes — MANOVA (all 4 statistics), Hotelling's T-squared, Box's M. Effect size: partial eta-squared per DV.

vera-stat-doe-testing

Designed experiments — factorial ANOVA, interaction tests, Levene's, Shapiro-Wilk on residuals. Effect size: partial eta-squared.

vera-stat-meta-testing

Meta-analysis — fixed-effects, random-effects, heterogeneity (Q, I-squared, tau-squared). Effect size: pooled ES with prediction interval.

vera-sem-cfa-testing

Confirmatory factor analysis — measurement model, factor loadings, identification checks. Fit indices: chi-square, CFI, TLI, RMSEA, SRMR.

vera-sem-longchange-testing

Growth and change models — latent growth curves, latent change score models. Fit indices: chi-square, CFI, TLI, RMSEA, SRMR.

vera-sem-full-testing

Full structural equation models — structural paths, mediation labels, R-squared for endogenous variables. Fit indices: chi-square, CFI, TLI, RMSEA, SRMR.

Analyzing — full analysis + manuscript sections

vera-stat-continuous-analyzing

OLS with diagnostics, quantile regression (25th/50th/75th), subgroup forest plots. Tree-based: CART, RF (500 trees), LightGBM.

vera-stat-binary-analyzing

Logistic regression (OR + 95% CI), Hosmer-Lemeshow, pseudo-R-squared (McFadden + Nagelkerke), ROC/AUC. Tree-based: CART, RF, LightGBM.

vera-stat-ordinal-analyzing

Proportional odds, adjacent-category, continuation-ratio, stereotype, multinomial logistic (dual-path). Tree-based: CART, RF, LightGBM.

vera-stat-nominal-analyzing

Multinomial logistic (RRR), LDA (leave-one-out), confusion matrices. Tree-based: CART, RF, LightGBM.

vera-stat-count-analyzing

Poisson, NB, ZIP, ZINB, Hurdle, Vuong test, AIC comparison. Tree-based: CART, RF, LightGBM.

vera-stat-survival-analyzing

Cox PH (Schoenfeld diagnostics), time-varying Cox, AFT (Weibull / lognormal / loglogistic), recurring events (AG, PWP, Frailty). Tree-based: RSF (500 trees), gradient boosting survival.

vera-stat-repeated-analyzing

Linear mixed models (REML, random intercept + slope), GEE (exchangeable / AR1 / unstructured), growth curves. Tree-based: RF, LightGBM on subject-level features.

vera-stat-timeseries-analyzing

SARIMA, ETS / Holt-Winters, GARCH, VAR, spectral analysis, regression with ARIMA errors. Tree-based: RF, LightGBM on lagged features.

vera-stat-multivariate-analyzing

Discriminant analysis, MANCOVA, canonical correlation, PCA, multivariate regression, profile analysis. Tree-based: RF, LightGBM importance across DVs.

vera-stat-doe-analyzing

Response surface methodology (1st / 2nd order), contour and surface plots, canonical analysis, desirability functions. Tree-based: RF, LightGBM.

vera-stat-meta-analyzing

Egger's / Begg's / trim-and-fill, leave-one-out, meta-regression, Bayesian, three-level models.

vera-sem-cfa-analyzing

CFA reliability (omega, CR, AVE), convergent and discriminant validity (Fornell-Larcker, HTMT), measurement invariance.

vera-sem-full-analyzing

Alternative structural models, bootstrap indirect effects (5000 draws), multigroup SEM.

vera-sem-longchange-analyzing

Latent basis, quadratic growth, latent change score, parallel process, multigroup trajectories.

Pipelines — workflow orchestration

vera-stat-application-pipeline

Application paper orchestration: research question + dataset → outcome detection → literature review → parallel analysis → assembled Markdown + LaTeX manuscript draft.

vera-stat-methodology-pipeline

Methodology paper orchestration: research direction → idea discovery → simulation studies → proofs → external review → review-ready manuscript draft.