Build Psychometric Scales
From Construct to Pilot
18 development phases. 17 AI actions. Deterministic EFA/CFA fit verifier. Built on COSMIN, DeVellis 8-step, and AERA/APA Standards.
AI actions — largest surface across RSMinds
workflow phases — construct to pilot-ready
reporting families integrated · COSMIN + AERA/APA
starts here · 7-day money back
How ScaleMinds works
The 18-phase workflow
Six phases — Conceptualization, Theory & Blueprint, Item Development, Scoring & Validation, Pilot Package, Output. Each step feeds the next: your construct shapes domains; domains shape items; items shape the CFA model.
A · Conceptualization
Idea & Construct Definition
Inputs
AI action
Output
Best-fit scale type (Likert, VAS, semantic differential…) + construct boundary draft.
A · Conceptualization
Literature & Gap Analysis
Inputs
AI action
Output
Existing instruments mapped, adaptation pathway flagged, justification for new scale.
A · Conceptualization
Operational Definition
Inputs
AI action
Output
Measurable, behaviour-anchored definition suitable for item generation.
A · Conceptualization
Domain & Subdomain Mapping
Inputs
AI action
Output
Dimensions, subdomains, and reflective vs formative model decision.
A · Conceptualization
Research Question & FINER
Inputs
AI action
Output
Question drafts + FINER scoring (Feasibility, Interest, Novelty, Ethics, Relevance).
B · Theory & Blueprint
Theory & Conceptual Framework
Inputs
AI action
Output
Theory candidates with citations + framework diagram tying construct to domains.
B · Theory & Blueprint
Blueprint & Specification Table
Inputs
AI action
Output
Items-per-domain target, coverage balance, positive/reverse split spec.
C · Item Development
Item Generation (Overinclusive Pool)
Inputs
AI action
Output
Overinclusive item pool — deductive + inductive items, keying tagged, reading level checked.
C · Item Development
Item Quality Guardian (15-pt Audit)
Inputs
AI action
Output
Per-item flags: double-barrelled, negation, jargon, social desirability, construct drift.
C · Item Development
Item Refinement
Inputs
AI action
Output
Cleaned item set with rewrites, traceable to original phrasing for audit.
C · Item Development
Response Format & Anchors
Inputs
AI action
Output
Recommended scale points (5/7/VAS) + anchor wording with cross-cultural notes.
D · Scoring & Validation
Scoring Model
Inputs
AI action
Output
Total + subscale scoring rules, reverse-key handling, missing-data policy.
D · Scoring & Validation
Expert Panel (CVI / CVR)
Inputs
AI action
Output
Expert profile spec + S-CVI / I-CVI / CVR rating form ready to send.
D · Scoring & Validation
Cognitive Interview Protocol
Inputs
AI action
Output
Think-aloud + verbal-probe protocol surfacing comprehension failures before fielding.
E · Pilot Package
Pilot Study Design
Inputs
AI action
Output
Sample size for EFA/CFA, recruitment plan, data-collection protocol.
E · Pilot Package
Instrument Assembly
Inputs
AI action
Output
Formatted draft instrument — instructions, items, response options, scoring key.
E · Pilot Package
Planned Analysis Protocol
Inputs
AI action
Output
Pre-specified item statistics, internal consistency, EFA/CFA decision tree with thresholds.
F · Output
Creation Synopsis
Inputs
AI action
Output
COSMIN-aligned development synopsis exportable as DOCX, PDF, or Markdown.
Scale-type coverage
14 scale types — all covered
From classic Likert and visual analogue scales to modern IRT and computer-adaptive testing. Each scale type gets its own item guidance, anchor logic, and analysis-plan defaults.
Rating
4 typesLikert (5 / 7 / 9 pt)
Classic ordered agreement scale.
Semantic Differential
Bipolar adjective anchors.
Numerical Rating
0–10 magnitude judgements.
Stapel Scale
Unipolar single-adjective rating.
Continuous
3 typesVisual Analogue (VAS)
100mm line, fine-grained.
Graphic Rating
Illustrated anchors for clarity.
Slider Scale
Digital VAS with snap option.
Comparative
3 typesThurstone Equal-Appearing
Pre-scaled judge-rated items.
Guttman Cumulative
Hierarchical, unidimensional.
Paired Comparison
Forced choice between options.
Modern Test Theory
4 typesRasch (1-PL)
Item difficulty, person ability.
IRT (2-PL / 3-PL)
Discrimination + guessing parameters.
Multi-dimensional IRT
Several latent traits jointly.
Computer-Adaptive (CAT)
Item-bank-driven testing.
Compliance
Built on the standards
your reviewers expect
Your instrument is scored against every applicable standard in real time.
Why this matters
Factor structure,
but with a second opinion
Other tools
Suggest a factor model. That's it.
If the AI hallucinates — wrong number of factors, mis-loaded items, or a CFA model that won't actually converge — you don't know until your psychometrician opens the lavaan output and the χ² is on fire.
ScaleMinds
AI proposes. Verifier checks the fit.
A deterministic verifier re-computes CFI, TLI, RMSEA, and SRMR from the AI's proposed factor model and checks each against the accepted thresholds. If any index flunks, we flag it before you commit pilot data.
- Deterministic, not stochastic — same model, same indices.
- Hu & Bentler 1999 thresholds applied per index.
- Disagreement flagged with respecification suggestions.
| Fit index | AI estimate | Verifier | Threshold | Verdict |
|---|---|---|---|---|
| CFI | 0.951 | 0.948 | ≥ 0.95 | Pass |
| TLI | 0.942 | 0.939 | ≥ 0.95 | Flag |
| RMSEA | 0.058 | 0.061 | ≤ 0.08 | Pass |
| SRMR | 0.046 | 0.044 | ≤ 0.08 | Pass |
Respecification suggestion
TLI = 0.942 is below the 0.95 threshold. Consider correlating residuals on items 7–9 (same domain, parallel wording) or revisiting cross-loadings before pilot launch.
Plans
Simple pricing
Access 1m
15,000 Mindful AI Tokens
- All 14 scale types
- 18-phase workflow
- COSMIN-aligned exports
Access 2m
15,000 Mindful AI Tokens / month
- Everything in 1m
- Priority AI throughput
- Save ₹99 vs monthly
Access 3m
15,000 Mindful AI Tokens / month
- Everything in 2m
- Quarterly project cadence
- Save ₹198 vs monthly
FAQ
Frequently asked questions
Common questions from scale developers, psychometricians, and PROM researchers.
What sample size do I need for EFA / CFA?
ScaleMinds applies established rules of thumb (10 participants per item, minimum 200 for EFA, 300+ for CFA) and adjusts based on factor structure complexity, expected loadings, and communalities. The deterministic verifier cross-checks the recommendation against MacCallum, Widaman, Zhang & Hong (1999) guidance for your specific design.
How many items should I write per domain?
The blueprint phase (step 7) generates 2–3x the final target — typically 8–15 items per domain for a 4-item final subscale. Overinclusion is intentional so the expert panel (CVI) and pilot EFA can trim weak items without leaving the domain under-represented.
Can I import existing items from another scale?
Yes. Paste the source instrument and ScaleMinds extracts items, response anchors, and scoring rules. The item quality audit (step 9) then re-evaluates them against COSMIN criteria so you can decide what to keep, adapt, or replace.
How do I handle translation and cross-cultural adaptation?
ITC Guidelines for Translating and Adapting Tests are integrated. The cognitive interview protocol (step 14) generates language-specific probes, and the analysis plan includes measurement invariance testing (configural / metric / scalar) for multi-language pilots.
What about test-retest reliability planning?
The pilot study design (step 15) includes test-retest sample size recommendations and the analysis protocol (step 17) pre-specifies ICC(2,1) for continuous scores and weighted kappa for ordinal scoring, with intervals tied to construct stability assumptions.
Will my expert panel data work with this?
Yes. Step 13 generates the rating form your experts complete (Lynn 1986 4-point CVI rubric by default). Paste back their ratings and the platform computes I-CVI per item, S-CVI/Ave, and S-CVI/UA so you can defend item retention decisions to reviewers.
Does ScaleMinds run the EFA / CFA itself?
No — analysis runs on real pilot data using R lavaan, psych, or your preferred software. ScaleMinds pre-specifies the model, fit thresholds (CFI ≥ 0.95, RMSEA ≤ 0.08, SRMR ≤ 0.08), and the deterministic verifier checks the proposed model is identifiable before you collect data.
Can I use this for a PROM (patient-reported outcome measure)?
Yes. FDA PRO Guidance 2009 and ISPOR Good Research Practices are integrated. The workflow covers conceptual model, content validity, recall period, and qualitative concept elicitation steps required for regulatory PRO submission.
How is this different from just asking ChatGPT?
COSMIN-anchored prompts, deterministic CFA fit verifier, expert panel form generation, item quality audit against a 15-point rubric, and parallel per-domain item-pool generation. Plus the synopsis is structured for journal publication, not chat replies.
Can I cancel anytime?
Yes — cancel from Settings. Access continues to the end of the paid period. Your instrument exports remain yours. 7-day money-back guarantee on every plan.
Build your scale in a weekend, not a semester.
1 free AI call. No credit card. Sign in with Google.