AI-native RNA pocket discovery · structural ranking
Candidate druggable pockets on RNA targets, from sequence
We identify candidate druggable cleft pockets on RNA targets from sequence — calibrated for small-molecule drug discovery. Predicted 3D structure, conformational ensemble and a ranked top-3 shortlist, with honest scope. EU-based, GDPR-compliant.
// What we built
Why ensemble ranking matters for RNA pocket discovery
The field has converged on the answer that standard cavity detection over-predicts on RNA — fpocket’s default parameters mistake the polar grooves of duplex RNA for binding pockets. The Weeks lab formalised this as an RNA-tuned wrapper, fpocketR (Veenbaas et al., PNAS 2025), and we use the same RNA-tuned parameters. Detection alone, however, is not where customer recovery happens.
On the same seven cleft-binder targets, the single-frame fpocketR-style detection (and ours — the two are empirically equivalent) leaves the rank-1 pocket at the experimental binding site in only 2 of 7 cases at near-recovery and 0 of 7 at strict. The v0.2 contribution is what we add on top: a five-frame ANM conformational ensemble, cross-frame pocket clustering, and a cluster ranker based on persistence × binding-residue stability. Both ranker features are RNA-applicable by construction.
Rank-1 recovery on 7 cleft-binder targets3 / 7 strict· 6 / 7 near — with our ensemble + geometric ranker0 / 7 strict· 2 / 7 near — with fpocketR-style single-frame detection alone
Locked benchmark, deterministic re-run. Per-target lift figures and the full comparison table (vanilla fpocket / fpocketR params / our params / ensemble + ranker) on the methodology page. strict@1 = rank-1 cluster overlaps the experimental binding site by ≥ 50% of residues; near@1 = ≥ 30%.
Read full methodology// Capabilities
What the platform actually does
One integrated workflow, end to end. Sequence in, ranked top-3 pocket shortlist out, with full per-cluster metadata and a customer-facing PDF report.
// How it works
From sequence to ranked shortlist
A single deterministic pipeline. Input a sequence (or a PDB upload); pick up a top-3 ranked shortlist of candidate pockets with full geometric metadata.
Predict and ensemble
AI structure prediction generates the 3D tertiary structure from sequence. A five-frame conformational ensemble is sampled around the prediction. MSA-driven prediction available where the pre-pilot screen indicates.
Detect and cluster
Cavities are detected on each frame and clustered across the ensemble at 4 Å. Persistent cavities — those that survive the conformational sampling — are kept; transient or single-frame artefacts are filtered out.
Rank and return
Persistent cavities are ranked by our RNA-applicable scoring function. You receive a top-3 shortlist with residue lists, geometric metadata, the ensemble PDB and a branded PDF report.
// Why work with RNAfold
Honest scope, transparent methodology, EU jurisdiction
The differentiators that matter once the science is right.
Pre-pilot screening with explicit scope
Cleft-shaped binding pockets are in scope. Groove binders, surface interfaces and large complex folds are flagged out-of-scope upfront via the pre-pilot screen. We tell you whether v0.2 should help on your specific target before you commit to a pilot — not after.
See pricing & screen →EU-based, GDPR-compliant
Incorporated in Malta. All compute and storage on EU infrastructure. No sequence data leaves European jurisdiction. Zero-retention option available on the Enterprise tier.
Privacy policy →Transparent methodology
Full pipeline, third-party attribution and licences, validation methodology — all on /methodology. Customer pilots get the same disclosure in the PDF report. We don’t hide what we integrate; we name it where it belongs.
Read methodology →Reproducible benchmark
Seven cleft-binder targets, locked methodology, deterministic re-runs. The numbers below are what the pipeline actually produces, including the one neither case we don’t recover. No survivorship; no cherry-picking.
See benchmark →// v0.2.0 benchmark
What the pipeline actually recovers
Seven cleft-binder targets with deposited co-crystal structures — six riboswitch families plus one group I intron. As-shipped configuration: single-sequence prediction by default; MSA mode where the pre-pilot screen indicates. Numbers are exact, deterministic, and reproducible.
2GDI
2GDI
TPP RF00059 · 78 nt
4GXY
4GXY
B12 RF00174 · 161 nt
2GIS
2GIS
SAM-I RF00162 · 94 nt
5C45
5C45
FMN RF00050 · 54 nt
3DIL
out-of-scope fold class
3DIL
Group I intron · 174 nt
2HOJ
2HOJ
TPP (thi-box) · 83 nt
4LVV
4LVV
THF RF01831 · 89 nt
Strict @1 = at least one cluster in the rank-1 position with ≥ 50 % binding-site residue overlap.Near @1 = ≥ 30 %. RMSD = backbone C3′ RMSD vs experimental chain. Top-cluster overlap is shown after the result.
3DIL (group I intron) is a known out-of-scope fold class for v0.2 — we surface it in the table rather than hide it. Reproducer + per-cell records described on the methodology page.
// Worked examples
See the pipeline output
Three live worked examples spanning the v0.2 outcome classes — strict@1 (2GDI, single-seq), strict@1 via the opt-in MSA path (4LVV) and near@1 with global RMSD honestly reported (5C45).
TPP riboswitch (2GDI)
Live worked example. The pipeline recovers the TPP binding-site cluster at rank 1 with 71% binding-site residue overlap. Top-3 with full per-cluster metadata + interactive 3D viewer.
THF riboswitch (4LVV)
Live worked example. The pre-pilot screen flags 4LVV’s diverse-tail homologs; MSA mode lifts rank-1 recovery from neither (19% overlap, single-seq) to strict (50% overlap, MSA). Same pipeline a Discovery-tier customer runs.
FMN riboswitch (5C45)
Live worked example. Smallest target in the benchmark (54 nt). Backbone RMSD 10 Å but the rank-1 cluster picks up 40% of the FMN binding-site residues — near@1, not strict. The case for reporting both global and local quality metrics.
// Roadmap
What's next
Public roadmap for v0.2.1 and v0.3, with honest status labels. We do not commit to dates; items move when the work is done.
- →Comparative pocket analysis across sequence variants
- →ChEMBL annotation overlay on detected pockets
- →Per-pocket sequence conservation analysis
- →Batch submission API for screening campaigns
- →Energy-aware ensemble via molecular dynamics (research)
- →Groove-binding pocket detection (research)
// Get started
Ready to assess your RNA target for cleft pockets?
We are onboarding pilot customers now. Send a sequence and we will run the pre-pilot screen and schedule a 30-minute walk-through of the output.
Pre-pilot screen · Pilot pricing · Malta-based · EU data residency