Tools
RNAQUA
RNAQUA (RNA QUality Assessment) is a web service based wrapper of basic RNA comparison metrics. It is a RESTful web service client developed in Java. The tool provides a set of web services initially designed for RNAssess (Lukasiak et al., Nucleic Acids Research, 2015;43(W1):W502-W506. doi:10.1093/nar/gkv557) to support the quality assessment of RNA 3D structures. RNAQUA requires a stable release of JRE 7 (or later) installed on a user workstation with an internet access. Full implementation of the tool is stored on github and maintained by Antczak. The binaries and use cases can be downloaded from here. Most important features of RNAQUA:
- At the input, a user provides RNA 3D structure(s) in PDB format.
- Output data are returned in XML format.
- RNAQUA allows to compute the following measures:
- ClashScore,
- Root-mean-square deviation (RMSD),
- Interaction network fidelity (INF),
- Deformation index (DI),
- P-value (either for entire RNA 3D structure(s) or for a set of discontinuous 3D substructures).
- Two processing modes are available:
- An analysis of single RNA 3D structure (PDB validation, ClashScore, sequence-based analysis, structure unification),
- An analysis of RNA 3D model(s) with respect to the reference structure (RMSD, INF, DI, P-value, Deformation profile, sequence-based differences between structures, multiple models over the reference structure superposition).
- Optionally, the user can define alignment between the reference structure and all corresponding RNA 3D model(s) which is helpful if there are differences in sequence, distribution of chains or residue numbering.
This metric tool is maintained by Antczak.
MCQ and LCS-TA
MCQ (Mean of Circular Quantities) has been first presented in the paper “MCQ4Structures to compute similarity of molecule structures” by Zok et al. (Central European Journal of Operations Research, 2014;22(3):457-474. doi: 10.1007/s10100-013-0296-5).
LCS-TA (Longest Continuous Segments in Torsion Angle space) applies a measure first described in the paper “LCS-TA to identify similar fragments in RNA 3D structures” by Wiedemann et al. (BMC Bioinformatics, 2017;18(1):456. doi: 10.1186/s12859-017-1867-6). A full implementation of both methods can be found on github, maintained by Zok and Wiedemann.
Most important features of both methods / measures:
MCQ
- The method applies to a pair of 3D structures and is size independent.
- It can be used for a set of structures in all-against-all or all-against-target mode.
- It translates typical algebraic representation of a 3D structure into the trigonometric one (a set of torsion angles).
- It computes the distance between structures in torsion angle space.
- The distance is measured as mean of local distances between the corresponding angles, and provided in degrees.
- The measure is sequence independent.
LCS-TA
- The method applies to a pair of 3D structures and is size independent.
- It uses MCQ-based measure for structure comparison.
- Within the compared structures, it finds the longest continuous segments which display similarity in torsion angle space.
- Two segments are considered similar if their MCQ is below predefined threshold.
- The method provides segment length and its position in the structure.
- The length of the longest continuous segment is a measure of similarity of two structures.
- The method can be run in sequence dependent or sequence independent mode.
MCQ is maintained by Zok, while LCS-TA is maintained by Wiedemann
RNA-Puzzles Datasets
We provide all the native and predicted structures from RNA-Puzzles here as a dataset resource.
Currently, there is few gold standard datasets available in benchmarking RNA 3D structure prediction. RNA-Puzzles as a well-established blind assessment of RNA 3D structure prediction. Therefore, the structure data from RNA-Puzzles can provide as a good benchmark set for RNA 3D structure prediction. Besides, the predicted structures submitted to RNA-Puzzles, which are generated through various prediction methods that generally covers all the RNA 3D structure prediction methods, can be considered as perfect decoy sets in deriving energy functions for RNA structure folding.
We provide both raw predictions and normalized format of the RNA structures. Besides, we also normalized the structures with RNA-PDB-tools and provide the results here.
RNA-Puzzles Format Check
After the first RNA-Puzzles meeting (2016 Oct, Strasbourg), the RNA-Puzzles community have agreed to unify all predictions to the same PDB format.
To check if the submitted structures are in good format, a simple script based tool is provided:
RNA-Puzzles format
– a tool used to check if a predicted structure is in good format.
To format/normalize RNA structures, please use structure normalization tools.
RNA Structure Format Tools
RNA structures in RNA-Puzzles are deposited as PDB format. However, we need to consider the variations when participants generate their predictions, e.g. the nomencleture of the residue/atom names, and the variations in the format of deposited native structure. Sometimes, the sequence used in prediction may slightly deviate from the solved crystal structure: in such a case, we need to compare only the common part of the structure.
Considering these aspects, we provide some tools here to normalize the PDB format of RNA structure:
PDB normalizer
– this tool is the PDB normalization tool used by RNA-Puzzles, and is part of the basic assessment metrics tool. (Maintained by Chichau)
RNA PDB tools
– Alternatively, you may use a more functional tool RNA-PDB-tools, which is mained by Marcin Magnus.
Deformation Profile
Deformation profile is a metric first discribed in the paper “New metrics for comparing and assessing discrepancies between RNA 3D structures and models.” (RNA. 2009 Oct;15(10):1875-85. doi: 10.1261/rna.1700409. )
A full implementation can be found at github, maintained by Chichau
RNA-Puzzles Assessment consider all the “Basic Assessment Metrics”, Deformation profile and “Mean of Circular Quantities”.
Basic Assessment Metrics
The basic metrics of RNA 3D structure comparison include:
- Root Mean Square Deviation (RMSD)
- P value
- Deformation Index
- Interaction Network fidelity
- this metric can be measured by different types of contacts, including:
- all contacts
- Watson-Crick interaction
- non-Waston-Crick interaction
- stacking
All of these metrics have been implemented in Basic Assess Metrics (author: Chichau)