Querying the Service

The BTS service can be queried using REST API or GraphQL. The REST API is the primary way to interact with the service and is highly optimised. While the GraphQL API provides a more flexible way to query the data, it may be slower in performance.

Using the REST API

The REST API is designed to support Cafe Variome V3 functionalities, including the frontend auto-completion and backend searching functions. The implemented features are:

Auto Completion

Auto-completion refers to the feature where the user types a few characters, and the system suggests a list of terms that match the input. Specifically:

The search string needs to be at least 3 characters in length. This is to prevent returning a massive list of terms.
The search is case-insensitive. Furthermore, all the quotation marks will be stripped off.
White spaces are considered separators. The search string will be split into words, and each word will be searched on separately. However any other characters will be matched as-is, including but not limited to hyphens, underscores, dots, etc.

The results will be ordered by the relevance of the term to the search string. The relevance is calculated based on the length of the resulting term, and the position of the search string in the term. For example:

For the search string "pheno" on HPO, the top 5 results will be:

Koebner Phenomenon
Phenotypic abnormality
Phenotypic variability
Sphenoid wing dysplasia
Biphenotypic acute leukemia

The strings are ordered first by the length of the term, with the shortest term appearing first. This is based on the assumption that shorter terms are more likely to match a larger portion of the search string. If two terms have the same length, the position of the search string within the term is considered. Terms where the search string appears earlier are ranked higher in relevance.

In this system, not only the label but also other parts of a term are searched. The priority is as follows:

Term ID (without prefix - This is important because we do not store the prefix, so searching on the prefix will give you nothing)
Term Label
Synonyms
Definitions

Everything else is ignored.

Ontology Tree Expansion

The ontology tree expansion search refers to the feature where the user provides term IDs, and the system returns the terms that are children of the provided terms. Naturally this only works on ontological datasets. Even if the other datasets have some type of hierarchy (pathway → reaction → product), they cannot be used in the expansion feature.

Semantic Similarity Search

The semantic similarity search is a feature where the user provides a list of terms along with one or more similarity thresholds, and the system returns terms that are similar to the provided ones. Similarity is calculated based on the ontology tree structure using the relevance method.

Term Translation

Translation search is a feature where the user provides a list of terms, one or more similarity thresholds, and a list of constraint terms. The service then returns the terms from the constraint list that exceed the given threshold. This feature is used to translate a match set to another on services that do not support fuzzy matching or similarity queries, such as a BEACON system.

Term Set Mapping

Term set mapping is a feature where the user provides a list of terms and a target set, and the system maps the original terms onto the annotated or mapped terms on the target set. For now, only direct annotation is supported (for example, HPO and ORDO have annotations, and ORDO and SNOMED have annotations. However, the two cannot be used together to map HPO to SNOMED).

Using the GraphQL API

A GraphQL endpoint is available at https://similarity.cafevariome.org/graphql, allowing for customisable queries to the service. The GraphQL API supports all the functionalities of the REST API, along with additional features such as multi-link traversal, child-parent reverse traversal, and more complex queries. However, due to the nature of GraphQL, the query may not be as optimised as the REST API, potentially resulting in slower performance. Additionally, the response is typically larger than that of the REST API. The schema is provided below:

enum TermType { ctv3 hpo ordo snomed hgnc omim ncit reactome geneSymbol } interface BioMedicalTerm { termId: String! } interface GeneTerm implements BioMedicalTerm { termId: String! label: String! symbol: [GeneSymbol]! } interface Pathway implements BioMedicalTerm { termId: String! label: String reactions: [Reaction] } interface Reaction implements BioMedicalTerm { termId: String! label: String pathways: [Pathway] } interface OntologyTerm implements BioMedicalTerm { termId: String! label: String children: [OntologyTerm] parents: [OntologyTerm] } type Query { _empty: String ctv3(termId: String!): Ctv3Term ensemblGene(termId: String!): EnsemblGene ensemblTranscript(termId: String!): EnsemblTranscript ensemblExon(termId: String!): EnsemblExon ensemblProtein(termId: String!): EnsemblProtein gene(termId: String!): GeneSymbol hgnc(termId: String!): HgncTerm hpo(termId: String!): HpoTerm ncit(termId: String!): NcitTerm omim(termId: String!): OmimTerm ordo(termId: String!): OrdoTerm reactomePathway(termId: String!): ReactomePathway reactomeReaction(termId: String!): ReactomeReaction snomed(termId: String!): SnomedTerm } type Ctv3Term implements BioMedicalTerm & OntologyTerm { termId: String! label: String deprecated: Boolean children: [Ctv3Term] parents: [Ctv3Term] replaces: [Ctv3Term] replacing: [Ctv3Term] annotatedSnomed: [SnomedTerm] } type EnsemblGene implements BioMedicalTerm & GeneTerm { termId: String! label: String! seqName: String! start: Int! end: Int! symbol: [GeneSymbol]! transcripts: [EnsemblTranscript] } type EnsemblTranscript implements BioMedicalTerm { termId: String! label: String gene: EnsemblGene exons: [EnsemblExon] protein: EnsemblProtein } type EnsemblExon implements BioMedicalTerm { termId: String! label: String transcripts: [EnsemblTranscript] } type EnsemblProtein implements BioMedicalTerm { termId: String! label: String transcripts: EnsemblTranscript } type GeneSymbol implements BioMedicalTerm { termId: String! hgnc: [HgncTerm] ensembl: [EnsemblGene] annotatedHpo: [HpoTerm] annotatedNcit: [NcitTerm] annotatedOmim: [OmimTerm] annotatedOrdo: [OrdoTerm] annotatedReactomeReaction: [ReactomeReaction] } type HgncTerm implements BioMedicalTerm & GeneTerm { termId: String! label: String! definition: String! location: String! synonyms: [String] deprecated: Boolean symbol: [GeneSymbol]! } type HpoTerm implements BioMedicalTerm & OntologyTerm { termId: String! label: String definition: String comment: String deprecated: Boolean children: [HpoTerm] parents: [HpoTerm] replaces: [HpoTerm] replacing: [HpoTerm] similar(similarity: Float): [HpoTerm] annotatedOrdo: [OrdoTerm] annotatedGene: [GeneSymbol] } type NcitTerm implements BioMedicalTerm & OntologyTerm { termId: String! label: String! definition: String deprecated: Boolean synonyms: [String] children: [NcitTerm] parents: [NcitTerm] annotatedGene: [GeneSymbol] } type OmimTerm implements BioMedicalTerm & OntologyTerm { termId: String! label: String deprecated: Boolean children: [OmimTerm] parents: [OmimTerm] replaces: [OmimTerm] replacing: [OmimTerm] annotatedGene: [GeneSymbol] annotatedOrdo: [OrdoTerm] } type OrdoTerm implements BioMedicalTerm & OntologyTerm { termId: String! label: String definition: String deprecated: Boolean synonyms: [String] children: [OrdoTerm] parents: [OrdoTerm] replaces: [OrdoTerm] replacing: [OrdoTerm] annotatedGene: [GeneSymbol] annotatedHpo: [HpoTerm] annotatedOmim: [OmimTerm] annotatedSnomed: [SnomedTerm] } type ReactomePathway implements BioMedicalTerm & Pathway { termId: String! label: String! synonyms: [String] reactions: [ReactomeReaction] } type ReactomeReaction implements BioMedicalTerm & Reaction { termId: String! label: String! synonyms: [String] pathways: [ReactomePathway] annotatedGenes: [GeneSymbol] } type SnomedTerm implements BioMedicalTerm & OntologyTerm { termId: String! label: String definition: String synonyms: [String] fullyDefined: Boolean deprecated: Boolean children: [SnomedTerm] parents: [SnomedTerm] replaces: [SnomedTerm] replacing: [SnomedTerm] annotatedCtv3: [Ctv3Term] annotatedOrdo: [OrdoTerm] }

Last modified: 28 March 2025