SaaS

LFS HIT-SCR AI-Hit Discovery Solution, Novel & Quickly Synthesizing Library

“LFS-HIT Screening” quickly and more precisely screens novel HIT candidates through AI-based numerous calculations in a premium library, with or without reference structure data. Plus, our technical cooperation has reduced the time from synthesis to customs clearance and receipt by half again. A proven platform in various therapeutic areas and protein families, including kinase, GPCR, and nuclear receptors. Discover novel scaffolds and HIT candidates from a library of up to 50 million premium compounds.

“LFS-HIT Screening” quickly and more precisely screens novel HIT candidates through AI-based numerous calculations in a premium library, with or without reference structure data. Plus, our technical cooperation has reduced the time from synthesis to customs clearance and receipt by half again. A proven platform in various therapeutic areas and protein families, including kinase, GPCR, and nuclear receptors. Discover novel scaffolds and HIT candidates from a library of up to 50 million premium compounds.
SCROLL DOWN

Solution

Are you planning to develop NME? Want to discover a novel scaffold HIT?

Start your planning with the premium library of up to 50 million capable of screening novel scaffolds for NME development. Strengthen your rights by securing material patents from the HIT discovery stage.

Want to find HITs that have a clear binding mode from the start?

STB offers HIT candidates with a clear binding mode as best-pose prediction between target and compound.

Do you have trouble with the long wait for the synthesis or customs clearance?

Save synthesis time/cost for verification with HIT candidates that can be synthesized quickly and reliably.

Workflow

LFS Library (Large-scale and Fast-synthesis DB)

ALFS Library (Large-scale and Fast-synthesis DB)

  • Start screening new HITs from extensive libraries of up to 50 million compounds

Input : Target protein info. Ref. drug info. (if available)

Deep Learning Screening

BDeep Learning Screening

  • After preliminary screening based on physicochemical properties, AI-3D docking of target-compound is conducted.
  • Next, various conformers and binding poses for the compounds are generated and then prioritized by CNN scoring. 

Input : 50M Cpds list & Target 3D structures

Output : Top ranked 500 Cpds & poses info.

False Positives Filtering

CFalse Positives Filtering

  • The best-pose is selected based on the binding energy of Cpds and false positives are filtered out through auto-MD simulation.
  • The Cpds are then scored based on various factors for each best-poses. 

Input : Poses of the top ranked 500 Cpds

Output : List of 200 Cpds with false positives filtered

Technical Performance

DHIT Selection

  • The final HIT candidates are re-ranked with our own scoring functions, providing a list of  ~200 top-ranked compounds that are ready-to-purchase without any synthesis steps.

Input : The screening score of 200 Cpds

Output : Priority of ~200 Cpds list

Application

  • Screening novel scaffolds for NME development based on physics informed deep learning
  • Discovering novel HIT compounds that can secure material patents from the early stage
  • Support HIT-to-Lead optimization by in silico derivatives design solution (STB’s Auto Lead-Gen)
References:
  • Ragoza, Matthew, et al. "Protein–ligand scoring with convolutional neural networks." Journal of chemical information and modeling 57.4 (2017): 942-957.
  • Syntekabio’s Hit to Preclinical Candidate on Cloud Computing, BioPharmaTrend, Vol4. 2023.
  • How Syntekabio’s Supercomputing Enables AI-driven Drug Discovery, BioPharmaTrend, Vol3. 2023

Auto-Hit Discovery by DeepMatcher®-Hit

The DeepMatcher® platform solves two key challenges in early-stage drug discovery: speed and precision.

In the new drug development market, the need for a new approach has emerged as drug discovery through the traditional methods has reached a saturation point. Accordingly, AI-based drug discovery platforms have been attempted. Syntekabio offers innovative services that prioritize convenience and cost-effectiveness in discovering new drug candidates.
Syntekabio’s DeepMatcher® platform resolved two key issues of the early-stage drug discovery: speed and precision. The hit discovery and lead optimization (lead-opt) process, which normally takes at least two, three years in a classical drug R&D setting, now takes only a few months with the help of AI technology.

Essential Key-challenges for Auto Hit Discovery

Flow diagram of Hit discovery by DeepMatcher® Compounds are prescreened through three steps before being docked in a pocket of the target. Once the optimal pose is selected, the MD simulation is finally performed in a total of five steps.

Essential Key-challenges for Auto Hit Discovery Essential Key-challenges for Auto Hit Discovery
  • 1Purchasable Chemical Library (1 Billion)
  • 2Physical-3D Docking (1 Million)
  • 33D-CNN Binding Mode Tuning (1k)
  • 4MD Simulation (1k) for Fine-tune

Purchasable Chemical Library

Syntekabio launched STB CLOUD in late 2022, a suite of cloud-based proprietary platforms, to meet the unmet needs of these cost-conscious innovators seeking reliable yet affordable new drug candidate discovery services. STB CLOUD screens 1 billion purchasable compounds based on unique featurization method for 3D information-based data and 1 million potential candidate compounds

Physical-3D Docking

One million chemicals selected from a library of 1 billion compounds was docked into target 3D structure using command line interface (CLI) based in-house structure manipulation package.

Figure2

Figure2. Chemicallibrary

Figure3

Figure3. Methods for centroidal atom vector-based fast protein-ligand docking. After docking, three atom-vector based rotations along the core ring. It rotates while moving 1000 times in each revolution and then, generates a protein-ligand interaction (PLI) score

3D-CNN for Binding Tuning

The diagram below shows the interaction between a protein and a ligand in the pocket as a grid box, a 3D-CNN based intermediate stage of repeated max pooling and 3D convolution, generating feature maps, and representing bonds. Here, the starting grid box (48) and kernel size (3*3*3) are used. The density function is based on the geometry of adjacent bonds of two atoms. Therefore, the sum of bonds is defined as binding affinity.

3D-CNN for Binding Tuning 3D-CNN for Binding Tuning
Figure5. Auto-MD simulation

Figure5. Auto-MD simulation

Auto-MD Simulation

Syntekabio's fully automated MD simulation algorithm performs molecular dynamics simulations (10 ns) in explicit water box for 1,000 protein-ligand complexes.

The diagram shows an example of running continuously from a dead step during 1,000 MD simulations. If one runs 1000 MD simulations at once, a large number of simulations may crash and die in certain cases. However, in the automated environment, it proceeds by saving the checkpoint and resume from the saved point.

Figure 6.  Comparison of AI-hit positive andnegative predictions with known Molecules.

Figure 6. Comparison of AI-hit positive andnegative
predictions with known Molecules.

Key-Residue Analysis

The known molecules and the predicted molecules with EGFR protein were compared to see if the functional (Hinge, DFG) and conformational keys (G-loop, M766 in c-Helix) were well matched. In the case of positive prediction, the functional and conformational keys were well matched, whereas in case of negative, there were few matched residues. (Red: strong binding, Blue: weak binding)

Hit validation

CLK2, also known as CDC-like kinase 2, is a protein kinase that plays a role in the regulation of pre-mRNA splicing. Presented 89 candidate compounds using Deepmatcher®-Hit as a result of structure analysis using tanimoto coefficient with KNN algorithm. a) It was confirmed that 89 compounds were divided into 7 groups. b) Purchasable compounds were biochemically confirmed their inhibition activities. Three compounds of 74 purchased candidates (yellow circles) inhibited enzyme activity of CLK2 more than 50%. c) The growth inhibitory effect of each cancer cell by treatment with STB-C070, STB-C070-D157 and reference compounds were analyzed using WST-1 assay. GI50 was calculated from the 8-point dose-response curves by triplicate test.

a) DeepMatcher® Result analysis

a) DeepMatcher® Result analysis

b) Biochemical assay result of AI-hits

b) Biochemical assay result of AI-hits

c) Cell-based assay result of AI-hits

c) Cell-based assay result of AI-hits