Quick Start๏ƒ

Get KAST running and complete your first analysis in 5 minutes.


Step 1: Launch KAST๏ƒ

Windows:๏ƒ

Option A - Desktop Shortcut (Recommended):

  • Click the โ€œK-talysticFlowโ€ shortcut on your desktop

Option B - Command Line:

conda activate ktalysticflow
python main.py

Linux/macOS:๏ƒ

Option A - Command Line (Recommended & Easiest):

kast

This command works from any terminal, any directory.

Option B - Desktop Menu:

  • Search for โ€œK-talysticFlowโ€ in your application menu

  • Click to launch

Option C - Manual:

conda activate ktalysticflow
cd /path/to/KAST
python main.py

Step 2: The Main Menu๏ƒ

Youโ€™ll see:

โ•”โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•—
โ•‘  Control Panel - KAST                 โ•‘
โ•šโ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•โ•

  [1] Prepare and Split Data
  [2] Generate Fingerprints
  [3] Create and Train the Model
  [4] Evaluate the Model
  [5] Predict Activity of New Molecules

  [8] Advanced Options (Testing & Configuration)
  [9] About & How to Cite

  [W] Workspace Management (Switch/Create Projects)
  [0] Exit Program

Enter your choice number: 

Step 3: Prepare Your Data๏ƒ

Create two SMILES files in the data/ folder:

data/actives.smi:

CC(C)Cc1ccc(cc1)C(C)C(O)=O  ibuprofen
CN1C=NC2=C1C(=O)N(C(=O)N2C)C  caffeine

data/inactives.smi:

CCCCCCCCCCCCCCCC  hexadecane
CC(=O)OC1=CC=CC=C1C(=O)O  aspirin

Format: SMILES [space] optional_name


Step 4: Run the Pipeline (Sequential Steps)๏ƒ

Press [1] โ†’ Prepare and split data Press [2] โ†’ Generate molecular fingerprints
Press [3] โ†’ Create and train the model
Press [4] โ†’ Evaluate model performance
Press [5] โ†’ Make predictions on new molecules


Next Steps๏ƒ

  • Manage Workspaces: [W] (create separate projects for different targets)

  • Check environment: [8] โ†’ [1]

  • View results: Check the workspaces/<your_workspace>/ folder

  • Help: [9] for credits and documentation


Step 5: Predict New Molecules๏ƒ

Place your library in data/my_library.smi:

CCC1=CC=CC=C1  ethylbenzene
CC1=CC=CC=C1  toluene
CCc1ccccc1O  o-ethylphenol

In the menu, press [5] to open the Prediction Menu, then [1] to featurize new data:

[1] ๐Ÿ”ฎ Featurize New Molecules
Select file to featurize: my_library.smi
Processing 3 new molecules...

Then press [2] to predict (calculates K-Score):

[2] ๐ŸŽฏ Run Predictions
Select featurized dataset: my_library_featurized
Running predictions on 3 molecules...
Results saved to workspaces/<your_workspace>/<custom_filename>.csv

Step 6: Check Your Results๏ƒ

All outputs saved to your active workspace folder:

workspaces/<your_workspace>/
โ”œโ”€โ”€ prepared_data/                   # Contains invalid_smiles_report.txt
โ”œโ”€โ”€ 01_train_set.csv                 # Training data
โ”œโ”€โ”€ 01_test_set.csv                  # Test data
โ”œโ”€โ”€ 4_0_evaluation_report.txt        # Metrics (AUC, accuracy, etc)
โ”œโ”€โ”€ 4_1_cross_validation_results.txt # Cross-val scores
โ”œโ”€โ”€ <custom_filename>.csv  # Predicted molecules + K-scores (Default: 05_new_molecule_predictions.csv)
โ”œโ”€โ”€ 4_0_roc_curve.png               # ROC plot
โ”œโ”€โ”€ 4_4_learning_curve.png          # Model learning progression
โ””โ”€โ”€ logs/
    โ””โ”€โ”€ kast_YYYYMMDD.log           # Detailed log file

Open predictions CSV:

SMILES                              K-Score  Predicted_Class
CCC1=CC=CC=C1                       0.92     Active
CC1=CC=CC=C1                        0.45     Inactive
CCc1ccccc1O                         0.78     Active

๐ŸŽฏ Youโ€™re Done!๏ƒ

Youโ€™ve completed:

  • โœ… Data preparation

  • โœ… Featurization

  • โœ… Model training

  • โœ… Evaluation

  • โœ… Predictions