Seam YAML Transform Index


AlignLigands

Align ligands.

Example usage

type: AlignLigands
config:
  arg_string: "TODO"
  method_type: FLEX/TUG/DEFAULT
  input_field: "structure"
  output_field: "aligned_structure"

Configuration

  • label ? (Optional)

Usage

type: AlignLigands
input: ...
config:
  label: label

CreateMols

Create a collection of rdkit Mols from a collection of SMILES strings or dictionaries containing SMILES strings.

Example usage

type: CreateMols
config:
  mol_field: "mol"
  elements:
  - "CC(=O)C1=CC=C(C=C1)C(=O)O"
  - "Cn1cnc2c1c(=O)n(C)c(=O)n2C"
  - "CC(C)[C@H]1CCC(C)CC1O"

or

type: CreateMols
config:
  smiles_field: "smiles"
  mol_field: "mol"
  elements:
    - smiles: "CC(=O)C1=CC=C(C=C1)C(=O)O"
      name: "Aspirin"
    - smiles: "Cn1cnc2c1c(=O)n(C)c(=O)n2C"
      name: "Caffeine"

Configuration

  • mol_field string

  • smiles_field string

  • elements Array[?]

Usage

type: CreateMols
config:
  mol_field: "mol_field"
  smiles_field: "smiles_field"
  elements:
  - element
  - element
  - ...

CreateStructures

Create a collection of schrodinger Structure objects from a collection of SMILES strings or dictionaries containing SMILES strings.

Example usage

type: CreateStructures
config:
  structure_field: "structure"
  elements:
  - "CC(=O)C1=CC=C(C=C1)C(=O)O"
  - "Cn1cnc2c1c(=O)n(C)c(=O)n2C"
  - "CC(C)[C@H]1CCC(C)CC1O"

or

type: CreateStructures
config:
  smiles_field: "smiles"
  structure_field: "structure"
  elements:
    - smiles: "CC(=O)C1=CC=C(C=C1)C(=O)O"
      name: "Aspirin"
    - smiles: "Cn1cnc2c1c(=O)n(C)c(=O)n2C"
      name: "Caffeine"

Configuration

  • structure_field string

  • smiles_field string

  • elements Array[?]

Usage

type: CreateStructures
config:
  structure_field: "structure_field"
  smiles_field: "smiles_field"
  elements:
  - element
  - element
  - ...

EpikClassicProtonateLigands

Enumerate protonation states of ligands using Epik classic.

Example usage

type: EpikClassicProtonateLigands
config:
  arg_string: "TODO"
  epik_classic_infile: "epik_classic.in"
  input_field: "structure"
  output_field: "protonated_structure"

Configuration

  • label ? (Optional)

Usage

type: EpikClassicProtonateLigands
input: ...
config:
  label: label

EpikProtonateLigands

Enumerate protonation states of ligands using Epik.

Example usage

type: EpikProtonateLigands
config:
  arg_string: "TODO"
  mode: #TODO
  input_field: "structure"
  output_field: "protonated_structure"

Configuration

  • label ? (Optional)

Usage

type: EpikProtonateLigands
input: ...
config:
  label: label

FlexAlignLigands

Flex align ligands.

Example usage

type: FlexAlignLigands
config:
  arg_string: "TODO"
  input_field: "structure"
  output_field: "aligned_structure"

Configuration

  • label ? (Optional)

Usage

type: FlexAlignLigands
input: ...
config:
  label: label

GlideDock

Generate 3d docked poses from a collection of structures using Glide.

Example usage

type: GlideDock
config:
  structure_field: "structure"
  grid_file: "foo.grid"
  reference_ligand_file: "reference_ligand.maegz"
  glide_in_file: "settings.in"
  poses_per_lig: 1

Configuration

  • grid_file string

  • reference_ligand_file string

  • glide_in_file string

  • structure_field string

  • poses_per_lig int64

Usage

type: GlideDock
input: ...
config:
  grid_file: "grid_file"
  reference_ligand_file: "reference_ligand_file"
  glide_in_file: "glide_in_file"
  structure_field: "structure_field"
  poses_per_lig: poses_per_lig

LigFilter

Filter ligands based on a series of criteria

Example usage

type: LigFilter
config:
  ligfilter_file: ligfilter.txt
  input_field: "structure"

Configuration

  • label ? (Optional)

Usage

type: LigFilter
input: ...
config:
  label: label

LigPrep

Perform LigPrep on a collection of molecules.

LigPrep works by expanding tautomeric and ionization states, ring conformations, and stereoisomers consistent with the input information to fully capture the relevant states of the molecule in 3D.

Example usage

type: LigPrep
config:
  arg_string: "-nt -epik -s 32"

Configuration

  • arg_string string : (optional) The ligprep argstring to use. Defaults to "-nt -epik -s 32".

    LigPrep Args

    Ionization: -epik Use Epik Classic for ionization and tautomerization (Recommended, overrides -i). -epikx Use Epik for ionization and tautomerization (Recommended, overrides -i). -emb, -epik_metal_binding Run Epik with the metal_binding option so that states appropriate for interactions with metal ions in protein binding pockets are also generated. -i {0,1,2} Ionization treatment: 0 - do not neutralize or ionize, 1 - neutralize only, 2 - neutralize and ionize (Default: 1). Note that -epik option overrides -i and always implies ionization and tautomerization. -mg Ionizer will skip over structures containing more than this number of ionizable groups (Default: 10). -no_refine Disable the final refinement in EpikX. -ph Effective/target pH. (default = 7.0 for Ionizer/Epik Classic, 7.4 for Epik) -pht pH tolerance for generated structures. Tautomerization: -nt Disable tautomerizer (Default: enabled). -t # Run tautomerizer and set the maximum number of tautomers to generate to # (Default: 8). -tp # Run tautomerizer and set the minimum probability for retaining tautomers to # (Default: 0.01). Stereoisomers: -ac Do not respect existing chirality properties and do not respect chiralities from the input geometry. Generate stereoisomers for all chiral centers up to the number permitted (specified using the -s option). This is equivalent to "Generate all combinations" in the Ligand Preparation user interface. Default behavior is to respect only explicitly indicated chiralities. -g Respect chiralities from input geometry when generating stereoisomers. -ns Do not use the stereoizer. -s # Generate up to this many (#) stereoisomers per input structure. (Default: 32). -strict_v2000_stereo For SD V2000 input, generate enantiomers if the chiral flag is 0. Filtering: -ng Disable filtering out of exotic structures. -lp Pass -addprops to LigFilter. -lab Add an internal tracking label to the processed structures. -lab_filter Use tracking labels to filter derived structures (this option turns on -lab). Currently supported: ionizer:# - select at most # output structures for each ionizer input structure such that the formal charge on each ionizable groups is minimized. Force-field based geometry optimization: -bff {14,16} Force-field to be used for the final geometry optimization. Default: 14 (OPLS_2005). For S-OPLS specify 16. -bvac Set dielectric constant to 1.0 for the final force- field based geometry optimization. -cgx_noopt Skip final force-field based geometry optimization. -no_cm1a Skip CM1A charge assignment for S-OPLS. -L CGX fragment library path. Overrides default location in the appdata directory if specified. Meta options: (turn on a number of actual options) -unt : Only run stereoizer and cgx to untangle structures. -adjust : Adjust to a suitable state. -vary : Generate different states. -expand : Aggressively generate different states. -retain : Retain characteristics of the input structures (specifying 'i' also turns on retention of tautomers). specify one or more of: c chiralities i ionization and neutralization t tautomerization

  • input_field string : (optional) The field to use as input. Must be a Structure field. Defaults to structure.

  • output_field string : (optional) The field to save the ligprepped structure to. If it's the same as input_field, the input field will be replaced. Defaults to structure.

Usage

type: LigPrep
input: ...
config:
  arg_string: "arg_string"
  input_field: "input_field"
  output_field: "output_field"

MolToSmiles

Convert a collection of rdkit Mols to SMILES strings

Example usage

type: MolToSmiles
config:
  input_field: "mol"
  output_field: "smiles"

Configuration

  • input_field string

  • output_field string

Usage

type: MolToSmiles
input: ...
config:
  input_field: "input_field"
  output_field: "output_field"

MolToStructure

Convert a collection of rdkit Mols to schrodinger Structure objects

Example usage

type: MolToStructure
config:
  input_field: "mol"
  output_field: "structure"

Configuration

  • input_field string

  • output_field string

Usage

type: MolToStructure
input: ...
config:
  input_field: "input_field"
  output_field: "output_field"

PrettyPrintJson

Pretty-print Beam Rows as JSON strings.

The expand configuration option controls the formatting: - If expand is True, the JSON output is printed with indentation across multiple lines. - If expand is False (default), the JSON output is compact and printed on a single line.

Example usage

type: PrettyPrintJson
config:
  expand: true

Configuration

  • expand boolean

Usage

type: PrettyPrintJson
input: ...
config:
  expand: true|false

QikProp

Perform QikProp.

Example usage

type: QikProp
config:
  arg_string: "TODO"
  input_field: "structure"

Configuration

  • arg_string string : (optional) The qikprop argstring to use.

Usage

type: QikProp
input: ...
config:
  arg_string: "arg_string"

ReadMolsFromFile

Read a collection of rdkit Mols from a file (or files) containing a newline separated list of SMILES strings.

Invalid SMILES strings are skipped. A warning is printed if silent is set to False.

Example usage

type: ReadMolsFromFile
config:
  input_file: "ligands.smi"

Configuration

  • input_file string

  • silent boolean

Usage

type: ReadMolsFromFile
config:
  input_file: "input_file"
  silent: true|false

ReadStructuresFromFile

Read a collection of schrodinger Structure objects from a file (or files).

Example usage

type: ReadStructuresFromFile
config:
  input_file: "ligands.maegz"

Configuration

  • input_file string

Usage

type: ReadStructuresFromFile
config:
  input_file: "input_file"

SmilesToMol

Convert a collection of SMILES strings to rdkit Mols

Example usage

type: SmilesToMol
config:
  input_field: "smiles"
  output_field: "mol"

Configuration

  • input_field string

  • output_field string

Usage

type: SmilesToMol
input: ...
config:
  input_field: "input_field"
  output_field: "output_field"

SmilesToStructure

Convert a collection of SMILES strings to schrodinger Structure objects

Example usage

type: SmilesToStructure
config:
  input_field: "smiles"
  output_field: "structure"

Configuration

  • input_field string

  • output_field string

Usage

type: SmilesToStructure
input: ...
config:
  input_field: "input_field"
  output_field: "output_field"

StructureToMol

Convert a collection of schrodinger Structure objects to rdkit Mols

Example usage

type: StructureToMol
config:
  input_field: "structure"
  output_field: "mol"

Configuration

  • input_field string

  • output_field string

Usage

type: StructureToMol
input: ...
config:
  input_field: "input_field"
  output_field: "output_field"

TugAlignLigands

Tug align ligands.

Example usage

type: TugAlignLigands
config:
  arg_string: "TODO"
  receptor_file: "receptor.maegz"
  input_field: "structure"
  output_field: "aligned_structure"

Configuration

  • label ? (Optional)

Usage

type: TugAlignLigands
input: ...
config:
  label: label

WriteMolsToFile

Write a collection of rdkit Mols to a file.

The transform expects input pcollection to contain schemad rows with field named mol containing rdkit Mol objects.

Example usage

type: WriteMolsToFile
config:
  output_file: "ligands.smi"

Configuration

  • output_file string

Usage

type: WriteMolsToFile
input: ...
config:
  output_file: "output_file"

WriteStructuresToFile

Write a collection of schrodinger Structure objects to a file.

The transform expects input pcollection to contain schemad rows with field named structure containing schrodinger.Structure objects.

Example usage

type: WriteStructuresToFile
config:
  output_file: "ligands.maegz"

Configuration

  • output_file string

Usage

type: WriteStructuresToFile
input: ...
config:
  output_file: "output_file"

ReadFromPDB

Read a collection of PDB structures from a list of PDB codes. Returns a Beam PCollection of Rows with the PDB code and its structure.

Example usage

type: ReadFromPDB
config:
    pdb_codes:
        - "1A2B"
        - "3C4D"
    preserve_caps: false

Configuration

  • pdb_codes Array[string]

  • preserve_caps boolean

Usage

type: ReadFromPDB
config:
  pdb_codes:
  - "pdb_codes"
  - "pdb_codes"
  - ...
  preserve_caps: true|false