schrodinger.application.bioluminate.anarci.annotate module¶

class schrodinger.application.bioluminate.anarci.annotate.AnnotationResult(anarci_type: 'Optional[AnarciType]', sequence: 'ProteinSequence', start_index: 'int' = 0)¶

Bases: object

anarci_type: Optional[schrodinger.application.bioluminate.anarci.anarci_adapter.AnarciType]¶

sequence: schrodinger.protein.sequence.ProteinSequence¶

start_index: int = 0¶

classmethod from_domain_result(domain_result: schrodinger.application.bioluminate.anarci.anarci_adapter.AnarciDomainResult)¶: Create an AnnotationResult from an AnarciDomainResult

get_annotation_strings_with_gaps()¶

get_annotation_strings()¶

property annotation_strings_with_gaps¶

property annotation_strings¶

property ig_type_str¶

__init__(anarci_type: Optional[schrodinger.application.bioluminate.anarci.anarci_adapter.AnarciType], sequence: schrodinger.protein.sequence.ProteinSequence, start_index: int = 0) → None¶

schrodinger.application.bioluminate.anarci.annotate.read_fasta(filename: str) → list[schrodinger.application.bioluminate.anarci.anarci_adapter.InputSequence]¶

schrodinger.application.bioluminate.anarci.annotate.get_anarci_results_from_fasta(filename: str, **kwargs)¶

schrodinger.application.bioluminate.anarci.annotate.split_numbers_by_region(scheme: str, anarci_type: schrodinger.application.bioluminate.anarci.anarci_adapter.AnarciType, numbering: tuple[schrodinger.application.bioluminate.anarci.anarci_adapter.ResInfo], ignore_gaps=False) → tuple[tuple[schrodinger.application.bioluminate.anarci.anarci_adapter.ResInfo], ...]¶

Split the numbering into tuples of ResInfo objects for each region

Parameters

scheme – antibody numbering scheme to use for the region definitions
anarci_type – the type of antibody to get the region indices for
numbering – numbered residue info objects in sequential order
ignore_gaps – whether to skip ResInfo objects which represent gap characters in the sequence

Returns

tuples of ResInfo objects for each region

schrodinger.application.bioluminate.anarci.annotate.get_region_bounds(scheme: str, anarci_type: schrodinger.application.bioluminate.anarci.anarci_adapter.AnarciType, numbering: tuple[schrodinger.application.bioluminate.anarci.anarci_adapter.ResInfo], ignore_gaps=False, start_index: int = 0) → tuple[tuple[int, int], ...]¶

Get the start and end indices for each loop and non-loop region

Parameters

scheme – antibody numbering scheme to use for the region definitions
anarci_type – the type of antibody to get the region indices for
numbering – numbered residue info objects in sequential order
ignore_gaps – whether to skip ResInfo objects which represent gap characters in the sequence
start_index – the starting index of the domain in the sequence

Returns

a tuple of start and end indices for each region

schrodinger.application.bioluminate.anarci.annotate.get_numbers_by_region_name(scheme: str, anarci_type: schrodinger.application.bioluminate.anarci.anarci_adapter.AnarciType, numbering: tuple[schrodinger.application.bioluminate.anarci.anarci_adapter.ResInfo], ignore_gaps=False) → dict[str, tuple[schrodinger.application.bioluminate.anarci.anarci_adapter.ResInfo]]¶

Get a dictionary of start and end indices for each loop and non-loop region for the given antibody type, keyed by the region name (e.g. “HFR1”, “L3”, etc.)

Parameters

scheme – antibody numbering scheme to use for the region definitions
anarci_type – the type of antibody to get the region indices for
numbering – numbered residue info objects in sequential order
ignore_gaps – whether to skip ResInfo objects which represent gap characters in the sequence
scheme – antibody numbering scheme to use for the region definitions
anarci_type – the type of antibody to get the region indices for
numbering – numbered residue info objects in sequential order
ignore_gaps – whether to skip ResInfo objects which represent gap characters in the sequence

Returns

tuples of ResInfo objects for each region, keyed by the region name

schrodinger.application.bioluminate.anarci.annotate.get_region_lengths(scheme: str, anarci_type: schrodinger.application.bioluminate.anarci.anarci_adapter.AnarciType, numbering: tuple[schrodinger.application.bioluminate.anarci.anarci_adapter.ResInfo], ignore_gaps=False) → tuple[int, ...]¶

Get the length of each loop and non-loop region for the given numbered: residues

Parameters

scheme – antibody numbering scheme to use for the region definitions
numbering – numbered residue info objects in sequential order

Returns

the lengths of the regions before, inside, and in between the supplied ranges

schrodinger.application.bioluminate.anarci.annotate.get_annotations_from_results(results: schrodinger.application.bioluminate.anarci.anarci_adapter.AnarciResults) → list[schrodinger.application.bioluminate.anarci.annotate.AnnotationResult]¶

schrodinger.application.bioluminate.anarci.annotate.get_annotations(sequences: list[schrodinger.application.bioluminate.anarci.anarci_adapter.InputSequence]) → list[schrodinger.application.bioluminate.anarci.annotate.AnnotationResult]¶

Get the MSV annotations for the given sequences

Parameters: sequences – the sequences to annotate
Returns: a tuple containing the type of immunoglobulin and a list of annotated sequences

schrodinger.application.bioluminate.anarci.annotate.show_msv_annotation(annotated_domains: list[schrodinger.protein.sequence.ProteinSequence], ig_type_str: str)¶

Show an MSV window with the given annotated domains

Parameters

annotated_domains – the MSV-annotated domain objects to show
ig_type_str – the type of immunoglobulin to show (TCR or Antibody)

schrodinger.application.bioluminate.anarci.annotate.write_annotated_fasta(out_filename: str, dom_annotations: list[schrodinger.application.bioluminate.anarci.annotate.AnnotationResult])¶

schrodinger.application.bioluminate.anarci.annotate.get_out_filename(filename: str) → str¶: Get the output filename for the given input filename

schrodinger.application.bioluminate.anarci.annotate.parse_args(args)¶

schrodinger.application.bioluminate.anarci.annotate.main()¶